Gene Mpal_1355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1355 
Symbol 
ID7269960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1399446 
End bp1400648 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content64% 
IMG OID643569989 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002466411 
Protein GI219851979 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.358064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGATC TCCGGATCAC CCCCCTGCTG GCTCTTCAGG TCGTGATCTT CACCGATACC 
TTCTCATTCG GGGTGGTACT CCCCTTCATG GTCTTTCTGG TGAGCGGTTT CGGGGGGGAC
GCCCTGGCCT TCGGAGCCAT CGGCGCAATC TATCCGTTCT TCCAGACGAT CAGCGCTCCG
GTCCTTGGTC GCTGGTCTGA CAAGGTGGGG AAGAGCCGGG TGCTGCTGAT CTCGCAGGCC
GGGACTGCTC TCTCCTGGTT GCTCTTTCTG GGCATCTTCT TCCTCCCGTC CACCGAGGTC
TTCGGACTGC CGCTTCCGAT CATCCTGATG CTTGGTGCCC GGGCAGTCGA TGGCATCACC
GGCGGCAATG TCGGCGTCGC AAACGCCTGC CTCGGGGACC TGATCCCGGA GGCCGAACGG
ACAATGGCAT TCGGACGGTT GAACGTCGCC TCGAACACCG GCTACATCCT CGGGCCGGCG
ATCGCCGGGG TGGTCGCCTG GTCAGGCGGG TCGCTGATGG TCCCGGTGCT GCTTGCCCTG
ATGGTCTCTG CTGCTGGTGT CGTGATGATC CGGGTCGGGT TGCCGGATAT CTGCAGAACC
CGGGCGACTG GAGAGGCCGG CACTGGCCTT TTCAATCTCT TCAGGCGCCG CGGGGTCAGA
TCGCTGCTGG GGTTGACACT GCTGTACTTC CTGGCCTTCA ACATCTTCTA CACCGCGTTT
CCGGTGGACG CAGCCGGCCG GCTGGGTTGG TCGGCGGCGA CCCTCGGCCT CTACTTTGCC
TTCCTCTCCG GGGTGATGGT CGTGGTACAG GGGCCGGTAC TGAAACGTGC TGCAGAACGG
TGGAGCCCCC GGACCCTGTT GATCGGCGGG GCACTGGTGC TGGCAGTCGG GTTCGCCCTG
CTCTGGTTTG GGCAGACCAC GACCGGGATA TCGGCTGCGA TCCTCTTTTC CCTCGGCAAC
GGTCTGGCCT GGCCCTCGTT TCTGACATTA CTCTCGCTCA GTGTCGGGCC CGACGAGCAG
GGGGCCGTCC AGGGGGCAGC CAGTGGGGTC GGGTCGTTCG CGAGTATCAT CGGGTTGCTG
CTCGGAGGGG TCCTGTATCT TGCGATCGGT CCGTCCACCT TCCTCTTCTG CACAGGAATT
GTCCTGCTGA CGGCCGGCCT CTTCATCGCC GGCCACTCAA CCTCAGAGGA GAGAGTCCAG
TGA
 
Protein sequence
MTDLRITPLL ALQVVIFTDT FSFGVVLPFM VFLVSGFGGD ALAFGAIGAI YPFFQTISAP 
VLGRWSDKVG KSRVLLISQA GTALSWLLFL GIFFLPSTEV FGLPLPIILM LGARAVDGIT
GGNVGVANAC LGDLIPEAER TMAFGRLNVA SNTGYILGPA IAGVVAWSGG SLMVPVLLAL
MVSAAGVVMI RVGLPDICRT RATGEAGTGL FNLFRRRGVR SLLGLTLLYF LAFNIFYTAF
PVDAAGRLGW SAATLGLYFA FLSGVMVVVQ GPVLKRAAER WSPRTLLIGG ALVLAVGFAL
LWFGQTTTGI SAAILFSLGN GLAWPSFLTL LSLSVGPDEQ GAVQGAASGV GSFASIIGLL
LGGVLYLAIG PSTFLFCTGI VLLTAGLFIA GHSTSEERVQ