Gene Mlg_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0203 
Symbol 
ID4269649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp235946 
End bp237244 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content67% 
IMG OID638124927 
Productmajor facilitator transporter 
Protein accessionYP_741048 
Protein GI114319365 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACTGCTT ACGACCACCG CCACTTCCAG CAGGTCCGCT GGACCATCTA TATCATCCTG 
ATCCTGGCCT ACATGCTGGT GTTTTTTCAC CGTATGGCCC CGGGATCAGT CTCCGGTGAC
CTTACCGAGG CCTTTGGCAC CAGCGCCGCC GCCCTCGGCT CGCTGGCGGC GATGTACTAC
TACATCTACA CCGCCATGCA GATCCCCTCC GGGGTACTGG CGGATACCCT CGGGTCACGC
TGGGCGGTGT TGGCGGGCAG CCTGGTGGCG GGCGTGGGCT CGATCCTGTT CGGCCTGGCG
GACACCTTCG CCATGGCCAG CGTGGGCCGC TTCCTGGTGG GGCTGGGGGT CTCCACCGTC
TTCGTCGGGC TGATGAAGAG CAACAGCGTC TGGTTCAGCG AACGCCAGTA CGGCTCCATC
AGCGGTCTGA CCCTGTTGCT GGGCAATGCC GGGGCCATCG CCGCCACCGG GCCGCTGGCC
CTGGTGCTGG ACCACTACGA CTGGCGTACG GTGTTCGTCG CCCTGGGGGT ATTCTCCATC
GTGCTGGCGG TGGCCACCTG GTTGAAGGTC TATAACAAGC CCGAGGATGC GGGCTTCCCA
TCGGTGCGGG AGATGGAGGG CAAGACCGCC CACGCCGCCC GCGACCAGCA CTGGCTGAAG
GACCTGCGGG CGGTGTTCGC CAACCGCAAG CTTTGGCCCG GGGCCGTGTA CGATTTCGGC
ATCACCGGCA GTTTCTTCGG CTTCGTGGGG CTATGGGCGG TACCGCTGCT GCGCGACCTG
CACGAGCTGG ACCGCAGCGC CGCCTCGCTC TATCCGACCC TGGCCACGGT CGCCTTCGCC
ATCGGTTGCC TGGTGGCGGG GATGTACTCG GACCGGGTGG GCCGGCGCCG GCCGGTACTG
ATCGGCGGGG CGGTGGTCTA TTTCGCCGTC TGCCTGGGGT TGTGGCTGCT GCCCTGGGGC
CCCGGGCCGC TGGCCATGGC GCTGTTCGTG GCGCTGGGGC TGTCCGCCGG TTGCTTCGTG
GTGGCCTACG CCCACGCCAA GGAGGTGACC GCGCCGGCCC TGGCCGGCAT GGGCATCGCC
TTCGTCAACA CCGGGCTATT CCTGGGCGCG GCCTTGTTTC AGCCCATCTT CGGCTGGATC
ATGGACCTGT TACTGCTCGC CGCCGGGCGC AGCGACTACG GCGTCGCGGA ATACCAGGGG
GGGCTGGTGC TGCTGTGCGC CTTTGCCGCG CTGGCCCTGG CCGCCTCCCT GCTGCTCCAC
GAGACCCATT GCCGTAACAT ACATGTGAAA CACGACTGA
 
Protein sequence
MTAYDHRHFQ QVRWTIYIIL ILAYMLVFFH RMAPGSVSGD LTEAFGTSAA ALGSLAAMYY 
YIYTAMQIPS GVLADTLGSR WAVLAGSLVA GVGSILFGLA DTFAMASVGR FLVGLGVSTV
FVGLMKSNSV WFSERQYGSI SGLTLLLGNA GAIAATGPLA LVLDHYDWRT VFVALGVFSI
VLAVATWLKV YNKPEDAGFP SVREMEGKTA HAARDQHWLK DLRAVFANRK LWPGAVYDFG
ITGSFFGFVG LWAVPLLRDL HELDRSAASL YPTLATVAFA IGCLVAGMYS DRVGRRRPVL
IGGAVVYFAV CLGLWLLPWG PGPLAMALFV ALGLSAGCFV VAYAHAKEVT APALAGMGIA
FVNTGLFLGA ALFQPIFGWI MDLLLLAAGR SDYGVAEYQG GLVLLCAFAA LALAASLLLH
ETHCRNIHVK HD