Gene Mlg_0309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0309 
Symbol 
ID4270769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp350247 
End bp351905 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content66% 
IMG OID638125035 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_741154 
Protein GI114319471 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.670789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGTAC AGGCGAAATC CGGATTGCTC AAGGGGATGA ACCCAACGGT CACCGTGGTG 
ACGGTGGTGG TCATGCTGGT CTCGGTCATC CTCGGCGCCG GCTGGACGGA GCAATCCGCC
GCTGCCATCA GCACGGCGCG TGAGACCCTG AACCCCTTCC TGGAATGGTA CTACGTGGTG
CTGGTGGCGG TGTTCTTCGT CTTTTGCCTG TGGCTGGGCG TGGGCCGGTA CAAGAATGTG
CGCCTGGGTG GCGATACCGA GCGGCCGGAG TTCACCACCT TCTCCTGGGT GGCCATGCTG
TTTGCGGCCG GAACGGGGGT GGGGATCATC TTCTGGAGCA TCGCCGAGCC CATCATGCAC
TTCGACTCCA ACCCCTTCGC TGCCGACGGC AGCCCCGAGT CGGCGGCAGT GGCTATGCGC
CTGACCTTCT TTCACTGGGG GCTCCCGGGG TGGGCGATCT TCGGGCTGGT GGGGCTGGTG
CTGGCCTATT TCAGTTTCCG TCACAGCCTG CCGTTGACCG TCCGCTCGGC CCTCTACCCC
TTCCTGGGGC ACCGTATCCA CGGCCCGATC GGTGATCTGG TGGACAGCCT GGCGGTGTTC
GGCACGGTCT TCGGCATCGC GACGACCCTG GGGTTGGGGG TGCAGCAGAT GAACACCGGA
CTGGGCCAGC TCTTCGGACT GGACACCACG CTCACCCTGC AACTGGGGGT CACCGCGGTG
ATCATGTTCA TCGCCACGGC GTCGGTGGTC TCCGGGGTGA AGCGCGGGGT TCGGCTGCTC
TCCGAGGCCA ACTTCTGGCT GAGTGTGGTC ATTGTGCTCT TTTTCCTGAT CTTCGGCCCC
ACCCACTACC TGTTTGCGCT GACCATCGAG TCCACCGGCG AGTACCTGCA GAACCTGCTG
GCCATGACCT TCTACACCCA CGCCAACAAG GACACCGGCT GGCAGGCGGA GTGGACAGTC
TTCTTCTGGG GCTGGTGGAT TGCCTGGTCG CCCTTCGTGG GCATGTTCAT CGCCCGCATC
TCCCGCGGGC GCACGGTGCG CGAGTTCATG CTCGGGGTGC TGCTGATGCC CACGGTGATC
ACCTTCGTCT GGATCGGGCT GTTCGGCGGC ACCGCCCTGC ACCAGGAGCT GTTCGGCGAC
GGTGGCGTGG TTGCGGCGGT GAGTCAGGAT GTCTCGGTGG CCATCTTCCA CACCATCGAG
GGGATGCAAT TGGGCCTTCT GGGGCAGGCG GCGGGGGTGC TGGTCACGGT CCTGATCGCC
ACCTATCTGA TCACCTCCGC CAATGCCGGC ACCCTGGTCA TCAACACGCT CCTGGCCAAT
GGGGACACCG ATCCGCCCAC CGGGCACCGG ATCCTCTGGG GCGTGGTGCT GGCCCTGCTG
ACGGCGGTGC TCCTGGTGGC CGGTGGGCTG GAGACGCTGC AGGCGGCGGT CATTCTGGCG
GGGCTGCCCT TTTCGTTAGT GATGGTGGCG ATGCTGCTCG GTTTGGTGAA GGCCCTGGAG
CAGGAGCGCT ACGCACCGCG TCCCGGTGCC CGTACGCTGG CGCCGACCGA ACCCTGGGTG
GGCATGGACC AGGCCGGCGA TACCCTGCAC GAGCCCCGCG GCACGGCCGG GGCGGCCGGG
GAGTACCAGC CCAGTGCCCG GACCGGCGCC GAGGGCTAG
 
Protein sequence
MAVQAKSGLL KGMNPTVTVV TVVVMLVSVI LGAGWTEQSA AAISTARETL NPFLEWYYVV 
LVAVFFVFCL WLGVGRYKNV RLGGDTERPE FTTFSWVAML FAAGTGVGII FWSIAEPIMH
FDSNPFAADG SPESAAVAMR LTFFHWGLPG WAIFGLVGLV LAYFSFRHSL PLTVRSALYP
FLGHRIHGPI GDLVDSLAVF GTVFGIATTL GLGVQQMNTG LGQLFGLDTT LTLQLGVTAV
IMFIATASVV SGVKRGVRLL SEANFWLSVV IVLFFLIFGP THYLFALTIE STGEYLQNLL
AMTFYTHANK DTGWQAEWTV FFWGWWIAWS PFVGMFIARI SRGRTVREFM LGVLLMPTVI
TFVWIGLFGG TALHQELFGD GGVVAAVSQD VSVAIFHTIE GMQLGLLGQA AGVLVTVLIA
TYLITSANAG TLVINTLLAN GDTDPPTGHR ILWGVVLALL TAVLLVAGGL ETLQAAVILA
GLPFSLVMVA MLLGLVKALE QERYAPRPGA RTLAPTEPWV GMDQAGDTLH EPRGTAGAAG
EYQPSARTGA EG