Gene Mlg_1044 details

Gene Information

Locus tag: Mlg_1044
Symbol:
ID: 4270517
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1194069
End bp: 1195259
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 70%
IMG OID: 638125796
Product: major facilitator transporter
Protein accession: YP_741887
Protein GI: 114320204
COG category:
COG ID:
TIGRFAM ID:
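
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon (the record marks the gene as unspliced and not a pseudogene). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1194069
END_BP = 1195259
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_from_cds(cds_length: int) -> int:
    """Residue count for a complete CDS: total codons minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(START_BP, END_BP))            # 1191
print(protein_length_from_cds(GENE_LENGTH_BP))  # 396
```

Both results match the reported Gene Length (1191 bp) and Protein Length (396 aa).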


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.122672
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTAAGGA ACCGACTCGC GCGGCATCCG CTGGCCGTCA TTGTCATCGC CCAGCTGTTC 
GGCACCTCGC TTTGGTTCAG TGTCAACGGC GTGGGGCTGG CCCTCAGCGA AGCGGTGGCG
CTCAGCGACA CCGGCCTGGG CCTGCTCACC ATGGCGGTTC AGGCCGGCTT CATCACCGGC
ACCCTGATCA TTGCCACCAC GGGGCTCGCC GACCGGGTGC GGGCAAGCCG GCTGTTCGCC
ATTGCGGCGG TGACCGGCGC GGTGATCAAC GCCGGCTTCA TCCTGGTGGC GGGTGATCTC
CACCTGGGGG TGACAGCCCG CTTTCTCACC GGCCTGTGCC TGGCGGGGAT CTATCCCCTG
GGTATGAAAC TGGTAGTCAG TTGGACGCCC CGCTACGCCG GCGCGGCGCT GGGCTGGCTG
GTGGGCATGC TGACCCTGGG CACCGCCCTG CCTCACCTGC TGCGCGGGGC CACCTTCGAA
CTGCCCTGGC AGTGGCCGCT GCTGCTGGCC TCCGGCCTGG CGCTGGTGGC GGCCTGGCTC
ATCCATTCCC TCGGTGATGG CCCCGAGCTG CCCGGACCGG CGCCGGGTGG CCGGCCCTGG
GCCGGGCTAG CAGCCTTCGG CTGCGGCAAC TTCCGGGCTT CGGCCTTCGG CTACTTCGGG
CACTGCTGGG AGTTGTACGC CTTCTGGACC CTGGTGCCTT TCCTGGTCGG CCGCGAGATC
GAGCGCCTGG CACTGGGCCC GGGCTGGCTG CCCTGGTTGG CCTTCGCGGT CATCGCCCTG
GGCCTGCCCG GCTGCGTCTG GGGCGGGCGC ATCAGCCGCT GGCTGAGCAG CTTCAATGTG
GCCCGCCTGA CCCTGGCCAT CTCCGGCACC CTGTGCCTGC TCTATCCGCT GCTGGGGGAT
GCCCCACCCC TCTTCCTTCT GGCACTACTT GCCGTTTGGG GGCTGGCCGT GATCGCCGAC
TCGCCCCAGT TCTCCGCCCT GGCATCGGCG ACGGCGCCGC GCGCGCGCCT GGGCGCGGCG
CTGGCTATCA TGAACGCCAT CGGCTTTGCC ATGACCCTGC CCGCCATTGC GCTAACCACC
CATTTCTGGT CGCAGCAGGA GCTGGGGGTG ATGTGGTGGC TGCTCCCCGG ACCGGTGCTG
GGCCTGCTGG CCCTGCACCG TATGAACCGG CACGCACTGC GGGAAATCTG A
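
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. As a minimal sketch, the GC content field could be recomputed as shown below. The function names are hypothetical, and how the database rounds its reported percentage is not stated in this record.

```python
def normalize_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = normalize_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example using only the first printed line of the gene sequence;
# paste all sequence lines to evaluate the whole gene.
first_line = "ATGCTAAGGA ACCGACTCGC GCGGCATCCG CTGGCCGTCA TTGTCATCGC CCAGCTGTTC"
print(f"{gc_content(first_line):.0%}")
```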
 
Protein sequence
MLRNRLARHP LAVIVIAQLF GTSLWFSVNG VGLALSEAVA LSDTGLGLLT MAVQAGFITG 
TLIIATTGLA DRVRASRLFA IAAVTGAVIN AGFILVAGDL HLGVTARFLT GLCLAGIYPL
GMKLVVSWTP RYAGAALGWL VGMLTLGTAL PHLLRGATFE LPWQWPLLLA SGLALVAAWL
IHSLGDGPEL PGPAPGGRPW AGLAAFGCGN FRASAFGYFG HCWELYAFWT LVPFLVGREI
ERLALGPGWL PWLAFAVIAL GLPGCVWGGR ISRWLSSFNV ARLTLAISGT LCLLYPLLGD
APPLFLLALL AVWGLAVIAD SPQFSALASA TAPRARLGAA LAIMNAIGFA MTLPAIALTT
HFWSQQELGV MWWLLPGPVL GLLALHRMNR HALREI