Gene Mlg_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0140 
Symbol 
ID4269833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp161630 
End bp163204 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content71% 
IMG OID638124864 
Producteight transmembrane protein EpsH 
Protein accessionYP_740985 
Protein GI114319302 
COG category 
COG ID 
TIGRFAM ID[TIGR02602] eight transmembrane protein EpsH (proposed exosortase)
[TIGR02914] EpsI family protein
[TIGR03109] exosortase 1 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTTG AGTGGTCGCA GGGCGCCGGG GCGGTGCACC CGGCACCGGG CTGGCCCCGG 
GCGCTGACCG CCATGGCGGT AGCGCTGACC GCGCTGCTGG TGGCGTTCTT CCCCACCTTC
ACCAGCATGG TGGAGACCTG GCAGCGCTCG GAGACCTTCG CCCACGGGTT TCTCATCGTC
CCCATCGTGG CCTTCCTGGT CTTCCGCCTG CGCCATGAGC TGGCGTCTCT GCAGCCCCGG
CCCTGCCTGC TGGCCGTCAT CCCGCTGGTG CTGATCACCC TGATGTGGGT GATGGGCGAG
CTGGTGGATG TCATCTCGGT GCGCCAGTTC GCCGCCGTGC TGATGGTCCC GGCCCTGGTC
TGGCTGGTGC TCGGCGGCGA GATCACCCGC CGGCTGCAGT TCCCGCTGGC CTACCTGCTG
TTCGCCGTAC CCTTCGGCGA ATTCATGGTG GAGCCGATGA TGATCTGGAC CGCCGACTTC
ACGGTGGGCG CCATCCGCCT CACCGGGGTC CCGGTCTACC GGGACGGCCT GTTCTTCGAC
CTGCCCACCG GCCGCTGGTC GGTGGTGGCC GCCTGCAGCG GCGTGCGCTA CCTGATCGCC
TCGGTGGCCC TGGGCACCCT GTTCGCCTAT CTCATGTACC GAAGCTGGAC CCGGCGGCTG
ATCTTCGTGG CCATCGCCAT CCTCGTGCCC ATCGTTGCCA ACTGGCTGCG CGCCTACGGC
ATCGTGATGA TCGGCCACCT GAGCGACATG CGCTTGGCCG CCGGGGTGGA CCACCTACTC
TATGGCTGGG TCTTCTTCGG GGTGGTCATC CTGTTGATGT TCCTGATCGG GGCCCTCTGG
CGGGAAGACC ACCTGTCCAC AGGCCAGAAC AGCGGGGACA AGACGCCAGG CACCGGGGTA
GCCGAGCCCG CGCCCCGCAC CGGGCGCCTG GTCACCGCCA CCGCCCTGGC GCTGCTGCTC
ACTGTCAGCG GCCCACTCTA TGCCGGCTGG ATGAACCACC GGGACCTGGG CGAAGTGCAG
GGCCTGGGCA GTGGCCCGCT GCTGGTCGAG GGCTGGCAGG CCGCGAGCGA GGCCGACGCC
GAGCCCTGGA CCCCCGGCTA CCGTAACGCC CGCGCCAGCC GCGGCGGCCT GGTCGTCCAG
GAGGAGGCCG GGCACGCCGT TGGCCTCCAC ATCGACTACT ATCGCGCCCA GCACCGCCAC
GGCAACATGG TCGGCTGGGC CAACACCCTG GCCGGCCGTC ACCGCGACGA CTGGCGACAG
CACAGCGGCG GCCGGACCGC CGTGCCCGGC ATCGACCGGC AGGGCGACCG GGTGCTGCTC
TCCGGCCCGG ATGGCCGCCG GGTGGTGGCC TGGCGCTGGT ACTGGGTGGG TGGCCGGCTG
ACCACCAGCG GCCACGAGGT CAAGGCACGA GAGGCCCTCA GCCGCCTGCT GGGCGGGCGG
GACGACGCCG CCCTGGTGGT CCTCTACGCC GACTACCGCA ACGACCCCGC GGAGGCCGAG
GCCGCATTGG CGCAGTACGC GGCGCAGGCC CTGCCCCAGG CGCTCCGCCT GCTCGACGGC
GTGGCGGGAC CATGA
 
Protein sequence
MAVEWSQGAG AVHPAPGWPR ALTAMAVALT ALLVAFFPTF TSMVETWQRS ETFAHGFLIV 
PIVAFLVFRL RHELASLQPR PCLLAVIPLV LITLMWVMGE LVDVISVRQF AAVLMVPALV
WLVLGGEITR RLQFPLAYLL FAVPFGEFMV EPMMIWTADF TVGAIRLTGV PVYRDGLFFD
LPTGRWSVVA ACSGVRYLIA SVALGTLFAY LMYRSWTRRL IFVAIAILVP IVANWLRAYG
IVMIGHLSDM RLAAGVDHLL YGWVFFGVVI LLMFLIGALW REDHLSTGQN SGDKTPGTGV
AEPAPRTGRL VTATALALLL TVSGPLYAGW MNHRDLGEVQ GLGSGPLLVE GWQAASEADA
EPWTPGYRNA RASRGGLVVQ EEAGHAVGLH IDYYRAQHRH GNMVGWANTL AGRHRDDWRQ
HSGGRTAVPG IDRQGDRVLL SGPDGRRVVA WRWYWVGGRL TTSGHEVKAR EALSRLLGGR
DDAALVVLYA DYRNDPAEAE AALAQYAAQA LPQALRLLDG VAGP