Gene Information |
Locus tag | Mlg_0140 |
Symbol | |
ID | 4269833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | - |
Start bp | 161630 |
End bp | 163204 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638124864 |
Product | eight transmembrane protein EpsH |
Protein accession | YP_740985 |
Protein GI | 114319302 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02602] eight transmembrane protein EpsH (proposed exosortase); [TIGR02914] EpsI family protein; [TIGR03109] exosortase 1 |
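The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the listed values of 1575 bp and 524 aa. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 161630
end_bp = 163204

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1575
print(protein_length)  # 524
```

Because the strand is "-", the gene lies on the reverse strand of replicon NC_008340. The coordinates still describe the span on the replicon regardless of strand.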
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGTTG AGTGGTCGCA GGGCGCCGGG GCGGTGCACC CGGCACCGGG CTGGCCCCGG GCGCTGACCG CCATGGCGGT AGCGCTGACC GCGCTGCTGG TGGCGTTCTT CCCCACCTTC ACCAGCATGG TGGAGACCTG GCAGCGCTCG GAGACCTTCG CCCACGGGTT TCTCATCGTC CCCATCGTGG CCTTCCTGGT CTTCCGCCTG CGCCATGAGC TGGCGTCTCT GCAGCCCCGG CCCTGCCTGC TGGCCGTCAT CCCGCTGGTG CTGATCACCC TGATGTGGGT GATGGGCGAG CTGGTGGATG TCATCTCGGT GCGCCAGTTC GCCGCCGTGC TGATGGTCCC GGCCCTGGTC TGGCTGGTGC TCGGCGGCGA GATCACCCGC CGGCTGCAGT TCCCGCTGGC CTACCTGCTG TTCGCCGTAC CCTTCGGCGA ATTCATGGTG GAGCCGATGA TGATCTGGAC CGCCGACTTC ACGGTGGGCG CCATCCGCCT CACCGGGGTC CCGGTCTACC GGGACGGCCT GTTCTTCGAC CTGCCCACCG GCCGCTGGTC GGTGGTGGCC GCCTGCAGCG GCGTGCGCTA CCTGATCGCC TCGGTGGCCC TGGGCACCCT GTTCGCCTAT CTCATGTACC GAAGCTGGAC CCGGCGGCTG ATCTTCGTGG CCATCGCCAT CCTCGTGCCC ATCGTTGCCA ACTGGCTGCG CGCCTACGGC ATCGTGATGA TCGGCCACCT GAGCGACATG CGCTTGGCCG CCGGGGTGGA CCACCTACTC TATGGCTGGG TCTTCTTCGG GGTGGTCATC CTGTTGATGT TCCTGATCGG GGCCCTCTGG CGGGAAGACC ACCTGTCCAC AGGCCAGAAC AGCGGGGACA AGACGCCAGG CACCGGGGTA GCCGAGCCCG CGCCCCGCAC CGGGCGCCTG GTCACCGCCA CCGCCCTGGC GCTGCTGCTC ACTGTCAGCG GCCCACTCTA TGCCGGCTGG ATGAACCACC GGGACCTGGG CGAAGTGCAG GGCCTGGGCA GTGGCCCGCT GCTGGTCGAG GGCTGGCAGG CCGCGAGCGA GGCCGACGCC GAGCCCTGGA CCCCCGGCTA CCGTAACGCC CGCGCCAGCC GCGGCGGCCT GGTCGTCCAG GAGGAGGCCG GGCACGCCGT TGGCCTCCAC ATCGACTACT ATCGCGCCCA GCACCGCCAC GGCAACATGG TCGGCTGGGC CAACACCCTG GCCGGCCGTC ACCGCGACGA CTGGCGACAG CACAGCGGCG GCCGGACCGC CGTGCCCGGC ATCGACCGGC AGGGCGACCG GGTGCTGCTC TCCGGCCCGG ATGGCCGCCG GGTGGTGGCC TGGCGCTGGT ACTGGGTGGG TGGCCGGCTG ACCACCAGCG GCCACGAGGT CAAGGCACGA GAGGCCCTCA GCCGCCTGCT GGGCGGGCGG GACGACGCCG CCCTGGTGGT CCTCTACGCC GACTACCGCA ACGACCCCGC GGAGGCCGAG GCCGCATTGG CGCAGTACGC GGCGCAGGCC CTGCCCCAGG CGCTCCGCCT GCTCGACGGC GTGGCGGGAC CATGA
Protein sequence | MAVEWSQGAG AVHPAPGWPR ALTAMAVALT ALLVAFFPTF TSMVETWQRS ETFAHGFLIV PIVAFLVFRL RHELASLQPR PCLLAVIPLV LITLMWVMGE LVDVISVRQF AAVLMVPALV WLVLGGEITR RLQFPLAYLL FAVPFGEFMV EPMMIWTADF TVGAIRLTGV PVYRDGLFFD LPTGRWSVVA ACSGVRYLIA SVALGTLFAY LMYRSWTRRL IFVAIAILVP IVANWLRAYG IVMIGHLSDM RLAAGVDHLL YGWVFFGVVI LLMFLIGALW REDHLSTGQN SGDKTPGTGV AEPAPRTGRL VTATALALLL TVSGPLYAGW MNHRDLGEVQ GLGSGPLLVE GWQAASEADA EPWTPGYRNA RASRGGLVVQ EEAGHAVGLH IDYYRAQHRH GNMVGWANTL AGRHRDDWRQ HSGGRTAVPG IDRQGDRVLL SGPDGRRVVA WRWYWVGGRL TTSGHEVKAR EALSRLLGGR DDAALVVLYA DYRNDPAEAE AALAQYAAQA LPQALRLLDG VAGP
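The relationship between the gene sequence and the protein sequence above can be checked with a short script. The following is a hedged sketch in Python, not the database's own method. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). For brevity only the first 30 nucleotides of the gene sequence are used, so the GC fraction computed here describes that prefix and not the 71% reported for the full gene.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid
# assignments (assumed here to be the ones used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-nucleotide blocks of the gene sequence in this record.
prefix = "ATGGCCGTTG AGTGGTCGCA GGGCGCCGGG"
print(translate(prefix))  # MAVEWSQGAG
print(gc_fraction(prefix))
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions would be the natural way to compare against the listed protein length and GC content.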