Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1038 |
Symbol | |
ID | 4463106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1120909 |
End bp | 1122483 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639700056 |
Product | extracellular solute-binding protein |
Protein accession | YP_843462 |
Protein GI | 116754344 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATCA TGCATAAAAA GATCGCGATA CTTGTTTTGA TTTTAGCGAC TACAGCAATA CTCCAATGCG CCTCTGCTGA GGAGACTGTG CTCCGGATAG GGACCCAGGA CGTGGTCAAA TCCGCCAGTC TCCTAGGAGA TTCCAGCATG GGGGTATTCG CACACCTCTC CAATCCACCC CTGATGAAGA TGAACCCTGA CGGCACTCTG AGCGGCCAGA CAGCGAAGAG TTACTCTGTC TCAGAGGATG GGATGACCTG GAGATTCGAG ATAGACGATA ACCTCTACTG GAGCGATGGC ACGAAGCTCA CCCCAGAGGA CGTGAAGTTC ACATTCGAGT ACATCTCAGA GAAGTACCCA CCAGCCGGCT GGATCAAAAA TACGGTGGAT GAGATATCCG TGGACGGTAA TGCAGTCGTC TTCAAACTAA ACAAGCCCTA CTCCCGTCTG AACCTCGAGT TCACCACATA CAACATACTG CCGAAGCATG TCTGGGAGAA CATCGAGAAG CCGACCGAAT ACACCAATGA AGGTCCCATG ATAGGTTGCG GTCCTTTCGT CATTGAAAAG ACAGACCTCG GAGCTGGCGT GATATACTTC AAGAGAAACC CCTACTGGAA GGGCAAGGAG CCGAAGATCG ATTCGATAGA GCTGCACATG TATGAGAATG CAGATGTTCT TTCAATGGCA CTTGAGAAGG GAGATGTTGA TGCATACTAC AAGTACGCTG GCACATATCC GTACACAGGC ATTCAAAAGC TGAAGGATAC TGGAAACTTC GATTTCGTGG AGAAGGACAA CGTCGGCCTG GTCTTCCTGG GATTCAATCT GAACAAGGAG CCGATGTCAG ATCTCCAGTT CAGAGAAGCC ATGGCTTATG CCATAAACTA CACGGAGATC CTCAAGATCG ATGCCCTCGG ATACGGCTCC GTTCCGAACC GCGGCTTCGT TCCGCCGAGC ATGGCCTACT TCAAGGATAC ACCGGCTCTC AAGTATGATC CCAAAAAAGC CAGGGAGAGC CTGGAGGAGG CAGGCTACAG GGACGGAAAC GGCAACGGGA TTCTGGAGGA TCCAAGCGGC AATGACGTGA AGCTCCTCCT GCTGGCCCGC TCGAAGTTCC AGAGGGTCGC AGAGCTCGTG AAGGAGTACA TCTCCGCGGT CGGCATTGAT GTGGAGCTCA AGGTCGTCGA CGATGCGACA TGGATAAAGC TCAAGGATGA GTATCAGTAT GATCTCACCA TCACGAGGAC AACTCCGTGG GGAATGATGA TGCACGCGAA CTGGGGGACC GGGTACTTCG ACTCACGCAG GACCGGTGAG GGCGTGCTCC ACGTCCTGAG CGACCCCGAG TTCCAGAAGC TCTGCGATGA CATCCTCGCG GCCACAAGTG ATGAGGAACT TGAGGAGTAC GCGTACATGC TCCAGGATTA CTATGCTGAG AACCTGCCCG GGATCGCGCT GTACTGGAAC AAGGTCTTCA CGCCGTACAA CAAGAGATGG ACCGGCTGGA ACTCAGATCC GCTGTACGGC ATCTACAACC TAGACAACTT CCTGAACGTA GAGAGGGTGG CATGA
|
Protein sequence | MNIMHKKIAI LVLILATTAI LQCASAEETV LRIGTQDVVK SASLLGDSSM GVFAHLSNPP LMKMNPDGTL SGQTAKSYSV SEDGMTWRFE IDDNLYWSDG TKLTPEDVKF TFEYISEKYP PAGWIKNTVD EISVDGNAVV FKLNKPYSRL NLEFTTYNIL PKHVWENIEK PTEYTNEGPM IGCGPFVIEK TDLGAGVIYF KRNPYWKGKE PKIDSIELHM YENADVLSMA LEKGDVDAYY KYAGTYPYTG IQKLKDTGNF DFVEKDNVGL VFLGFNLNKE PMSDLQFREA MAYAINYTEI LKIDALGYGS VPNRGFVPPS MAYFKDTPAL KYDPKKARES LEEAGYRDGN GNGILEDPSG NDVKLLLLAR SKFQRVAELV KEYISAVGID VELKVVDDAT WIKLKDEYQY DLTITRTTPW GMMMHANWGT GYFDSRRTGE GVLHVLSDPE FQKLCDDILA ATSDEELEEY AYMLQDYYAE NLPGIALYWN KVFTPYNKRW TGWNSDPLYG IYNLDNFLNV ERVA
|
| |