Gene Mthe_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1038 
Symbol 
ID4463106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1120909 
End bp1122483 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content53% 
IMG OID639700056 
Productextracellular solute-binding protein 
Protein accessionYP_843462 
Protein GI116754344 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATCA TGCATAAAAA GATCGCGATA CTTGTTTTGA TTTTAGCGAC TACAGCAATA 
CTCCAATGCG CCTCTGCTGA GGAGACTGTG CTCCGGATAG GGACCCAGGA CGTGGTCAAA
TCCGCCAGTC TCCTAGGAGA TTCCAGCATG GGGGTATTCG CACACCTCTC CAATCCACCC
CTGATGAAGA TGAACCCTGA CGGCACTCTG AGCGGCCAGA CAGCGAAGAG TTACTCTGTC
TCAGAGGATG GGATGACCTG GAGATTCGAG ATAGACGATA ACCTCTACTG GAGCGATGGC
ACGAAGCTCA CCCCAGAGGA CGTGAAGTTC ACATTCGAGT ACATCTCAGA GAAGTACCCA
CCAGCCGGCT GGATCAAAAA TACGGTGGAT GAGATATCCG TGGACGGTAA TGCAGTCGTC
TTCAAACTAA ACAAGCCCTA CTCCCGTCTG AACCTCGAGT TCACCACATA CAACATACTG
CCGAAGCATG TCTGGGAGAA CATCGAGAAG CCGACCGAAT ACACCAATGA AGGTCCCATG
ATAGGTTGCG GTCCTTTCGT CATTGAAAAG ACAGACCTCG GAGCTGGCGT GATATACTTC
AAGAGAAACC CCTACTGGAA GGGCAAGGAG CCGAAGATCG ATTCGATAGA GCTGCACATG
TATGAGAATG CAGATGTTCT TTCAATGGCA CTTGAGAAGG GAGATGTTGA TGCATACTAC
AAGTACGCTG GCACATATCC GTACACAGGC ATTCAAAAGC TGAAGGATAC TGGAAACTTC
GATTTCGTGG AGAAGGACAA CGTCGGCCTG GTCTTCCTGG GATTCAATCT GAACAAGGAG
CCGATGTCAG ATCTCCAGTT CAGAGAAGCC ATGGCTTATG CCATAAACTA CACGGAGATC
CTCAAGATCG ATGCCCTCGG ATACGGCTCC GTTCCGAACC GCGGCTTCGT TCCGCCGAGC
ATGGCCTACT TCAAGGATAC ACCGGCTCTC AAGTATGATC CCAAAAAAGC CAGGGAGAGC
CTGGAGGAGG CAGGCTACAG GGACGGAAAC GGCAACGGGA TTCTGGAGGA TCCAAGCGGC
AATGACGTGA AGCTCCTCCT GCTGGCCCGC TCGAAGTTCC AGAGGGTCGC AGAGCTCGTG
AAGGAGTACA TCTCCGCGGT CGGCATTGAT GTGGAGCTCA AGGTCGTCGA CGATGCGACA
TGGATAAAGC TCAAGGATGA GTATCAGTAT GATCTCACCA TCACGAGGAC AACTCCGTGG
GGAATGATGA TGCACGCGAA CTGGGGGACC GGGTACTTCG ACTCACGCAG GACCGGTGAG
GGCGTGCTCC ACGTCCTGAG CGACCCCGAG TTCCAGAAGC TCTGCGATGA CATCCTCGCG
GCCACAAGTG ATGAGGAACT TGAGGAGTAC GCGTACATGC TCCAGGATTA CTATGCTGAG
AACCTGCCCG GGATCGCGCT GTACTGGAAC AAGGTCTTCA CGCCGTACAA CAAGAGATGG
ACCGGCTGGA ACTCAGATCC GCTGTACGGC ATCTACAACC TAGACAACTT CCTGAACGTA
GAGAGGGTGG CATGA
 
Protein sequence
MNIMHKKIAI LVLILATTAI LQCASAEETV LRIGTQDVVK SASLLGDSSM GVFAHLSNPP 
LMKMNPDGTL SGQTAKSYSV SEDGMTWRFE IDDNLYWSDG TKLTPEDVKF TFEYISEKYP
PAGWIKNTVD EISVDGNAVV FKLNKPYSRL NLEFTTYNIL PKHVWENIEK PTEYTNEGPM
IGCGPFVIEK TDLGAGVIYF KRNPYWKGKE PKIDSIELHM YENADVLSMA LEKGDVDAYY
KYAGTYPYTG IQKLKDTGNF DFVEKDNVGL VFLGFNLNKE PMSDLQFREA MAYAINYTEI
LKIDALGYGS VPNRGFVPPS MAYFKDTPAL KYDPKKARES LEEAGYRDGN GNGILEDPSG
NDVKLLLLAR SKFQRVAELV KEYISAVGID VELKVVDDAT WIKLKDEYQY DLTITRTTPW
GMMMHANWGT GYFDSRRTGE GVLHVLSDPE FQKLCDDILA ATSDEELEEY AYMLQDYYAE
NLPGIALYWN KVFTPYNKRW TGWNSDPLYG IYNLDNFLNV ERVA