Gene Mthe_0106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0106 
Symbol 
ID4462498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp98528 
End bp99637 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content57% 
IMG OID639699115 
Producthypothetical protein 
Protein accessionYP_842548 
Protein GI116753430 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR03282] putative methanogenesis marker 13 metalloprotein 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACAG AAGTTCAGGT TATTCACCCA CGCCCCAGCT CGATAGTGGC GGCCCTTTAC 
ACCCTGCGTG ATCTCGGCGT GGATGTGATC ATACTCCACG GCCCCAGCGG ATGCTGCTTC
AAGCACGCGA GGTTGCTGGA GGAGGATGGT GTCAGGGTTC TGACCACAGG CCTCGATGAG
AAGGGCTTCG TCTTCGGAGG GCAGGAGCAG CTTGTGAGGC TCCTGAGAAG AGCTGTGGAG
CTGTTCCATC CGAAGATCAT AGCTGTCGCT GGAACATGCA GCAGCATGAT CATAGGCGAG
GATCTGCACA AGGCTGTGGT GGAGGCGAAT ATCGGCGTCC CAGTGATAGA GGCTGAGGTT
CACGCCGGAT ACAGAGACAA CACAAAGGGC GTGATCATAA CACTTGAGGC GGCAAGGGAT
GCTGGAATAA TAGATGATCA AGAGCTCAAT CGACAGAAAA AGCTCCTGGA GAAGGCGACG
GAGATAGAGA GGCTTTATGG GGCAGCGAGC AGCGAGTACC TGCCTCCCGA GAGGGGCGAT
GTCAAGTTCG TCGTTGCCTC GAGGCTCCTG GATCTTCTCA TGGAGGGCAA GCGCGGGCTC
AACATACTCA ACGCGAAGAA GGAGACCGCG TACATGTTCG CGGACATCAA CCTCTCTCTC
ATAGAGGTCG CCAGGGCGCT GGGGTGTGAG GAGAAAATAA GAACTCTGGC GAACCTAGAC
TCTGTTGTTG GCCTGCCAAA GGTGAGGAGG GATGCAGGGA ACATAGCAGG CGCTCTCATG
GAGAGAGGGA TGAACTTCGA GATCACAGGA GGTCTTGATG AATATCCAGT CTCGGGAGAG
CGCATCGCAG ATATGATTGA GGGGGAGGCG TACGACTTCG CCGTGATCTC CGGCGTTCCG
CATGCGATAC CGATCGATGC GCTTGGCGGA ATGGAGCTCT TCTCGATAAC AAACGGCCCA
CGCCAGGTCA GGCCTCTGAG GGATATGGGC CATCACCATG TTGTGGTGGA GATAGATCTT
CACCCCAAGA CGATGGGCGT CAGCACCATC GTCGAGTCCG AGTTCGGCGC CACGCTGAGG
GAGATCGCGA GGAGAAGAGG GGTTCTGTGA
 
Protein sequence
MSTEVQVIHP RPSSIVAALY TLRDLGVDVI ILHGPSGCCF KHARLLEEDG VRVLTTGLDE 
KGFVFGGQEQ LVRLLRRAVE LFHPKIIAVA GTCSSMIIGE DLHKAVVEAN IGVPVIEAEV
HAGYRDNTKG VIITLEAARD AGIIDDQELN RQKKLLEKAT EIERLYGAAS SEYLPPERGD
VKFVVASRLL DLLMEGKRGL NILNAKKETA YMFADINLSL IEVARALGCE EKIRTLANLD
SVVGLPKVRR DAGNIAGALM ERGMNFEITG GLDEYPVSGE RIADMIEGEA YDFAVISGVP
HAIPIDALGG MELFSITNGP RQVRPLRDMG HHHVVVEIDL HPKTMGVSTI VESEFGATLR
EIARRRGVL