Gene Mthe_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0201 
Symbol 
ID4462767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp198736 
End bp200190 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content55% 
IMG OID639699208 
Productradical SAM domain-containing protein 
Protein accessionYP_842639 
Protein GI116753521 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGTTC TGCTTCTTGC CTCGCCGGTG GTGCAGCCGG ACTTCGACAG GATCGCGAGG 
ATACCCGACC TCGGCCTGGT CTCGCTTGCA GCCGCAATCG ATGATCTGTG TGATGTGCAT
GTCGCAGATC TCCACGGCAT AAAAGACCCG GATGAATTTG TGAGAAGGCA TGCGAACCGC
TACGATCTGA TAGGGCTCAC AGCGATGAGC TTTCAGTATG CAAGGGCCCT CGAGCTCGCC
AGGATAGCCA AGGATGCGGG GGCGGAGGTC GTGATCGGCG GATACCACCC CACGCTCTTC
TACAGGGAGA TAGGATCGAG CAGCGATTTG ATGCTCATAG ACTACATAGT CCGCGGTGAG
GGTGAGAGGA CCTTCAGGGA GCTGGTGCAG GCTCTGATCA GGGGCAGCCA GCTCGATGAT
GTTCCTGGTC TCTCCTACAG ATTTGGCTCG GAGATGAAGC ACAACCCTCC GAGAGCTCTG
CTCAATCCTG AGGAGATCGA GATGCCAAAC AGGGATGCTC GCCTGATCAG AGATGGTTTC
TATGCGTTTG ATGTCCCTGT TGACTCCGTG GAGACGAGCA GGGGGTGCAC ACAGGGATGC
AAGTTCTGCT CGATAAACAG CATGTACGGC AGGAGCTTCC GCAAGTTTGA GATCAAGAGG
GTCATTGAGG ACATACAGGA TGCGGAGGAG CACGGCGCAG GCTCGATATT CTTCCCCGAT
GACAACATCA CTCTTGATGT GAAAAGATTG GAGGCGATAT GCGATGCCAT CATAGACGCA
GGTCTAACGC ATCTCAGGTA CAAGACGCAG GCATCTGCAT CAGGCATCGC CTCCAGTGAG
AGGCTCGTTA AAAAGATGGG CGAGGCAGGT TTCGATGGTG TATTCCTTGG CGTTGAGAGC
GCCAGCAAGA GGAACCTCCA GTTTTTCGGA AAGGGCAGGA TGTCGGATCA TGCAGAGCGT
GCTGTGAGAT ACCTCCACGA TAACGACATC ATCGTATCCA CAGGGCTCAT CGGCGGAAAT
CCAGATGACA CTGCAGAGGA CATGTGGGCG AACTTCCACC TCGCAAGACA GCTAAAAGTC
GATTTTCCCA TATTTTATAT CAACACACCT TACCCCAAGA CCCCGATGCG CGAGGAACTG
GAGCGGATGG GGCTGATAAC GAACAACGAC TTCAGGTTCT ACGATGGTCT TCACGCCAAT
GTACGCACAA AGCACTTGAG CGCTGAGGAG GTGCAGTATA TCACGTGGGA GATGAACGCC
AGGTACTACA ACTGGGAGTG GTTCAAGTAC AACAAGGTCA AGAGGCTTTA TCCGAGGTGG
TTCGCGAGGG AGGCCCTGCG GCTCGCCCCG ATCTACGCGA AAAGAAAGCT CGAGCTCCTC
CTGAGGATAA AGAGCAGAAG AGATATGTTT CGGGAGGACC TGGCGCGGGG GGAGCTGTGC
AAGGGTGTGG CGTGA
 
Protein sequence
MRVLLLASPV VQPDFDRIAR IPDLGLVSLA AAIDDLCDVH VADLHGIKDP DEFVRRHANR 
YDLIGLTAMS FQYARALELA RIAKDAGAEV VIGGYHPTLF YREIGSSSDL MLIDYIVRGE
GERTFRELVQ ALIRGSQLDD VPGLSYRFGS EMKHNPPRAL LNPEEIEMPN RDARLIRDGF
YAFDVPVDSV ETSRGCTQGC KFCSINSMYG RSFRKFEIKR VIEDIQDAEE HGAGSIFFPD
DNITLDVKRL EAICDAIIDA GLTHLRYKTQ ASASGIASSE RLVKKMGEAG FDGVFLGVES
ASKRNLQFFG KGRMSDHAER AVRYLHDNDI IVSTGLIGGN PDDTAEDMWA NFHLARQLKV
DFPIFYINTP YPKTPMREEL ERMGLITNND FRFYDGLHAN VRTKHLSAEE VQYITWEMNA
RYYNWEWFKY NKVKRLYPRW FAREALRLAP IYAKRKLELL LRIKSRRDMF REDLARGELC
KGVA