Gene Mthe_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1097 
Symbol 
ID4463129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1185945 
End bp1187645 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content56% 
IMG OID639700114 
Productamidohydrolase 3 
Protein accessionYP_843520 
Protein GI116754402 
COG category[C] Energy production and conversion 
COG ID[COG1229] Formylmethanofuran dehydrogenase subunit A 
TIGRFAM ID[TIGR03121] formylmethanofuran dehydrogenase subunit A 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATTA AAGGGGGCAT CGTCTACGAT CCTGCCAACG GGATCTTCGG CGAGGAGATG 
GATATATGCA TAGAGAACGG CCGGATAGCT GAGGACGCCG GCGGGGAGGT GATTGATGCC
AGAGGTCTTC TGGTTATGCC TGGCGGTGTG GATGCGCACT CGCATATCGC AGGGCCGAAG
CTGAACACTG GAAGGATAAT GCGCCCCGAC GACTCCAGGA TGGGAACTGA GCCGAGGACG
AGGGTCTGCA GGCCGTCGAC GGGATATACG GTGCCTAACT GCTACGCCAT AGGCTACAGG
TACGCCAGGC TTGGATACAC CACCGCGTTC GAGGCCGCGA CGCCGATAAT AGAGGCCCGC
CACACGCATG AGGAGCTGGA GGAGATCCCA ATCGTTGACA AGGGCGCCCT TACGCTCTTC
GGAAGCAACT GGACCGTGAT GGAGTGCGTG CGCGAGAACG ATATGGATAT GCTCGCTGCA
TACGTTGCGT GGGGCTTGAG GGCCGCGCGC GGATACGGCG TTAAGATCGT GAATCCGGGC
GGCGGCGAGG CATGGGGTTT CGGATCGAAC GTGAAGAGCG TGCACGATCC GGTGCCGCAC
TTCGATGTAA CTCCAGCACA GATTATACGA GCGCTTGCAG AGGTCAACGA GCGTCTCAGA
CTTCCACACT CGATACACCT GCACTTCAAC AATCTGGGCA GGCCTGGGAA TTACACCACA
GCGATCGAGA CCCTTGAGCT GCTGAAGGAC ATAAAGCCGA GCAGGATGCG GCAGGTTGTG
CATGTCGCGC ACATGCAGTT CTCAGCGTAT GGCGGCACAG GCTGGAAGGA CTTCGAGTCG
AAGGCATCTG CGATAGCGGA GTACTTCAAC CAAACAAATC ATGCTACGAT GGATCTCGGC
CAGATAATAT TCGGCCCTGC CACAACAATG ACCGCTGATG CGCCGCTGGA GTATGCGAAC
GCCAGGATCG GGCACCAGAA ATGGTCGAAC CACGATATAG AGCTAGAGGA GTCCAGCGGT
GTGGTGCCGT GGGTTTACAC GAGAAAGATG CCTGTCAACG CGGTCCAGTG GGCGATAGGG
CTGGAGCTAG CGCTTCTCAC AAAAGATCCT TGGAAGGTGG TCATGACAAC AGACCATCCG
AACGGCGGGC CGTTCGTCAA CTACCCTGAG ATAATATCGC TGCTGATGAG CAGGGAGAAG
CGCGAGGAGG AGATGAAGAC GCTGCACGAG GTGGTGAGAT CAAGAAGCAC CCTACCATCG
ATTGAGAGAG AGATGGATTG GTCAGAGATC GTGATAATGA CGAGAGCTGC ACCTGCGAGA
ATTCTTGGCC TGGAGGACAA GGGGCATCTT GGCATCGGCG CTGATGGAGA TGTATCGATA
TACAACATCA GACCAGATGA GATCGACCCG TCGAAGGATC ATGCAGTGGT GAAGGCAGGC
ATGTCCAGGG CGAAGTACAC GATAAAGGGC GGTGCTGTTG TAGTAAGGGA CGGCGAGATA
GTCGCAGCGC CACAGGGAAG GACATACTGG GTGGATGCCG CTGTTCCTGA GAGCGATATG
GATCGCATGC TGTCTTCTCT CAAAGAGAAG TTCGAGAGGT ACTACAGCAT AAGGATGTCG
AACTACATGG TGCAGGACGC GTATGTACCG AATCCGGTTG TGGTGAATGC TGGAATGCAG
CCTCTGAAAG AGGTGATCTG A
 
Protein sequence
MIIKGGIVYD PANGIFGEEM DICIENGRIA EDAGGEVIDA RGLLVMPGGV DAHSHIAGPK 
LNTGRIMRPD DSRMGTEPRT RVCRPSTGYT VPNCYAIGYR YARLGYTTAF EAATPIIEAR
HTHEELEEIP IVDKGALTLF GSNWTVMECV RENDMDMLAA YVAWGLRAAR GYGVKIVNPG
GGEAWGFGSN VKSVHDPVPH FDVTPAQIIR ALAEVNERLR LPHSIHLHFN NLGRPGNYTT
AIETLELLKD IKPSRMRQVV HVAHMQFSAY GGTGWKDFES KASAIAEYFN QTNHATMDLG
QIIFGPATTM TADAPLEYAN ARIGHQKWSN HDIELEESSG VVPWVYTRKM PVNAVQWAIG
LELALLTKDP WKVVMTTDHP NGGPFVNYPE IISLLMSREK REEEMKTLHE VVRSRSTLPS
IEREMDWSEI VIMTRAAPAR ILGLEDKGHL GIGADGDVSI YNIRPDEIDP SKDHAVVKAG
MSRAKYTIKG GAVVVRDGEI VAAPQGRTYW VDAAVPESDM DRMLSSLKEK FERYYSIRMS
NYMVQDAYVP NPVVVNAGMQ PLKEVI