Gene Mthe_0991 details

Gene Information

Locus tag: Mthe_0991
Symbol:
ID: 4462867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1075697
End bp: 1076938
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 54%
IMG OID: 639700009
Product: amidohydrolase
Protein accession: YP_843416
Protein GI: 116754298
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
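
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1075697
END_BP = 1076938


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1242, matching "Gene Length"
print(protein_length(length_bp))  # 413, matching "Protein Length"
```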


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAATTC GCAGTGCATC CATCATTAGA AACGGGTCTC TGCTGAAGAA CATCGACATT 
CTCATCGAGG GGAACCGCAT CTCTGAGGTT GGAAGAGATT TGAGGCCGAA TGATGATGAG
ATCATAGATG CAAGAAACAT GCTCGCAGTT CCGGGTCTTG TGAACAGCCA CACACACCTG
GCCATGACGC TTCTCAGGGG ATACGCAGAT GATATGGAGC TCATTCCCTG GCTTCAGGAG
AAGATATGGC CGCTGGAGGC GAGGTTGAAG CCATCTGATG TTCGTGCTGG AGTGAAGCTG
GGCTGCCTGG AGCTGATAAG ATTCGGCGTG ACGTGCTACA ATGACATGTA CTACTTCATG
GATGAGACTG CTGCTGCCAC CAGGGAGATG GGGATCAGGG GTGTGCTCTC AGGCGTGCTA
TTCGATATGC GGCCGGAGTT CATCAATGAT GTCGAGCCAT TCATAAAAAA ATGGAGAGAT
GACGATCTCA TAAAGCCGGC TGTGGGCCCG CATGCTGTCT ACACGTGTTC AGAGGAGACG
CTTCTCAGGG CAAAGGATAT CGCGGAGAGG TATGATGTCA AGATCCACAT CCACCTCTCA
GAGACCAGGG ATGAGGTCGA TACATTTGTG AACCAGCGGC ACATGAGCCC TGTGGAGTAT
CTTGAAAACC TTGGGTTTCT CAGCGAGAGA GTGGTGGCAG CGCACTGCGT GTGGCTGACG
CCGAGGGACA TCAGGATCCT TGCGGAGAGG CATGTGAACG TCGCCCACTG CCCGATAAGC
AATCTCAAGC TCGCATCAGG CATCGCTCCG GTCGCGACCC TCATCGAGCA TGGGGTGAAC
GTCTGTCTTG GAACGGATGG AGCTTCGAGC AACAACAACC TGGACATCTT CGAGGAGATG
AAGGTTGCAG CCGTGGTCCA GAAGTGCTCT GTCGGGCGTT CAGCGATACT TCCGGCTGAT
GCTGTCTGGC GGATGGCCAC AGAGAATGCA TACAAGGCAT TCTCCCTTGA TATGGGTATA
AGGAGAGGGG CCCTCGCGGA TCTCGCCCTG ATCAACATGA GAAGACCATG GTTCATACCT
GTGACATCGA TGATCTCACA TCTGGTCTAC AGCATGTCGG GAGAGGCGAG CTACACGATA
TGCAACGGAA GGGTGCTCAT GAGGGATGGC GTGATCGAGG GTGAAGCTAA GATACTTGAT
GAAGCCCAGC GCTGCTACGA GAGGCTTATC TCGGAAGAGT AG
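
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input, so the printed percentage describes that line and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGCTAATTC GCAGTGCATC CATCATTAGA AACGGGTCTC TGCTGAAGAA CATCGACATT"

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_percent(seq):.1f}% GC in the sample line")
```

Passing the full 1242 bp sequence to the same two functions would give a value comparable to the "GC content" field, assuming that field was computed over the gene sequence alone.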
 
Protein sequence
MLIRSASIIR NGSLLKNIDI LIEGNRISEV GRDLRPNDDE IIDARNMLAV PGLVNSHTHL 
AMTLLRGYAD DMELIPWLQE KIWPLEARLK PSDVRAGVKL GCLELIRFGV TCYNDMYYFM
DETAAATREM GIRGVLSGVL FDMRPEFIND VEPFIKKWRD DDLIKPAVGP HAVYTCSEET
LLRAKDIAER YDVKIHIHLS ETRDEVDTFV NQRHMSPVEY LENLGFLSER VVAAHCVWLT
PRDIRILAER HVNVAHCPIS NLKLASGIAP VATLIEHGVN VCLGTDGASS NNNLDIFEEM
KVAAVVQKCS VGRSAILPAD AVWRMATENA YKAFSLDMGI RRGALADLAL INMRRPWFIP
VTSMISHLVY SMSGEASYTI CNGRVLMRDG VIEGEAKILD EAQRCYERLI SEE