Gene Mthe_0603 details


Gene Information

Locus tag: Mthe_0603
Symbol:
ID: 4461748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 626397
End bp: 627545
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 54%
IMG OID: 639699612
Product: amidohydrolase
Protein accession: YP_843034
Protein GI: 116753916
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
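
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_from_coords(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_from_gene_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_len = gene_length_from_coords(626397, 627545)
print(gene_len)                                   # 1149
print(protein_length_from_gene_length(gene_len))  # 382
```

Both results agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields.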


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCAGAA TCGATGAGGA TGTTAATAAG CTGAATGCCC CGCCCATCAG CAAATATTCC 
GGTCCGGGAT CTCTCGCCTC TGCATCAGAG GATGCTGGCT TTCAGGATGA GCTGATACTC
TCCGGGACGG TGATCGCAGG CGAGGATATG GTTGTCCTGG ATGGGTACGT CGTTATAGAG
AACGGCGTAA TAAAAGAGAT CGGTGAGGGC AAAGAGAGAG GCCACCTCGA GGGAATCATA
TGCCCTGCAT TTGTCAATGC GCACACACAT GTCGCGGACT CCATCGCGAA GGATCCGCCG
TTCATGGATC TCGCGGATCT TGTAGGGCCG GGCGGTCTCA AGCACAGAAT TCTTGAGAGC
GCGAGCGATG ATCTCCTCGT CGAATCGATG CGCTTCTCTG TTTCAGAGAT GCTAGATACC
GGGACCTGCG TCTTCGGCGA TTTCAGGGAG GGTGGTTCTC ACGGCGTGGA GCTACTCCTC
AGAGCTATCG AGGGTTTGGG GATCCAGAGC AGGATCTTCG GCAGGCCTCT GAGGGCGCCA
TGGGATATAC ATCCAGCATG CTGGGGAGTC GGGCTTAGCA GCACTAGGGA TTACGATCAG
AGTTTTGTGG ATGAGGTTGT GAGGATCGCA AGAAAAGAGG GAAAGCGCAT CGCGATACAC
GCTGGAGAGG CCGGCAGGGA TGATATAGAT GGCGCGCTCG CGCTCGACCC CGACATTCTC
ATACATCTCT CCAGAGCGGA GCGCTCCGAT CTCAGGGATG TTGCGGAATC TGGCGCCTCT
GTGGTCGTGT GCCCCAGATC GAACCTCTTC ACACGCGCCG GCCTTCCTGA TGTCTCATCG
ATGCTATCTC TGGGAATCAA TGTCTGTGTG GGTACGGATA ACATTATGAT AAATTCGACC
AACATCTTCA GGGAGATGGA GCTTCTCTCA AAAGCGCTTG TCAGAGACGA CAGACAGGTT
TTTATGATGT GCACGATAAA TGGAGCAAGA GCACTGGGAA TGGATGAGAG GCTCGGCTCC
GTCGATCCCG GAAAGGAGGC GCGGGTGATG GTGTTCGACA GGAACTCACG CAACATGAGG
GGATCGTTGA ATCCATTGGG AAGCATCGTG AGAAGGGCTG AGCCCTCTGA TATAATACTG
AGGATCTGA
 
Protein sequence
MRRIDEDVNK LNAPPISKYS GPGSLASASE DAGFQDELIL SGTVIAGEDM VVLDGYVVIE 
NGVIKEIGEG KERGHLEGII CPAFVNAHTH VADSIAKDPP FMDLADLVGP GGLKHRILES
ASDDLLVESM RFSVSEMLDT GTCVFGDFRE GGSHGVELLL RAIEGLGIQS RIFGRPLRAP
WDIHPACWGV GLSSTRDYDQ SFVDEVVRIA RKEGKRIAIH AGEAGRDDID GALALDPDIL
IHLSRAERSD LRDVAESGAS VVVCPRSNLF TRAGLPDVSS MLSLGINVCV GTDNIMINST
NIFREMELLS KALVRDDRQV FMMCTINGAR ALGMDERLGS VDPGKEARVM VFDRNSRNMR
GSLNPLGSIV RRAEPSDIIL RI
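
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be cleaned, translated, and summarized in Python. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Compact codon table: codons enumerated in TCAG order, one amino acid each.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq_block.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGCGCAGAA TCGATGAGGA TGTTAATAAG CTGAATGCCC CGCCCATCAG CAAATATTCC"
dna = clean(first_line)
print(len(dna), translate(dna))  # 60 MRRIDEDVNKLNAPPISKYS
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence through `clean` and `translate` would allow the same comparison for the whole record, and `gc_content` could be compared against the GC content field.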