Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0603 |
Symbol | |
ID | 4461748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 626397 |
End bp | 627545 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639699612 |
Product | amidohydrolase |
Protein accession | YP_843034 |
Protein GI | 116753916 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAGAA TCGATGAGGA TGTTAATAAG CTGAATGCCC CGCCCATCAG CAAATATTCC GGTCCGGGAT CTCTCGCCTC TGCATCAGAG GATGCTGGCT TTCAGGATGA GCTGATACTC TCCGGGACGG TGATCGCAGG CGAGGATATG GTTGTCCTGG ATGGGTACGT CGTTATAGAG AACGGCGTAA TAAAAGAGAT CGGTGAGGGC AAAGAGAGAG GCCACCTCGA GGGAATCATA TGCCCTGCAT TTGTCAATGC GCACACACAT GTCGCGGACT CCATCGCGAA GGATCCGCCG TTCATGGATC TCGCGGATCT TGTAGGGCCG GGCGGTCTCA AGCACAGAAT TCTTGAGAGC GCGAGCGATG ATCTCCTCGT CGAATCGATG CGCTTCTCTG TTTCAGAGAT GCTAGATACC GGGACCTGCG TCTTCGGCGA TTTCAGGGAG GGTGGTTCTC ACGGCGTGGA GCTACTCCTC AGAGCTATCG AGGGTTTGGG GATCCAGAGC AGGATCTTCG GCAGGCCTCT GAGGGCGCCA TGGGATATAC ATCCAGCATG CTGGGGAGTC GGGCTTAGCA GCACTAGGGA TTACGATCAG AGTTTTGTGG ATGAGGTTGT GAGGATCGCA AGAAAAGAGG GAAAGCGCAT CGCGATACAC GCTGGAGAGG CCGGCAGGGA TGATATAGAT GGCGCGCTCG CGCTCGACCC CGACATTCTC ATACATCTCT CCAGAGCGGA GCGCTCCGAT CTCAGGGATG TTGCGGAATC TGGCGCCTCT GTGGTCGTGT GCCCCAGATC GAACCTCTTC ACACGCGCCG GCCTTCCTGA TGTCTCATCG ATGCTATCTC TGGGAATCAA TGTCTGTGTG GGTACGGATA ACATTATGAT AAATTCGACC AACATCTTCA GGGAGATGGA GCTTCTCTCA AAAGCGCTTG TCAGAGACGA CAGACAGGTT TTTATGATGT GCACGATAAA TGGAGCAAGA GCACTGGGAA TGGATGAGAG GCTCGGCTCC GTCGATCCCG GAAAGGAGGC GCGGGTGATG GTGTTCGACA GGAACTCACG CAACATGAGG GGATCGTTGA ATCCATTGGG AAGCATCGTG AGAAGGGCTG AGCCCTCTGA TATAATACTG AGGATCTGA
|
Protein sequence | MRRIDEDVNK LNAPPISKYS GPGSLASASE DAGFQDELIL SGTVIAGEDM VVLDGYVVIE NGVIKEIGEG KERGHLEGII CPAFVNAHTH VADSIAKDPP FMDLADLVGP GGLKHRILES ASDDLLVESM RFSVSEMLDT GTCVFGDFRE GGSHGVELLL RAIEGLGIQS RIFGRPLRAP WDIHPACWGV GLSSTRDYDQ SFVDEVVRIA RKEGKRIAIH AGEAGRDDID GALALDPDIL IHLSRAERSD LRDVAESGAS VVVCPRSNLF TRAGLPDVSS MLSLGINVCV GTDNIMINST NIFREMELLS KALVRDDRQV FMMCTINGAR ALGMDERLGS VDPGKEARVM VFDRNSRNMR GSLNPLGSIV RRAEPSDIIL RI
|
| |