Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A3252 |
Symbol | |
ID | 3627729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 4175140 |
End bp | 4176438 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637702088 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_306713 |
Protein GI | 73670698 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.147814 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGACA TAATTGTAAA AAATGCTTAC GTTATGACAA TGGACCCCGA TGAGGGAGAT CTCAAAAATG GGACTGTTGT CATCGAAGAC GGAAAGATTA CGGAAATCGG AGAAAAGACC AGTGAAAGTG CCGATACCGT AATTGATGCA AAACATTCGG TAGTAATGCC AGGGCTTGTA AATACGCATA CTCATGCTGC AATGACTCTT TTTCGGGGTT ATGCCGACGA TTTGCAGCTT GCAGACTGGC TTGAAGGGCA TATATGGCCT GCCGAGGCAA AGCTGACTGC AGAAGATGTT TATAAAGGCA GTCTGCTTGC CTGCCTGGAG ATGATCAGGT CAGGCACTAC TTCTTTTGCG GACATGTACT TTTATATGGA CGAGACTGCA AAAGCTGTTG AGGCATCAGG GCTTCGGGCT TCACTTTGCC ATGGGCTTAT CGAACTCTGG AACGAAGAAA AGGGCGCAAC AGACCTAAAA GAAGGGAAGC GCTTTGTCCG GGCCTGGCAG GGAGCGGCTG ACGGCAGGAT AAAAACAATG TATGGGCCTC ATGCCCCGAA TACCTGCTCT GAAGAATTTC TTGCAAAGGT AAGGGAAGAA GCCAACAGGG ATGGTGCAGG AATCCATATC CATCTCCTTG AAACGGAAGC CGAACTCCTG GCTATGAAAG AAAGGTACGG GAAATGTTCG GTGCACCTTC TGGAAGACAT AGGATTTTTA GGGCCTGATG TGCTTGCTGC TCACTGTGTC TGGCTTTCGG ACGGCGACAT AGAAATCCTG GGAAAAAGAG GAGTAAATGT TTCTCATAAT GTCATAAGTA ATATGAAACT AGCTTCAGGG ATTGCACCTG TATACAAGAT GCTCGAAAAA GGAGTCAATG TGAGCCTTGG TACGGATGGT TGTGCCTCAA ACAATAACCT TGACCTTTTT GAGGAGATGA AAACGGCTGC TCTTCTGCAT AAAGTCAATA CTTTCAGCCC TACTGCCCTT CCTGCGCGAC AGGTGCTTCA AATGGGTACT GTGAACGGTG CAAAAGCCCT TGGCACGGAA ACCGGCATGT TGAAAGTAGG AATGAAAGCG GACCTTATCG TGGTAGATAT GAAAAAAGCG CATCTTACCC CCTGTTTTGA TGTTCCCTCC CACTTGGTGT ACTCTGCAAA AGGAAGCGAC GTCAGGACAA CAATTGTAAA TGGGAAAGTC CTTATGGATG ATTACAAAGT GCTGGCCCTG GACGAGCAGA AAGTTATGGA AGATGCTCAA AAAGCCGCAG AAGAGCTTGT TACAAGGGTA AACGCCTGA
|
Protein sequence | MADIIVKNAY VMTMDPDEGD LKNGTVVIED GKITEIGEKT SESADTVIDA KHSVVMPGLV NTHTHAAMTL FRGYADDLQL ADWLEGHIWP AEAKLTAEDV YKGSLLACLE MIRSGTTSFA DMYFYMDETA KAVEASGLRA SLCHGLIELW NEEKGATDLK EGKRFVRAWQ GAADGRIKTM YGPHAPNTCS EEFLAKVREE ANRDGAGIHI HLLETEAELL AMKERYGKCS VHLLEDIGFL GPDVLAAHCV WLSDGDIEIL GKRGVNVSHN VISNMKLASG IAPVYKMLEK GVNVSLGTDG CASNNNLDLF EEMKTAALLH KVNTFSPTAL PARQVLQMGT VNGAKALGTE TGMLKVGMKA DLIVVDMKKA HLTPCFDVPS HLVYSAKGSD VRTTIVNGKV LMDDYKVLAL DEQKVMEDAQ KAAEELVTRV NA
|
| |