Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dbac_2936 |
Symbol | |
ID | 8378629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfomicrobium baculatum DSM 4028 |
Kingdom | Bacteria |
Replicon accession | NC_013173 |
Strand | + |
Start bp | 3322790 |
End bp | 3324058 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 645002169 |
Product | cytosine deaminase |
Protein accession | YP_003159427 |
Protein GI | 256830699 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGATT TGCTTATCAT CAACGCCGCC CTGCCTGGGG AGGAGGCGCT AGTCGAGATT GGCTGCAAGG ATGGCCGGAT TGTTGCGGTT GAACCGAGTA TCAAGGCTGA GGCGGCTCAG GTCATTGACG CCAAGGGATA TCTGGTGACT CCGCCTTTCG TGGACAGCCA TTTCCATATG GACGCGACCT TGTCGGCGGG GCTGCCGCGC AGGAACGAGA CGGGCACGCT GCTGGAGGGA ATTCGGATTT GGGGCGAGCT CAAGCCCGAC CTGACGCCGG AGGCTATCAA GGACCGGGCC ATGAAGCTGC TGCGCTGGAG CGTGGCCAAG GGCAATCTGG CCATCCGCAC CCATGTGGAC ACGACGGATC CGAGTCTCAT GGCTGTGGAT GTGCTGCTGG AAGTGCGCGA GGAGATGAAG GATTTCGTGG ACATCCAGCT GGTGGCCTTT CCCCAGGACG GGGTGTTGCG CGCCCCGCGC GGGGTGGAAC TGCTGGAGCG GGCTCTCGAT AAGGGCGTGG ACGTGGTCGG CGGCATCCCG CATTTCGAGC GGACCATGGA TCAGGGCCGG GAGTCGGTGC GCGTGCTGTG CGAGATCGCG GCCATGCGCG GGCTCATGGT GGACATGCAC TGCGACGAGT CCGACGACCC GCTTTCCCGG CACGTGGAGA GCCTGGCGTT TGAGACGCAA AGGCTGGGGC TACAGGGCCG GGTCACGGGC TCGCACCTGA CCAGTATGCA CTCCATGGAC AATTATTACG TGTCGAAGCT GTTGCCGCTC ATGGCCGAGG CGGGTTTGCA CTGCGTGTGC AACCCGCTGG TGAACATGAA CCTGCAAGGC CGCCACGACA CGTACCCCAA GAGACGGGGG CTCATGCGCG TGCCGGAGCT GATGAACTTA GGGCTCAACG TGTCCTTCGG CCACGACGAC ATCATGGACC CCTGGTACCC CATGGGCACC CACGACATGC TCGAAGTGGC GCACATGGGA GCGCACGCCC TGCACATGAC TGGCGTGGAT GGTTTGAAGA AGATGTTTGC GGCCGTGACC GTGAACGGTG CCAAAACCAT GGGACTGGAA AGTTACGGAC TTGAGCCCGG CTGCAACGCG GACATGGTCA TCCTGCAGGC CGCAAGCGAG ATTGAAGCCC TGCGCCTGCA CCCGGCACGC CTGTGGGTCA TCCGTCGCGG CAAGGTCATC AGCCGGACCC CGGAAGTGGT GGCCAGCGTG GAGTTGGGAG AGGGCGAGGA GTTGGTGGAT TTTACATAA
|
Protein sequence | MLDLLIINAA LPGEEALVEI GCKDGRIVAV EPSIKAEAAQ VIDAKGYLVT PPFVDSHFHM DATLSAGLPR RNETGTLLEG IRIWGELKPD LTPEAIKDRA MKLLRWSVAK GNLAIRTHVD TTDPSLMAVD VLLEVREEMK DFVDIQLVAF PQDGVLRAPR GVELLERALD KGVDVVGGIP HFERTMDQGR ESVRVLCEIA AMRGLMVDMH CDESDDPLSR HVESLAFETQ RLGLQGRVTG SHLTSMHSMD NYYVSKLLPL MAEAGLHCVC NPLVNMNLQG RHDTYPKRRG LMRVPELMNL GLNVSFGHDD IMDPWYPMGT HDMLEVAHMG AHALHMTGVD GLKKMFAAVT VNGAKTMGLE SYGLEPGCNA DMVILQAASE IEALRLHPAR LWVIRRGKVI SRTPEVVASV ELGEGEELVD FT
|
| |