Gene Dbac_2936 details

Gene Information

Locus tag: Dbac_2936
Symbol:
ID: 8378629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfomicrobium baculatum DSM 4028
Kingdom: Bacteria
Replicon accession: NC_013173
Strand:
Start bp: 3322790
End bp: 3324058
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 63%
IMG OID: 645002169
Product: cytosine deaminase
Protein accession: YP_003159427
Protein GI: 256830699
COG category: [F] Nucleotide transport and metabolism
              [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
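
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Start bp / End bp fields of this record.
START_BP = 3322790
END_BP = 3324058


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues, assuming one terminal stop codon is excluded."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1269, matches "Gene Length"
print(protein_length(length_bp))  # 422, matches "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.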


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGATT TGCTTATCAT CAACGCCGCC CTGCCTGGGG AGGAGGCGCT AGTCGAGATT 
GGCTGCAAGG ATGGCCGGAT TGTTGCGGTT GAACCGAGTA TCAAGGCTGA GGCGGCTCAG
GTCATTGACG CCAAGGGATA TCTGGTGACT CCGCCTTTCG TGGACAGCCA TTTCCATATG
GACGCGACCT TGTCGGCGGG GCTGCCGCGC AGGAACGAGA CGGGCACGCT GCTGGAGGGA
ATTCGGATTT GGGGCGAGCT CAAGCCCGAC CTGACGCCGG AGGCTATCAA GGACCGGGCC
ATGAAGCTGC TGCGCTGGAG CGTGGCCAAG GGCAATCTGG CCATCCGCAC CCATGTGGAC
ACGACGGATC CGAGTCTCAT GGCTGTGGAT GTGCTGCTGG AAGTGCGCGA GGAGATGAAG
GATTTCGTGG ACATCCAGCT GGTGGCCTTT CCCCAGGACG GGGTGTTGCG CGCCCCGCGC
GGGGTGGAAC TGCTGGAGCG GGCTCTCGAT AAGGGCGTGG ACGTGGTCGG CGGCATCCCG
CATTTCGAGC GGACCATGGA TCAGGGCCGG GAGTCGGTGC GCGTGCTGTG CGAGATCGCG
GCCATGCGCG GGCTCATGGT GGACATGCAC TGCGACGAGT CCGACGACCC GCTTTCCCGG
CACGTGGAGA GCCTGGCGTT TGAGACGCAA AGGCTGGGGC TACAGGGCCG GGTCACGGGC
TCGCACCTGA CCAGTATGCA CTCCATGGAC AATTATTACG TGTCGAAGCT GTTGCCGCTC
ATGGCCGAGG CGGGTTTGCA CTGCGTGTGC AACCCGCTGG TGAACATGAA CCTGCAAGGC
CGCCACGACA CGTACCCCAA GAGACGGGGG CTCATGCGCG TGCCGGAGCT GATGAACTTA
GGGCTCAACG TGTCCTTCGG CCACGACGAC ATCATGGACC CCTGGTACCC CATGGGCACC
CACGACATGC TCGAAGTGGC GCACATGGGA GCGCACGCCC TGCACATGAC TGGCGTGGAT
GGTTTGAAGA AGATGTTTGC GGCCGTGACC GTGAACGGTG CCAAAACCAT GGGACTGGAA
AGTTACGGAC TTGAGCCCGG CTGCAACGCG GACATGGTCA TCCTGCAGGC CGCAAGCGAG
ATTGAAGCCC TGCGCCTGCA CCCGGCACGC CTGTGGGTCA TCCGTCGCGG CAAGGTCATC
AGCCGGACCC CGGAAGTGGT GGCCAGCGTG GAGTTGGGAG AGGGCGAGGA GTTGGTGGAT
TTTACATAA
 
Protein sequence
MLDLLIINAA LPGEEALVEI GCKDGRIVAV EPSIKAEAAQ VIDAKGYLVT PPFVDSHFHM 
DATLSAGLPR RNETGTLLEG IRIWGELKPD LTPEAIKDRA MKLLRWSVAK GNLAIRTHVD
TTDPSLMAVD VLLEVREEMK DFVDIQLVAF PQDGVLRAPR GVELLERALD KGVDVVGGIP
HFERTMDQGR ESVRVLCEIA AMRGLMVDMH CDESDDPLSR HVESLAFETQ RLGLQGRVTG
SHLTSMHSMD NYYVSKLLPL MAEAGLHCVC NPLVNMNLQG RHDTYPKRRG LMRVPELMNL
GLNVSFGHDD IMDPWYPMGT HDMLEVAHMG AHALHMTGVD GLKKMFAAVT VNGAKTMGLE
SYGLEPGCNA DMVILQAASE IEALRLHPAR LWVIRRGKVI SRTPEVVASV ELGEGEELVD
FT
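
The protein sequence follows from the gene sequence through translation table 11, and the GC content field is a simple base count. The sketch below shows one way to reproduce both. It is a minimal illustration, not the pipeline that produced this record: the names `clean`, `translate`, and `gc_content` are hypothetical, the codon table is built from the standard compact amino-acid string for table 11, and the special handling of alternative start codons is omitted because this gene begins with ATG.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino-acid
# string for translation table 11 (bacterial, archaeal, plant plastid).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks and the final line of the gene sequence above.
print(translate("ATGCTGGATT TGCTTATCAT CAACGCCGCC"))  # MLDLLIINAA
print(translate("TTTACATAA"))                         # FT
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the Protein sequence block and the GC content field.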