Gene Dshi_1943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1943 
SymbolcodA 
ID5712937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2032700 
End bp2033968 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content68% 
IMG OID641267868 
Productcytosine deaminase 
Protein accessionYP_001533285 
Protein GI159044491 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.115146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000891161 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCGACA TTCTGGTCAA GGGCGGCACG CTGCCCGACG GCACCCAGGC CGATATCGCC 
ATCACCGGCG ACCGCATCGT CGACGTCGCG CCCGGGATCG CCGCCAAGGC GGGCGAGGTG
ATCGACGCCA CCGGCGATCT GGTCAGCCCG CCCTTCGTGG ACCCCCATTT CCACATGGAC
GCCACGCTCA GCTACGGCAT ACCGCGGATC AACGCCTCGG GCACGCTGCT CGAAGGGATC
GCGCTCTGGG GCGAGTTGAA GCACGAGACG ACCATCGACG CGATGATCGA CCGCGCCCTG
CGCTATTGCG ACTGGGCGGT CTCCATGGGG CTGCTGGCGA TCCGCTCCCA TGTCGATACC
TGCGACGACA GCCTCAAGGG CGTGCAGGCG ATGTTGCAGC TGCGCGAGAC GGTCAAACCC
TATCTCGACC TGCAACTGGT GGCCTTCCCC CAGGACGGGC TCTACCGCGA TCCGACCGCG
CGGGAAAACA CCCTGCGCGC GCTGGATATG GGGCTGGACG TGGTGGGCGG CATCCCGCAT
TTCGAGCGCA CCATGGCCGA TGGTGCCGCC TCCGTGCGCG ATCTGTGCGA AATCGCCGCC
GACCGGGGCC TGCCCGTCGA TATGCACTGC GACGAGAGCG ACGATCCGAT GTCGCGGCAT
ATCGAAACCC TGGCGGCGGA AACCGTCCGC TGCGGGCTGC AGGGCAGGGT GGCCGGATCG
CACCTGACCT CCATGCATTC GATGGACAAT TACTACGTCT CGAAACTGCT GGCGCTCGTG
GCCGAGGCCG GGATTTCGGC GATCCCCAAC CCGCTGATCA ACATCATGCT GCAGGGCCGC
CACGATACCT ATCCCAAGCG CCGGGGCCTG ACCCGCGTGC GCGAGATGCA GGCGCTCGGC
ATCCCCGTGG GCTGGGGCCA GGACTGCGTG CGCGACCCGT GGTATTCGCT GGGCACCGCC
GACATGCTCG ACGTGGCCTT CATGGGGCTG CATGTGGCGC AGATGTCCGC GCCGGAAGAG
ATGGCGCGCT GTTTCGAGAT GGTGACCGAA ACCAACGCCG CGATCATCGG GCTGCCGGAT
TACGGGCTGC GCAAGGGGGC GCTGGCCTCG CTCGTGGTGC TCGATGCCGC CGACCCGATC
GAGGCGGTGC GCCTGCGCCC GGACCGTTTG TGCGTGATCT CCAAGGGCAG GGTGGTCTCG
CGCAAGGCGC GCAACGATGC GGCCCTGACG CTGCCTGGCC GCCCCGCCAC GGTCCACCGA
AGGCATTGA
 
Protein sequence
MVDILVKGGT LPDGTQADIA ITGDRIVDVA PGIAAKAGEV IDATGDLVSP PFVDPHFHMD 
ATLSYGIPRI NASGTLLEGI ALWGELKHET TIDAMIDRAL RYCDWAVSMG LLAIRSHVDT
CDDSLKGVQA MLQLRETVKP YLDLQLVAFP QDGLYRDPTA RENTLRALDM GLDVVGGIPH
FERTMADGAA SVRDLCEIAA DRGLPVDMHC DESDDPMSRH IETLAAETVR CGLQGRVAGS
HLTSMHSMDN YYVSKLLALV AEAGISAIPN PLINIMLQGR HDTYPKRRGL TRVREMQALG
IPVGWGQDCV RDPWYSLGTA DMLDVAFMGL HVAQMSAPEE MARCFEMVTE TNAAIIGLPD
YGLRKGALAS LVVLDAADPI EAVRLRPDRL CVISKGRVVS RKARNDAALT LPGRPATVHR
RH