Gene Information |
Locus tag | Dshi_1943 |
Symbol | codA |
ID | 5712937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2032700 |
End bp | 2033968 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267868 |
Product | cytosine deaminase |
Protein accession | YP_001533285 |
Protein GI | 159044491 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
| |
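The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts a terminal stop codon that the protein length does not. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is part of the gene but encodes no amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Dshi_1943.
length = gene_length(2032700, 2033968)
print(length)                           # 1269
print(expected_protein_length(length))  # 422
```

Both results agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields of the record.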
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.115146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0000891161 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCGACA TTCTGGTCAA GGGCGGCACG CTGCCCGACG GCACCCAGGC CGATATCGCC ATCACCGGCG ACCGCATCGT CGACGTCGCG CCCGGGATCG CCGCCAAGGC GGGCGAGGTG ATCGACGCCA CCGGCGATCT GGTCAGCCCG CCCTTCGTGG ACCCCCATTT CCACATGGAC GCCACGCTCA GCTACGGCAT ACCGCGGATC AACGCCTCGG GCACGCTGCT CGAAGGGATC GCGCTCTGGG GCGAGTTGAA GCACGAGACG ACCATCGACG CGATGATCGA CCGCGCCCTG CGCTATTGCG ACTGGGCGGT CTCCATGGGG CTGCTGGCGA TCCGCTCCCA TGTCGATACC TGCGACGACA GCCTCAAGGG CGTGCAGGCG ATGTTGCAGC TGCGCGAGAC GGTCAAACCC TATCTCGACC TGCAACTGGT GGCCTTCCCC CAGGACGGGC TCTACCGCGA TCCGACCGCG CGGGAAAACA CCCTGCGCGC GCTGGATATG GGGCTGGACG TGGTGGGCGG CATCCCGCAT TTCGAGCGCA CCATGGCCGA TGGTGCCGCC TCCGTGCGCG ATCTGTGCGA AATCGCCGCC GACCGGGGCC TGCCCGTCGA TATGCACTGC GACGAGAGCG ACGATCCGAT GTCGCGGCAT ATCGAAACCC TGGCGGCGGA AACCGTCCGC TGCGGGCTGC AGGGCAGGGT GGCCGGATCG CACCTGACCT CCATGCATTC GATGGACAAT TACTACGTCT CGAAACTGCT GGCGCTCGTG GCCGAGGCCG GGATTTCGGC GATCCCCAAC CCGCTGATCA ACATCATGCT GCAGGGCCGC CACGATACCT ATCCCAAGCG CCGGGGCCTG ACCCGCGTGC GCGAGATGCA GGCGCTCGGC ATCCCCGTGG GCTGGGGCCA GGACTGCGTG CGCGACCCGT GGTATTCGCT GGGCACCGCC GACATGCTCG ACGTGGCCTT CATGGGGCTG CATGTGGCGC AGATGTCCGC GCCGGAAGAG ATGGCGCGCT GTTTCGAGAT GGTGACCGAA ACCAACGCCG CGATCATCGG GCTGCCGGAT TACGGGCTGC GCAAGGGGGC GCTGGCCTCG CTCGTGGTGC TCGATGCCGC CGACCCGATC GAGGCGGTGC GCCTGCGCCC GGACCGTTTG TGCGTGATCT CCAAGGGCAG GGTGGTCTCG CGCAAGGCGC GCAACGATGC GGCCCTGACG CTGCCTGGCC GCCCCGCCAC GGTCCACCGA AGGCATTGA
Protein sequence | MVDILVKGGT LPDGTQADIA ITGDRIVDVA PGIAAKAGEV IDATGDLVSP PFVDPHFHMD ATLSYGIPRI NASGTLLEGI ALWGELKHET TIDAMIDRAL RYCDWAVSMG LLAIRSHVDT CDDSLKGVQA MLQLRETVKP YLDLQLVAFP QDGLYRDPTA RENTLRALDM GLDVVGGIPH FERTMADGAA SVRDLCEIAA DRGLPVDMHC DESDDPMSRH IETLAAETVR CGLQGRVAGS HLTSMHSMDN YYVSKLLALV AEAGISAIPN PLINIMLQGR HDTYPKRRGL TRVREMQALG IPVGWGQDCV RDPWYSLGTA DMLDVAFMGL HVAQMSAPEE MARCFEMVTE TNAAIIGLPD YGLRKGALAS LVVLDAADPI EAVRLRPDRL CVISKGRVVS RKARNDAALT LPGRPATVHR RH
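The gene and protein sequences above can be related to each other and to the GC content field with a few lines of Python. This is a minimal sketch under stated assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the gene sequence is taken as already being in coding orientation (it appears to start with ATG and end with the stop codon TGA, even though the gene lies on the minus strand), and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten; upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGGTCGACA TTCTGGTCAA GGGCGGCACG"))  # MVDILVKGGT
```

The printed peptide matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` should allow the GC content and Protein sequence fields to be checked in the same way.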