Gene Information |
Locus tag | Avin_30170 |
Symbol | codA |
ID | 7761917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3125551 |
End bp | 3126789 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643805889 |
Product | Cytosine deaminase protein |
Protein accession | YP_002800157 |
Protein GI | 226945084 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
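The coordinate and length fields above agree with each other if the start and end positions are read as 1-based and inclusive, and if the protein length excludes a single terminal stop codon. Both conventions are assumptions here, not something the record states. A minimal Python sketch of that check, with illustrative function names that are not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(3125551, 3126789)
print(length, protein_length(length))  # 1239 412
```

The printed values match the Gene Length (1239 bp) and Protein Length (412 aa) fields.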
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.831905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATCG TCAACGCCCG CCTGCGCGGC CGCGAAGGCC TCCAGCGCAT CGAACTGGAC GGCGCGCGCA TCGCCGCCAT AGCCGCGCAA CCGGCGCCCG CCGAGGCCGG CGGCGACGAG TTGGACGCCG CCGGCAACCT GGTGGTACCG CCCTTCGTCG AACCGCACAT CCACCTGGAC GCCACCCTCA CCGCCGGCGA GCCGGCCTGG AACATGAGCG GCACCCTGTT CGAAGGCATC GAGCGCTGGG CCGAGCGCAA GGCGCTGGTC ACCCACGAGG ACATCAAGAC CCGGGCGAAG AAGGCCATCG ACATGCTGGT CGAGCACGGC ATCCAGCACG TGCGCACCCA CGTCGACGTC ACCGACCCGA CGCTGGCCGC GCTCAAGGCG ATGCTCGAGG TGCGCGAGGA AACCCGTCAC CTGATCGACC TGCAGATCGT CGCCTTTCCC CAGGAAGGCA TCGAGTCCTA CCAGGGCGGC CGCGAGCTGA TGACCGAGGC CATCGCCCTG GGCGCCGACG TGGTCGGCGG CATCCCGCAT TTCGAGAACA CCCGCGAGCA GGGCGTCGGC TCGATCAAAT TCCTCATGGA CCTGGCCGAG CGCACCGGCT GCCTGGTCGA CGTGCACTGC GACGAAACCG ACGATCCACA GTCGCGATTT CTCGAGGTGC TCGCCGAGGA GGCACGGGTG CGCGACATGG GCGAGCGGGT CACCGCCAGT CACACCACGG CCATGGGCTC ATGGGACAAC GCCTACTGCT CCAAGCTGTT CCGCCTGCTG AAGCTGTCGC GGATCAACTT CGTCTCCTGT CCCACCGAGA GCATCCACCT GCAGGGGCGC TTCGACACCT TTCCCAAGCG CCGCGGCCTG ACCCGGGTCG CCGAACTGGA CCGCGCCGGG CTGAACGTCT GCTTCGGCCA GGACTCCATC GTCGATCCCT GGTATCCACT GGGCAACGGC AACATCCTGC GCATCCTCGA AGCCGGCCTG CACATCTGCC ACATGCTCGG CTACGCCGAC CTGCAACGCG CCCTCGATCT GATCACCGAG CACAGCGCCA AGGCCCTGCA CCTGGGCGAG CGCTACGGCC TGGAAGTCGG GCGGCCGGCC AACCTGCTGA TCCTCTCGGC GGCCAACGAC TACGAGATGT TGCGCAGCCA GGGCCACGCG CTGGTATCGA TCCGCCACGG GGAGATCCTG ATGCGCCGCA CGCCGGCGCG GATCGAACGC CATCGCTGA
|
Protein sequence | MKIVNARLRG REGLQRIELD GARIAAIAAQ PAPAEAGGDE LDAAGNLVVP PFVEPHIHLD ATLTAGEPAW NMSGTLFEGI ERWAERKALV THEDIKTRAK KAIDMLVEHG IQHVRTHVDV TDPTLAALKA MLEVREETRH LIDLQIVAFP QEGIESYQGG RELMTEAIAL GADVVGGIPH FENTREQGVG SIKFLMDLAE RTGCLVDVHC DETDDPQSRF LEVLAEEARV RDMGERVTAS HTTAMGSWDN AYCSKLFRLL KLSRINFVSC PTESIHLQGR FDTFPKRRGL TRVAELDRAG LNVCFGQDSI VDPWYPLGNG NILRILEAGL HICHMLGYAD LQRALDLITE HSAKALHLGE RYGLEVGRPA NLLILSAAND YEMLRSQGHA LVSIRHGEIL MRRTPARIER HR
|
| |
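The sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any computation. The following is a minimal Python sketch of that clean-up plus a GC-fraction calculation. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence, used as sample input.
head = clean_sequence("ATGAAGATCG TCAACGCCCG CCTGCGCGGC")
print(head[:3], len(head))  # ATG 30
print(f"GC fraction of sample: {gc_fraction(head):.2f}")
```

Passing the complete gene sequence to `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.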