Gene Information |
Locus tag | BURPS1106A_A1034 |
Symbol | codA |
ID | 4905905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1000646 |
End bp | 1001902 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640144140 |
Product | cytosine deaminase |
Protein accession | YP_001075070 |
Protein GI | 126455575 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
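The length fields in this record follow from the coordinates. The sketch below shows that arithmetic. It assumes 1-based, inclusive coordinates (which the listed values imply) and a CDS whose final codon is a stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1000646, 1001902)
print(length_bp, protein_length(length_bp))  # 1257 418
```

Both results match the Gene Length and Protein Length fields listed above.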
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTCA TCAACGCGAC GCTGCGCAAG CGCAGCGGCC TTTTCAGCAT CGTGCTCGAC GGCGGCGCGA TCGCGAGCGT CACGCCGCAG CCGGCGCGTG TCGACGCGCC GCACGCGCCG CGGGCGGACG TGATCGACGC CGACGGCAAG CTCGTGATCC CGCCGCTCGT CGAGCCGCAC ATCCATCTCG ACGCGGTGCT GACGGCGGGC GAGCCCGAAT GGAACATGAG CGGCACGCTG TTCGAGGGGA TCGAGCGCTG GGCGCAGCGC AAGGCGACGA TCACGCACGA GGACACGAAG GCCCGCGCGC ATGCGGCGAT CGGGATGTTG CGCGATCACG GCATTCAGCA CGTGCGCACG CACGTCGACG TGACCGACCC TTCGCTCGCG GCGCTGAAGG CGATGCTCGA GGTGAAGGAC GAGGCGCGCG GGCTGATCGA TCTGCAGATC GTCGCGTTCC CGCAGGAAGG CATCGAATCG TTCGACGGCG GCCGCGCGCT GATGGAGCGG GCAATCGAGG TGGGCGCGGA CGTCGTCGGC GGCATTCCGC ACTTCGAGAA CACGCGCGAG CAGGGCGTGA GCTCGATCCG GTTCCTGATG GATCTCGCCG AGCGCAGCGG CTGCCTGGTC GATGTGCACT GCGACGAGAC CGACGATCCG AACTCGCGCT TTCTCGAGGT GCTCGCCGAG GAAGCGCGCG TGCGCGGGAT CGGCGCGCGC GTGACGGCGA GCCATACGAC GGCGATGGGC TCGTACGACA ATGCGTACTG CTCGAAGCTG TTCCGCTTGC TGAAGCGCTC GCAGATCAAT TTCATCTCGT GCCCGACCGA AAGCATCCAT CTGCAAGGCC GCTTCGATAC GTTCCCGAAG CGCCGCGGTC TCACGCGCGT CGCCGAGCTC GATCGCGCCG GCATGAACGT GTGCTTCGGC CAGGATTCGA TTCGGGACCC CTGGTATCCG CTCGGCAACG GCAACATCCT GCGCGCGCTC GATGCGGGGC TGCACATCTG CCACATGATG GGCTATCAGG ATCTCGCGCG CAGCCTCGAT TTCGTCACCG AGCACAGCGC GCGCGCGATG CATCTCGGCG AGCGCTACGG CATCGAGCCG GGGCGGCCGG CGAATCTCGT CGTGCTCGAC GCATCCGACG ATTACGAGGC GCTGCGGCGG CAGGCGAAGG CGCTGCTGTC GATTCGCGGC GGCGAAGTGA TCATGCGCCG CGTGCCCGAG CGCATCGCGT ACCCGGCCGC GCGCTGA
|
Protein sequence | MKLINATLRK RSGLFSIVLD GGAIASVTPQ PARVDAPHAP RADVIDADGK LVIPPLVEPH IHLDAVLTAG EPEWNMSGTL FEGIERWAQR KATITHEDTK ARAHAAIGML RDHGIQHVRT HVDVTDPSLA ALKAMLEVKD EARGLIDLQI VAFPQEGIES FDGGRALMER AIEVGADVVG GIPHFENTRE QGVSSIRFLM DLAERSGCLV DVHCDETDDP NSRFLEVLAE EARVRGIGAR VTASHTTAMG SYDNAYCSKL FRLLKRSQIN FISCPTESIH LQGRFDTFPK RRGLTRVAEL DRAGMNVCFG QDSIRDPWYP LGNGNILRAL DAGLHICHMM GYQDLARSLD FVTEHSARAM HLGERYGIEP GRPANLVVLD ASDDYEALRR QAKALLSIRG GEVIMRRVPE RIAYPAAR
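The relation between the two sequences, and the GC content field, can be checked with a short script. This is a minimal sketch, not the method the database itself uses. It assumes the sequences are pasted as plain strings, with the 10-character spacing stripped before use. Translation table 11 assigns amino acids to codons in the same way as the standard code and differs in which start codons are permitted. Because this gene starts with ATG, the sketch does not handle alternative start codons. The names `translate` and `gc_percent` are illustrative.

```python
from itertools import product

# Codon table in TCAG order; the codon assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two 10-base groups of the gene sequence above.
print(translate("ATGAAGCTCA TCAACGCGAC"))  # MKLINA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after the same whitespace stripping) gives a quick consistency check of the record. `gc_percent` on the full gene sequence can be compared with the GC content field in the same way.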