Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II1639 |
Symbol | |
ID | 3846130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1922799 |
End bp | 1924055 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637838940 |
Product | cytosine deaminase |
Protein accession | YP_439833 |
Protein GI | 83717088 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.233761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTCA TCAACGCGAC GCTGCGCAAG CGCAGCGGCC TTTTCAGCAT CGCGCTCGAC GGCGCGACGA TCGCGAGCGT CACGCCGCAG CCGGCGCGCA TCGATGCGCA AGGCGCGCCG CGCGCGGATG AAATCGATGT CGGCGGCAAG CTCGTGATTC CGCCGCTCGT CGAGCCGCAC ATCCACCTGG ATGCGGTGCT GACGGCGGGC GAGCCCGAGT GGAACATGAG CGGCACGCTG TTCGAGGGAA TCGAGCGCTG GGCGCAGCGC AAGGCGACGA TCACGCACGA GGACACGAAG GCGCGCGCGC ATGCGGCGAT CGGGATGCTG CGCGATCACG GCATTCAGCA CGTGCGCACC CACGTCGACG TGACCGATCC TTCGCTCGCG GCGCTGCAAG CGATGCTCGA AGTGAAGGAC GAGGCGCGCG GGCTGATCGA TCTGCAGATC GTCGCGTTTC CGCAGGAAGG GATCGAATCG TTCGACGGCG GCCGCGCGCT GATGGCGCGC GCGATCGCGA TGGGCGCGGA CGTCGTCGGC GGCATTCCGC ACTTCGAGAA CACGCGCGAG CAGGGCGTGA GCTCGATCGA GTTCCTGATG GATCTCGCCG ATCGCAGCGG CTGCCTCGTC GACGTGCATT GCGACGAAAC CGACGATCCG AACTCGCGTT TTCTCGAGGT GCTCGCCGAG CAAGCGCGCG TGCGCGGCGT CGGCGCGCGC GTGACGGCGA GCCACACGAC CGCGATGGGC TCGTACGACA ATGCGTACTG CTCGAAGCTG TTCCGCTTGC TGAAGCGCTC GGAGATCAAC TTCATCTCGT GTCCGACCGA GAGCATCCAT CTGCAAGGCC GCTTCGACAC GTTTCCGAAG CGCCGCGGGC TCACGCGCGT CGCCGAGCTC GATCGAGCCG GGATGAACGT GTGCTTCGGC CAGGATTCGA TTCGGGACCC GTGGTATCCG CTCGGCAACG GCAACATCCT GCGCGTGCTC GACGCGGGGC TGCACATTTG CCACATGATG GGCTATCAGG ATCTCGCACG CGCTCTTGAT TTCGTCACCG ACCATAGCGC GCGCGCGATG CATCTCGGCG AGCGCTACGG AATCGAGCCG GGGCGCCCCG CGAATCTCGT CGTGCTCGAC GCATCCGACG ATTACGAGGC GTTGCGCCGG CAGGCGAAGG CGCTGCTGTC GATTCGGGGC GGCGACGTGA TCATGCGCCG CGTGCCCGAG CGCATCGACT ACCCGGCCGC GCGCTGA
|
Protein sequence | MKLINATLRK RSGLFSIALD GATIASVTPQ PARIDAQGAP RADEIDVGGK LVIPPLVEPH IHLDAVLTAG EPEWNMSGTL FEGIERWAQR KATITHEDTK ARAHAAIGML RDHGIQHVRT HVDVTDPSLA ALQAMLEVKD EARGLIDLQI VAFPQEGIES FDGGRALMAR AIAMGADVVG GIPHFENTRE QGVSSIEFLM DLADRSGCLV DVHCDETDDP NSRFLEVLAE QARVRGVGAR VTASHTTAMG SYDNAYCSKL FRLLKRSEIN FISCPTESIH LQGRFDTFPK RRGLTRVAEL DRAGMNVCFG QDSIRDPWYP LGNGNILRVL DAGLHICHMM GYQDLARALD FVTDHSARAM HLGERYGIEP GRPANLVVLD ASDDYEALRR QAKALLSIRG GDVIMRRVPE RIDYPAAR
|
| |