Gene BTH_II1639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1639 
Symbol 
ID3846130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1922799 
End bp1924055 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content67% 
IMG OID637838940 
Productcytosine deaminase 
Protein accessionYP_439833 
Protein GI83717088 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.233761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTCA TCAACGCGAC GCTGCGCAAG CGCAGCGGCC TTTTCAGCAT CGCGCTCGAC 
GGCGCGACGA TCGCGAGCGT CACGCCGCAG CCGGCGCGCA TCGATGCGCA AGGCGCGCCG
CGCGCGGATG AAATCGATGT CGGCGGCAAG CTCGTGATTC CGCCGCTCGT CGAGCCGCAC
ATCCACCTGG ATGCGGTGCT GACGGCGGGC GAGCCCGAGT GGAACATGAG CGGCACGCTG
TTCGAGGGAA TCGAGCGCTG GGCGCAGCGC AAGGCGACGA TCACGCACGA GGACACGAAG
GCGCGCGCGC ATGCGGCGAT CGGGATGCTG CGCGATCACG GCATTCAGCA CGTGCGCACC
CACGTCGACG TGACCGATCC TTCGCTCGCG GCGCTGCAAG CGATGCTCGA AGTGAAGGAC
GAGGCGCGCG GGCTGATCGA TCTGCAGATC GTCGCGTTTC CGCAGGAAGG GATCGAATCG
TTCGACGGCG GCCGCGCGCT GATGGCGCGC GCGATCGCGA TGGGCGCGGA CGTCGTCGGC
GGCATTCCGC ACTTCGAGAA CACGCGCGAG CAGGGCGTGA GCTCGATCGA GTTCCTGATG
GATCTCGCCG ATCGCAGCGG CTGCCTCGTC GACGTGCATT GCGACGAAAC CGACGATCCG
AACTCGCGTT TTCTCGAGGT GCTCGCCGAG CAAGCGCGCG TGCGCGGCGT CGGCGCGCGC
GTGACGGCGA GCCACACGAC CGCGATGGGC TCGTACGACA ATGCGTACTG CTCGAAGCTG
TTCCGCTTGC TGAAGCGCTC GGAGATCAAC TTCATCTCGT GTCCGACCGA GAGCATCCAT
CTGCAAGGCC GCTTCGACAC GTTTCCGAAG CGCCGCGGGC TCACGCGCGT CGCCGAGCTC
GATCGAGCCG GGATGAACGT GTGCTTCGGC CAGGATTCGA TTCGGGACCC GTGGTATCCG
CTCGGCAACG GCAACATCCT GCGCGTGCTC GACGCGGGGC TGCACATTTG CCACATGATG
GGCTATCAGG ATCTCGCACG CGCTCTTGAT TTCGTCACCG ACCATAGCGC GCGCGCGATG
CATCTCGGCG AGCGCTACGG AATCGAGCCG GGGCGCCCCG CGAATCTCGT CGTGCTCGAC
GCATCCGACG ATTACGAGGC GTTGCGCCGG CAGGCGAAGG CGCTGCTGTC GATTCGGGGC
GGCGACGTGA TCATGCGCCG CGTGCCCGAG CGCATCGACT ACCCGGCCGC GCGCTGA
 
Protein sequence
MKLINATLRK RSGLFSIALD GATIASVTPQ PARIDAQGAP RADEIDVGGK LVIPPLVEPH 
IHLDAVLTAG EPEWNMSGTL FEGIERWAQR KATITHEDTK ARAHAAIGML RDHGIQHVRT
HVDVTDPSLA ALQAMLEVKD EARGLIDLQI VAFPQEGIES FDGGRALMAR AIAMGADVVG
GIPHFENTRE QGVSSIEFLM DLADRSGCLV DVHCDETDDP NSRFLEVLAE QARVRGVGAR
VTASHTTAMG SYDNAYCSKL FRLLKRSEIN FISCPTESIH LQGRFDTFPK RRGLTRVAEL
DRAGMNVCFG QDSIRDPWYP LGNGNILRVL DAGLHICHMM GYQDLARALD FVTDHSARAM
HLGERYGIEP GRPANLVVLD ASDDYEALRR QAKALLSIRG GDVIMRRVPE RIDYPAAR