Gene BURPS668_A1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1120 
SymbolcodA 
ID4888186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1075725 
End bp1076981 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content68% 
IMG OID640131060 
Productcytosine deaminase 
Protein accessionYP_001062119 
Protein GI126444284 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCTCA TCAACGCGAC GCTGCGCAAG CGCAGCGGCC TTTTCAGCAT CGTGCTCGAC 
GGCGGCGCGA TCGCGAGCGT CACGCCGCAG CCGGCGCGTG TCGACGCGCC GCACGCGCCG
CGGGCGGACG TGATCGACGC CGACGGCAAG CTCGTGATCC CGCCGCTCGT CGAGCCGCAC
ATCCATCTCG ACGCGGTGCT GACGGCGGGC GAGCCCGAAT GGAACATGAG CGGCACGCTG
TTCGAGGGGA TCGAGCGCTG GGCGCAGCGC AAGGCGACGA TCACGCACGA GGACACGAAG
GCCCGCGCGC ATGCGGCGAT CGGGATGCTG CGCGATCACG GCATTCAGCA CGTGCGCACG
CACGTCGACG TGACCGACCC TTCGCTCGCG GCGCTGAAGG CGATGCTCGA GGTGAAGGAC
GAGGCGCGCG GGCTGATCGA TCTGCAGATC GTCGCGTTCC CGCAGGAAGG CATCGAATCG
TTCGACGGCG GCCGCGCGCT GATGGAGCGG GCAATCGAGG TGGGCGCGGA CGTCGTCGGC
GGCATTCCGC ACTTCGAGAA CACGCGCGAG CAGGGCGTGA GCTCGATCCG GTTCCTGATG
GATCTCGCCG AGCGCAGCGG CTGCCTGGTC GATGTGCACT GCGACGAGAC CGACGATCCG
AACTCGCGCT TTCTCGAGGT GCTCGCCGAG GAAGCGCGCG TGCGCGGGAT CGGCGCGCGC
GTGACGGCGA GCCATACGAC GGCGATGGGC TCGTACGACA ATGCGTACTG CTCGAAGCTG
TTCCGCTTGC TGAAGCGCTC GCAGATCAAT TTCATCTCGT GCCCGACCGA AAGCATCCAT
CTGCAAGGCC GCTTCGATAC GTTCCCGAAG CGCCGCGGTC TCACGCGCGT CGCCGAGCTC
GATCGCGCCG GCATGAACGT GTGCTTCGGC CAGGATTCGA TTCGGGACCC CTGGTATCCG
CTCGGCAACG GCAACATCCT GCGTGCGCTC GATGCGGGGC TGCACATCTG CCACATGATG
GGCTATCAGG ATCTCGCGCG CAGCCTCGAT TTCGTCACCG AGCACAGCGC GCGCGCGATG
CATCTCGGCG AGCGCTACGG CATCGAGCCG GGGCGGCCGG CGAATCTCGT CGTGCTCGAC
GCATCCGACG ATTACGAGGC GCTGCGGCGG CAGGCGAAGG CGCTGCTGTC GATTCGCGGC
GGCGAAGTGA TCATGCGCCG CGTGCCCGAG CGCATCGCGT ACCCGGCCGC GCGCTGA
 
Protein sequence
MKLINATLRK RSGLFSIVLD GGAIASVTPQ PARVDAPHAP RADVIDADGK LVIPPLVEPH 
IHLDAVLTAG EPEWNMSGTL FEGIERWAQR KATITHEDTK ARAHAAIGML RDHGIQHVRT
HVDVTDPSLA ALKAMLEVKD EARGLIDLQI VAFPQEGIES FDGGRALMER AIEVGADVVG
GIPHFENTRE QGVSSIRFLM DLAERSGCLV DVHCDETDDP NSRFLEVLAE EARVRGIGAR
VTASHTTAMG SYDNAYCSKL FRLLKRSQIN FISCPTESIH LQGRFDTFPK RRGLTRVAEL
DRAGMNVCFG QDSIRDPWYP LGNGNILRAL DAGLHICHMM GYQDLARSLD FVTEHSARAM
HLGERYGIEP GRPANLVVLD ASDDYEALRR QAKALLSIRG GEVIMRRVPE RIAYPAAR