Gene BURPS1106A_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1042 
SymbolcobD 
ID4902263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1018107 
End bp1019159 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content76% 
IMG OID640134272 
Productputative threonine-phosphate decarboxylase 
Protein accessionYP_001065322 
Protein GI126454618 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01140] L-threonine-O-3-phosphate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0057725 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGACG CGCCGATCAC GCACGGCGGC AACCTGCACG AAGCCGCCCT TCGCTACGGC 
ATCCCGCGCG ACGCGTGGCT CGATCTGTCG ACGGGCATCA ATCCGCACGG TTTTCCGGTG
CCGCCCGTGC CCGCCGACGC GTGGCGCCGG CTGCCCGAGG ACGACGGCGT GCTCGCCGCG
CACGCGGCGC GCTACTACCG CGCGCCGGGC GCGGCGCACG TGCTGCCCGT CGCGGGCAGC
CAGGCGGCGA TCCGTGCGCT GCCGGCGCTT TTCGCGCGCG GCACGGTCGG CGTCGCGCCG
CTCGCATACA GCGAGTACGC GCCCGCGTTC GCGCGCCACG GCCATTCGGG CGCGCCGCTC
GACTGCGGCG CCGACACGCT GCCCGCCGCG CTCACGTACG CGATCGTCGC CAATCCGAAC
AATCCGACCG CCGAACGCAT CGATCGCACG CGGCTGCTGC GCTGGCACGC GCAACTCGTC
GCGCGCGGCG GCGCGCTGAT CGTCGACGAG GCGTTCGCGG ACGCCGAGAG CGCCGCGCAC
GCGTCGCTCG CCGCGGACAC GCATCGCGAC GGCCTCGTCG TATTGCGCTC GGTCGGCAAG
TTCTTCGGCC TCGCGGGCGT GCGCGCGGGC TTCGCGCTCG CCGCGCCCGC GCTGCTCGCG
CGGCTGCGCG ACGCGCTCGG CGCGTGGACC GTCAGCGGCC CGGCGCGCCA CGCGGTGCTC
GCCGCGTTCG CGGACGCGGC GTGGCAGCAC GCGATGCGCG AGCGGCTCGC GCACGACGGC
GCGCGCCTTG CCGCGCTGCT GCGCGCGCAC GGCTTCGTCA CGCACGCGAC GCCGCTTTTC
AGCTGGAGCG CCGATCCGCG CGCGCACGCG CTGCACGACG CGCTCGCGGC GCGCGGAATC
TGGACGCGCT ACTTCGCGCA CGCGCCGAGC GTGCGCATCG GGCTGCCCGC CGGCGACGAC
GACTGGCGGC GGCTCGAACG CACGCTCGCC GAGTGCGTGC CGACGCTAGC GGCCGCAGCC
GCGCATCCTT CCGAATCGAC CACACGGGAT TGA
 
Protein sequence
MADAPITHGG NLHEAALRYG IPRDAWLDLS TGINPHGFPV PPVPADAWRR LPEDDGVLAA 
HAARYYRAPG AAHVLPVAGS QAAIRALPAL FARGTVGVAP LAYSEYAPAF ARHGHSGAPL
DCGADTLPAA LTYAIVANPN NPTAERIDRT RLLRWHAQLV ARGGALIVDE AFADAESAAH
ASLAADTHRD GLVVLRSVGK FFGLAGVRAG FALAAPALLA RLRDALGAWT VSGPARHAVL
AAFADAAWQH AMRERLAHDG ARLAALLRAH GFVTHATPLF SWSADPRAHA LHDALAARGI
WTRYFAHAPS VRIGLPAGDD DWRRLERTLA ECVPTLAAAA AHPSESTTRD