Gene BURPS668_A2229 details


Gene Information

Locus tag: BURPS668_A2229
Symbol:
ID: 4887849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2156964
End bp: 2159228
Gene Length: 2265 bp
Protein Length: 754 aa
Translation table: 11
GC content: 77%
IMG OID: 640132166
Product: hypothetical protein
Protein accession: YP_001063223
Protein GI: 126445299
COG category:
COG ID:
TIGRFAM ID: [TIGR03369] cellulose biosynthesis protein BcsE
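
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2156964, 2159228)
print(length)                  # 2265
print(protein_length(length))  # 754
```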


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.281505
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGACT CACGCGTGAA ATCCCCGGAC CCGGTGCCCG GCCGGCCCGC GAGCGGCGGG 
GCCGGTGCGC GCGCGCTCGC CCGCCTGCGC GCGTTGTGGC GCGTGTGCTC GCGCGCGGCG
CGGCCGCGCG AGCCCGCGCA TGCGGCGAAC CGGCTCGCGA TCGACGCGCT GCCCGACGAG
TGGGCCGAGC TCGCGCCGGG CGGCCTGTAT GCGGTGTACG CGGCGGCGCG CACGAGCGCG
TGCGACGCGC TGATCTGGGA CAGCGTGCGG GACGCGCGCA CGCGCGACGT CACGGTGGTG
CTCGCGCGCG AGCGCGCGGC GGTCGCGACG CGGCTGCGCG AGCTCGGCTT CGTCGACGGC
ATGCACGCGC GCGGCTGGCC GCGGCGGTTG AACGTGCTGG CGATGCCGCC GGGCGATATC
GCGGTGCGCG GCGCGGCGCG TGAGGGCGCA CCCGCGTCCG CGCCCGCGCC CGCGCCCGCG
TTCTCACGCC TCGTCGGCGG CCTGCGCGCG CTGAGGCGCT ACCGCTTCCG TTCGAACGCG
CTGTATTTCG TCGAAGGCGC GGAGCGCTGG TTCAGTTGGC ACGATCCGGT CGCGCTGACG
CACGAGGGGT GGGCGCTGGC CGGCTGGTGC CGTTCGCATC GGATCGCGCT CGTGCTGCTG
ATCGATCCGC GGGCGTCGCA AGCGGCCGCG AGCCGCGCCG ATGCGCGGCA CACGGCCCCG
CTGCCCGACG CACCCGATGC ACCTGACGCA TCCGACGCGG GTGACGGCGT GCTCGGCGGC
GCGCGCGCGG AGCACGGCGC CGACGATCGC ACCACGCTCT TCGCCGCCGA TCGCGCGCGC
GCCGCGCGCG GCGGCTTTCA CGGCGCGTGC GCGGGCGTCG CGCAATTGCA GCGCACGCAC
GGCGAGCTGC GCTGGCGGGT CGATTTCTGG CGCTCGCGCG GCGCGGTCGC CACGGGCGAG
GTGCGCGCGC TGCGCTTCAT CGGCGACGGA CGGCTCGCGG CCGTGCCGGC GGCCGGCGCG
CACGCGGCGG GCGGCGGCGC GCGGCTCGCG TTCGACGAGG CGCGCGTCGT CGTCAGCCGC
CGCGTGGTCG AGCGCGAATC GTGGGTGCCG GGCGATTGGG AAGTCGTCGA CGACAACGAC
GCGGTGCTCG CCGCGTGCGC CGGCGCGCAT GCGGCGAGCG CGGTGCTGGC GTTTACCGGC
CGCGCGCAGC TCGAAGCGCT GTGCGCGACG ATCCATGCGC TGCGCCTGCG GTGCGGCGGC
GCGCTGAAGA TCGTCGTCGT CGAGCGCGGC GAGGCGATGC GCCATCAATT CGAGCTGCTC
GCGCTGAACC TCGGCGCGAA CCAGGTCGTC GCGCGCAACC TGCCGTTCTC GCGCGTGCTC
GCGGTGCTGC GCTCGCTGCA GGGCCAGTTG CACGCGCGCC CGGTCGCGGC CGACTATCGG
GCCGCGCTCG CCGCGTCGCT CGGCGACACG GCGCTCGGCT ATCTGCCCGT CGGCGCGTTC
TGCTCGCAGG CGCGCGCGGT GCTCGAGCGC AGCGCGGTGC TCGCGCTGTC GCATACGCTC
GTGAAGCTGA CGCTGCTGCC CGGCGTCGCG CACGCGCACG CGTTGCGCGC GTGCACGCCG
CGCCGCGCGG GCGACGTGCT GACCGCCGAC GCGCAGCATC TGTATCTGTT CCTGTTCGCC
TGCGAGCTCG CCGATGCGAA CGACGTGCTC GGCCACCTCT TCGACGTGCC CGTCGAGCGG
ATCTCGGATC GCGTCGTGCA TCTCGCGCAG GACAGCATCG AGCATGAGCT GAATGCGCTC
GACGCGGCGA ACCGGCGCGC GCCGATCGCG GACTACAGCG ATCTCTTTTC GCCGGCGGCG
GTGGCGACGC GCGCGGCCGG CGCGCGCGCC TCGGCCGGCG CTCCGGCGGC GGCGCGCGAC
GGCGAACCGT CCGCCGAGCC TGTGTCGCCG CATGTGCCGC CGCATGCGCC GCATGTGTCG
GGCGCGCCGA CGCCGCCGGG CACGCGCGCC GCGGCGCACG GACCACCGTG GCGCCCGGCG
TTCGCGCTGT CGTCCGCGTC TCAGACCTCG CGGACATCGC CCGCCTCGGC GCAGATCGTC
GTACCGCCGC CGCAGGCGCC GTCGAACGTA TCGCACGCGC CGCTGTCCGC GACGCCGCGC
GCGCCGCGAC CGCGCCGGCC GCACGACGCC GGCGCGGTCG CCGGCGTGCG CACCCGCACC
GCCACGCGCG ACGCGATGCC GTTGCGCCCC AGGGAGGCTG AATGA
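
The gene sequence is laid out in blocks of 10 bases, so the whitespace has to be stripped before any computation. The sketch below shows one way to join the blocks and compute GC content as the fraction of G and C bases. The helper names are hypothetical, and the example runs on the first line of the sequence only, not on the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGAACGACT CACGCGTGAA ATCCCCGGAC CCGGTGCCCG GCCGGCCCGC GAGCGGCGGG"
seq = clean_sequence(first_line)
print(len(seq))                  # 60
print(f"{gc_content(seq):.0%}")  # 75%
```

Applying the same two functions to all lines of the gene sequence would give a value to compare against the GC content field in the Gene Information section.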
 
Protein sequence
MNDSRVKSPD PVPGRPASGG AGARALARLR ALWRVCSRAA RPREPAHAAN RLAIDALPDE 
WAELAPGGLY AVYAAARTSA CDALIWDSVR DARTRDVTVV LARERAAVAT RLRELGFVDG
MHARGWPRRL NVLAMPPGDI AVRGAAREGA PASAPAPAPA FSRLVGGLRA LRRYRFRSNA
LYFVEGAERW FSWHDPVALT HEGWALAGWC RSHRIALVLL IDPRASQAAA SRADARHTAP
LPDAPDAPDA SDAGDGVLGG ARAEHGADDR TTLFAADRAR AARGGFHGAC AGVAQLQRTH
GELRWRVDFW RSRGAVATGE VRALRFIGDG RLAAVPAAGA HAAGGGARLA FDEARVVVSR
RVVERESWVP GDWEVVDDND AVLAACAGAH AASAVLAFTG RAQLEALCAT IHALRLRCGG
ALKIVVVERG EAMRHQFELL ALNLGANQVV ARNLPFSRVL AVLRSLQGQL HARPVAADYR
AALAASLGDT ALGYLPVGAF CSQARAVLER SAVLALSHTL VKLTLLPGVA HAHALRACTP
RRAGDVLTAD AQHLYLFLFA CELADANDVL GHLFDVPVER ISDRVVHLAQ DSIEHELNAL
DAANRRAPIA DYSDLFSPAA VATRAAGARA SAGAPAAARD GEPSAEPVSP HVPPHAPHVS
GAPTPPGTRA AAHGPPWRPA FALSSASQTS RTSPASAQIV VPPPQAPSNV SHAPLSATPR
APRPRRPHDA GAVAGVRTRT ATRDAMPLRP REAE
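
The protein sequence corresponds to an in-frame translation of the gene sequence. The sketch below assumes the standard codon assignments, which translation table 11 shares with table 1 (the two tables differ in the set of permitted start codons, not in the amino acid assignments). The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGAACGACT CACGCGTGAA ATCCCCGGAC CCGGTGCCCG GCCGGCCCGC GAGCGGCGGG"
print(translate(first_line))  # MNDSRVKSPDPVPGRPASGG
```

The output matches the first 20 residues of the protein sequence above.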