Gene BMAA1587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA1587 
Symbol 
ID3086928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp1716570 
End bp1718834 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content76% 
IMG OID637565471 
Producthypothetical protein 
Protein accessionYP_106170 
Protein GI53716243 
COG category 
COG ID 
TIGRFAM ID[TIGR03369] cellulose biosynthesis protein BcsE 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.592502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACT CACGCGTGAA ATCCCCGGAC CCGGTGCCCG GCCGGCCCGC GAGCGGCGGG 
GCCGGTGCGC GCGCGCTCGC CCGCCTGCGC GCGTTGTGGC GCGTGTGCTC GCGCGCGGCG
CGGCCGCGCG AGCCCGCGCA TGCGGCGAAC CGGCTCGCGA TCGACGCGCT GCCCGACGAG
TGGGCCGAGC TCGCGCCGGG CAGCCTGTAT GCGGTGTACG CGGCGGCGTG CACGAGCGCG
TGCGACGCGC TGATCTGGGA CAGCGTGCGG GACGCGCGCA CGCGCGACGT CACGGTGGTG
CTCGCGCGCG AGCGCGCGGC GGTCGCGACG CGGCTGCGCG AGCTCGGCTT CGTCGACGGC
ATGCACGCGC GCGGCTGGCC GCGGCGGTTG AACGTGCTGG CGATGCCGCC GGGCGATATC
GCGGCGCGCG GCGCGGCGCG TGAGGGCGCA CCCGCGCCCG CGCCCGTGCC TGCGCCCGCG
TTCTCACGCC TCGTCGGCGG CCTGCGCGCG CTGAGGCGCT ACCGCTTCCG TTCGAACGCG
CTGTATTTCG TCGAAGGCGC GGAGCGCTGG TTCAGTTGGC ACGATCCGGT CGCGCTGACG
CACGAGGGGT GGGCGCTGGC CGGCTGGTGC CGTTCGCGTC GGATCGCGCT CGTGCTGCTG
ATCGATCCGC GGGCGTCGCA AGCGGCCGCG AGCCGCGCCG ATGCGCGGCA CACGGCCCCG
CTGCCCGACG CACCCGATGC ACCTGACGCA TTCGACGCGG GTGACGGCGT GCTCGGCGGC
GCGCGCGCGG AGCACGGCGC CGACGATCGC ACCACGCTCT TCGCCGCCGA TCGCGCGCGC
GCCGCGCGCG GCGGCTTTCA CGGCGCGTGC GCGGGCGTCG CGCAATTGCA GCGCACGCAC
GGCGAGCTGC GCTGGCGGGT CGATTTCTGG CGCTCGCGCG GCGCGGTCGC CACGGGCGAG
GTGCGCGCGC TGCGCTTCGT CGGCGACGGA CGGCTCGCGG CCGTGCCGGC GGCCGGCGCG
CACGCGGCGG GCGACGGCGC GCGGCTCGCG TTCGACGAGG CGCGCGTCGT CGTCAGCCGC
CGCGTGGTCG AGCGCGAATC GTGGGTGCCG GGCGATTGGG AAGTCGTCGA CGACAACGAC
GCGGTGCTCG CCGCGTGCGC CGGCGCGCAT GCGGCGAGCG CGGTGCTGGC GTTTACCGGC
CGCGCGCAGC TCGAAGCGCT GTGCGCGACG ATCCATGCGC TGCGCCTGCG GTGCGGCGGC
GCGCTGAAGA TCGTCGTCGT CGAGCGCGGC GAGGCGATGC GCCATCAATT CGAGCTGCTC
GCGCTGAACC TCGGCGCGAA CCAGGTCGTC GCGCGCAACC TGCCGTTCTC GCGCGTGCTC
GCGGTGCTGC GCTCGCTGCA GGGCCAGTTG CACGCGCGCC CGGTCGCGGC CGACTATCGG
GCCGCGCTCG CCGCGTCGCT CGGCGACACG GCGCTCGGCT ATCTGCCCGT CGGCGCGTTC
TGCTCGCAGG CGCGCGCGGT GCTCGAGCGC AGCGCGGTGC TCGCGCTGTC GCATACGCTC
GTGAAGCTGA CGCTGCTGCC CGGCGTCGCG CACGCGCACG CGTTGCGCGC GTGCACGCCG
CGCCGCGCGG GCGACGTGCT GACCGCCGAC GCGCAGCACC TGTATCTGTT CCTGTTCGCC
TGCGAGCTCG CCGATGCGAA CGACGTGCTC GGCCACCTCT TCGACGTGCC TGTCGAGCGG
ATCTCGGATC GCGTCGTGCA TCTCGCGCAG GACAGCATCG AGCATGAGCT GAATGCGCTC
GACGCGGCGA ACCGGCGTGC GCCGATCGCG GATTACAGCG ATCTCTTTTC GCCGGCGGCG
GTGGCGACGC GCGCGGCCGG CGCGCGCGCC TCGGCCGGCG CTCCGGCGGC GGCGCGCGAC
GGCGAACCGT CCGCCGAGCC TATGTCGCCG CATGTGCCGC CGCATGCGCC GCATGTGTCG
GGCGCGCCGA CGCCGCCGGG CACGCGCGCC GCGGCGCACG GACCACCGTG GTGCTCGGCG
TTCGCGCTGT CGTCCGCATC GCAGACCTCG CGGACATCGC CCGCCTCGGC GCAGATCGTC
GTACCGCCGC CGCAGGCGCC GTCGAACGTA TCGCACGCGC CGCTGTCCGC GACGCCGCGC
GCGCCGCGAC CGCGCCGACC GCACGACGCC GGCGCGGTCG CCGGCGTGCG CACCCGCACC
GCCACGCGCG ACGCGATGCC GTTGCGCCCC AGGGAGGCTG AATGA
 
Protein sequence
MNDSRVKSPD PVPGRPASGG AGARALARLR ALWRVCSRAA RPREPAHAAN RLAIDALPDE 
WAELAPGSLY AVYAAACTSA CDALIWDSVR DARTRDVTVV LARERAAVAT RLRELGFVDG
MHARGWPRRL NVLAMPPGDI AARGAAREGA PAPAPVPAPA FSRLVGGLRA LRRYRFRSNA
LYFVEGAERW FSWHDPVALT HEGWALAGWC RSRRIALVLL IDPRASQAAA SRADARHTAP
LPDAPDAPDA FDAGDGVLGG ARAEHGADDR TTLFAADRAR AARGGFHGAC AGVAQLQRTH
GELRWRVDFW RSRGAVATGE VRALRFVGDG RLAAVPAAGA HAAGDGARLA FDEARVVVSR
RVVERESWVP GDWEVVDDND AVLAACAGAH AASAVLAFTG RAQLEALCAT IHALRLRCGG
ALKIVVVERG EAMRHQFELL ALNLGANQVV ARNLPFSRVL AVLRSLQGQL HARPVAADYR
AALAASLGDT ALGYLPVGAF CSQARAVLER SAVLALSHTL VKLTLLPGVA HAHALRACTP
RRAGDVLTAD AQHLYLFLFA CELADANDVL GHLFDVPVER ISDRVVHLAQ DSIEHELNAL
DAANRRAPIA DYSDLFSPAA VATRAAGARA SAGAPAAARD GEPSAEPMSP HVPPHAPHVS
GAPTPPGTRA AAHGPPWCSA FALSSASQTS RTSPASAQIV VPPPQAPSNV SHAPLSATPR
APRPRRPHDA GAVAGVRTRT ATRDAMPLRP REAE