Gene BMAA0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA0447 
Symbol 
ID3087153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp448268 
End bp449899 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content68% 
IMG OID637564367 
Producthypothetical protein 
Protein accessionYP_105224 
Protein GI53716548 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.414119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTGGC CCGCGGACGT CGACCTGCGC TTCTTCCAGC AGGCCGCGCC CGACCAATGG 
GCACGCGGCG AATGCTGGAC GCCCGGCGCG CGTTTCGAGC TGAGCGGCTT CGGGCCGCGG
GGCGAGGGCT TCGCGGGCGA ACTGCCGCGT CTCGCGCCGG TCGCGCTCGT GACGCGCAAC
GGCCGCCCGG GTATCGAGCG GCTGTCGTTC AAGCAGCAGA CGGCGTGGTT CCTGCCCGAT
CGCGGCATCG GCGTGCTGTG GTGGAACGGC GCGGTCGCGC TCGATTTCCT GCTCGACGAC
AGCCCGACGA TGCTCGTCAC CGCATTCAAG GACGAAGCCG AGCGGATCGA CATCGACGCG
CTGATGAAGT TCGCCGATCA GCGTGCCGAC CTGAACTGCA CCGATCCGCT GCAGCAGGCG
GATCACGAAC TGATGCCCGC GATTACGAGG GGCTGGACCT GGGAGATGAT CCTCGACACG
GAAGACCACC CGCGTTTCGC TCCGGCGCCG CGCGGCTATG AAGAAGTCCG TGCGCGGGTC
GAGCAGAATC GCCGCGAGTT GGTCGAGGCG CGCGATGCGA GCGAGCGGCT GTCGGCGTTC
GAGGAAGCGA ACCGCAACGC GAAGCTGCCG GGCGCGCCGC GCGGCGGCGA GAACTGGCGC
ACGCGGCTGC GTCAGGCGAA GACGCCCGAG CTCGCGAACG TGACGATTCG CGACGCCGAT
CTGTCGTCGC TGCGCTTTGA CGGCTGGAAG TTCGACGACG TGCGCTTCGA GCGCTGCACG
CTCGATCGCA GCGAATGGAC GAACTGCCGG CTCAATCAGG TGCATGCGGT CGACTGCTCG
TTCGCCGACG TCAAGATGAG CGACGGCTGG TGGAAGGGCG GCAAGATCCA GCGCTGCAAT
CTCGAACGCA GCGCGTGGTT GAACGTCGAG ATCGAGCGGA TCTCGCTCGA CGAATGCCGG
CTCGACGATC TGAAGGTGGC GGGCGGATCG TGGTCGATGC TGTCGGTGCA GGGCCGCGGC
GGCGTGCGCG GCGACGTTCA GGATGTCCAA TGGAATTCGG TGTCGTGGTC CGAGGTGAGC
GCGCCCGGCT GGACCTGGAC CCGCGTGCGC GCCGACGATC TCGCGATCGT CGAATGCGCA
ATGGCGGGCC TCGCGGTATC GCAGTGCACG CTCGCGAAGC CGAGCATCCT GCTCACCGAC
CTGTCCGCGA GCGTCTGGCA GCGCAGCATG CTGACGTTCG CGGTGCTGTC GCACGGCACG
TCGATCAACG GCGCGCGGCT CACCGATTGC GTGTTCAAGT CGTCGAGCCT GCAGGAGCTG
CGTGCGGATC GGGTTCAGGT CGATCACTGC TCGTTCATGC AATTGAACGC GCAGCATCTG
CACGCGCAGC AGTCGCATTG GAGCCGCACG GTGCTCGACG GCGCGAACGT GATGCATGCG
CAACTGACGG GCACGTCGTT CGACCGCTGC TCGCTGAAGG AGGCGATGTT CTATGGCGCC
GACATGCGGC AGACGCGCAT GCGCGACTGC AATCTCGTCA GGGTCCGCAC GTCGTGGATC
CATCCGCCGG AAGCGGGCGC GTGGCGCGGC AATCTGAGCG CCGGCCAGCT CGACGTGCCG
AGGAGGGTGT GA
 
Protein sequence
MGWPADVDLR FFQQAAPDQW ARGECWTPGA RFELSGFGPR GEGFAGELPR LAPVALVTRN 
GRPGIERLSF KQQTAWFLPD RGIGVLWWNG AVALDFLLDD SPTMLVTAFK DEAERIDIDA
LMKFADQRAD LNCTDPLQQA DHELMPAITR GWTWEMILDT EDHPRFAPAP RGYEEVRARV
EQNRRELVEA RDASERLSAF EEANRNAKLP GAPRGGENWR TRLRQAKTPE LANVTIRDAD
LSSLRFDGWK FDDVRFERCT LDRSEWTNCR LNQVHAVDCS FADVKMSDGW WKGGKIQRCN
LERSAWLNVE IERISLDECR LDDLKVAGGS WSMLSVQGRG GVRGDVQDVQ WNSVSWSEVS
APGWTWTRVR ADDLAIVECA MAGLAVSQCT LAKPSILLTD LSASVWQRSM LTFAVLSHGT
SINGARLTDC VFKSSSLQEL RADRVQVDHC SFMQLNAQHL HAQQSHWSRT VLDGANVMHA
QLTGTSFDRC SLKEAMFYGA DMRQTRMRDC NLVRVRTSWI HPPEAGAWRG NLSAGQLDVP
RRV