Gene BMAA1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMAA1103 
Symbol 
ID3086934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei ATCC 23344 
KingdomBacteria 
Replicon accessionNC_006349 
Strand
Start bp1149975 
End bp1151030 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content72% 
IMG OID637565004 
ProductLysR family transcriptional regulator 
Protein accessionYP_105766 
Protein GI53716728 
COG category[K] Transcription 
COG ID[COG0583] Transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.130627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCGAT TTCAGGAAAT GCAGGTTTTC GTGCGGATCG CCGAGCGGCA GAGCTTCAGC 
CGGGCGTCGG ACGATCTGCG GATTCCGCGC GCGACCGTGA CCAACCTGAT GAAGCGCATG
GAGGCGCGGC TCGGCGCGCG GCTGCTCGAA CGGACGACGC GCACCGTGTG CCTCACGCAG
GACGGCGAAG CCTACTACCG GCGCTGCGTG CGGCTGATCG CCGATCTGGA GGAGGCCGAG
GGCGCGTTTC GCGCCGCGGC GCCGCGGGGG CTGCTGCGCG TGAACCTGCA GGGCACGCTC
GCGCGCTATT TCGTCGTGCC CGCGCTGCCG GATTTTCTCG CGCGCTATCC GGGGATCCGG
CTGCACATCG GCGAGGACGA CCGCTTCGTC GATCTGGTGC GCGAGGGCGT CGATTGCGTG
CTGCGCTCGG GCAACCTGCA GGATTCGTCG ATGGTCGGGC GGCGGGTCGC GCAGCTCGAG
CAGGTGACGG TCGCGAGCCC CGGCTATCTC GCGCGGCACG GCGAGCCGGC CGAGCTCGCC
GCGCTGGCCG CGCATCGCGC GGTCGACTAC GTGTCGAGCG CGACGGGCAA GCCGATGCCG
CTCGAATTCA CCGTCGACGG GCGCGTGACC GAGGTGCGGC TCGACGCGGC GATTTCCGTC
GCGGGCGTCG AGCTCTACAC GGGCGCGGCC GTCGCGGGGC TCGGCATCGT GCAGGTGCCG
CGCTACCGGA TCGCCGACGA ACTGGCCGAC GGACGCCTGA GGATCGTGCT CGGCGCGTAT
CCGCCGCCGC CGATGCCCGT CAGCGTGCTG TATCCGCACA GCCGGCAGTT GTCGTCGCGC
GTGCGGGCGT TCGCGCAGTG GCTGCGGGAG CGGTTCGACG CGGCGCAGGC GGGGCGGGCG
ACGGCGCGTG CGGCGCGCCG GGCTCTTCGG GTTCCGCCGC GCGCGCGCCG TCGCCGGCTC
GGTCGGAACC GGCGAAAGCG TTGCTATCGC GGCGGCCGCG TCGCTAGATT CGTCTTCGTT
GTCGCGAACA CGCCCGATGC GGGGGGCACG CGATGA
 
Protein sequence
MDRFQEMQVF VRIAERQSFS RASDDLRIPR ATVTNLMKRM EARLGARLLE RTTRTVCLTQ 
DGEAYYRRCV RLIADLEEAE GAFRAAAPRG LLRVNLQGTL ARYFVVPALP DFLARYPGIR
LHIGEDDRFV DLVREGVDCV LRSGNLQDSS MVGRRVAQLE QVTVASPGYL ARHGEPAELA
ALAAHRAVDY VSSATGKPMP LEFTVDGRVT EVRLDAAISV AGVELYTGAA VAGLGIVQVP
RYRIADELAD GRLRIVLGAY PPPPMPVSVL YPHSRQLSSR VRAFAQWLRE RFDAAQAGRA
TARAARRALR VPPRARRRRL GRNRRKRCYR GGRVARFVFV VANTPDAGGT R