Gene BMAA1914 details

Gene Information

Locus tag: BMAA1914
Symbol:
ID: 3086588
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei ATCC 23344
Kingdom: Bacteria
Replicon accession: NC_006349
Strand:
Start bp: 2098163
End bp: 2099443
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 71%
IMG OID: 637565784
Product: hypothetical protein
Protein accession: YP_106445
Protein GI: 53715977
COG category: [N] Cell motility
              [S] Function unknown
COG ID: [COG1360] Flagellar motor protein
        [COG3455] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03349] type IV / VI secretion system protein, DotU family
            [TIGR03350] type VI secretion system OmpA/MotB family protein
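
The Start bp, End bp, Gene Length and Protein Length values in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2098163, 2099443)
print(length, protein_length_aa(length))  # prints: 1281 426
```

Under those assumptions the coordinates reproduce the listed 1281 bp gene length and 426 aa protein length.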


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCTCAG AAACCGTGCA TCATCCGGAC ATATCGGCTC CGGCCCCGAC CTTCGATTCC 
GTCGCGGCGA CGCTCGCGCG GCGCGAGCCG GCGCCTGCGC CGGCCGGCGA GCCGCCCGCC
GCGCGCCTCG CCGCGATCAG GCTCGCCCGC AACCCGCTGC TCGAAGCCGC GCGCGTGCTG
CTGCGGGCGC TCGCCGACAT GCCCGAGCGG CTCGATCGCG ACGACATTCC GCAATTGCGA
CTGCTGCTGG AACAGGAGGT GCGCCTGTTC CAGCGGCTCT GCGAACAGGC GAACATCCGG
CGCGACCACA TGCTCGGCGC GCGCTACTGC CTGTGCACCG CGCTCGACGA GGCGGCGATG
CAGACGTCGT GGGCACAATC GGCGAGCGGC AATCTCGGCA CGTGGATCAG CGAGGGGCTC
GCGACGTCGT TCCACGAGGA TCGCCAGGGC GGAGACAAGG TCTATCTGCT GATCGGCCGG
CTGATGAATT CGCCGCACGA GCACATCGAC CTGCTCGAAG TCATCTATCG AATCTTGAGC
CTCGGCTTCG AGGGCCGCTA CCGTTACGAA GCCGACGGCC AGCGCAAGCA CGAGACCGTG
CGCCAGCGGC TCTACAACGA GATCGCATCG CAGCGCGGGC CGGTGTCGGT CGCGCTGTCG
CCGCACTGGC AGCCCGGCCC CCGCAACAGG AGCGCGCCGT TTCGCGATTT CCCCGCGTGG
GTCACGGCCG CCGTGCTGTC GCTGATCGCG CTCGGGCTGT TCGGCTGCTT CAAGTACGCG
CTGTCGACGC GCAGCGCCGA CGTGCAGCAG CGGATCGCCG CGATCGCGCG GATGGCGCCG
CCCGCCGCGC CGGCCGAGCT GCGCCTCGCG ACGCTGCTCG CCGGCGAGAT CGCGGCAGGC
ACGCTCAGCG TCGAGGAAAA CGCGCGCCGC AGCTCGGTGA CGTTCCGCGG CGACGCGATG
TTCGCGCCGG GCGCGGCCGG CGTGAACCCG GCGATGGGGC CGCTCATCCG GAAAATCGCG
GCCGAGATCG CGAGGGTGCC GGGCAAGGTG ACGGTGCGCG GCTACACCGA CAATCAGCCG
ATCAAAAGCC GCCAGTTCGC GTCGAACGAG GCGCTATCCG AAGAGCGCGC GACGCAGGTC
ATGCAGATGC TCCAGAGCGC GGGCGTGCCC GCGAGCCGCC TCGAGGCGCT CGGCAAGGGC
GGCGCCGAGC CGATCGGCGA CAACCGGACC CCGCAGGGCC GCGCGCTGAA CCGCCGCGTC
GAAATCACGG TCGCGCGCTG A
 
Protein sequence
MTSETVHHPD ISAPAPTFDS VAATLARREP APAPAGEPPA ARLAAIRLAR NPLLEAARVL 
LRALADMPER LDRDDIPQLR LLLEQEVRLF QRLCEQANIR RDHMLGARYC LCTALDEAAM
QTSWAQSASG NLGTWISEGL ATSFHEDRQG GDKVYLLIGR LMNSPHEHID LLEVIYRILS
LGFEGRYRYE ADGQRKHETV RQRLYNEIAS QRGPVSVALS PHWQPGPRNR SAPFRDFPAW
VTAAVLSLIA LGLFGCFKYA LSTRSADVQQ RIAAIARMAP PAAPAELRLA TLLAGEIAAG
TLSVEENARR SSVTFRGDAM FAPGAAGVNP AMGPLIRKIA AEIARVPGKV TVRGYTDNQP
IKSRQFASNE ALSEERATQV MQMLQSAGVP ASRLEALGKG GAEPIGDNRT PQGRALNRRV
EITVAR
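
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that the first codon is the initiator and is read as methionine (which is why the GTG start appears as M in the protein), and it relies on the fact that translation table 11 assigns the same residues to codons as the standard code. The helper names `clean` and `translate_cds` are hypothetical, chosen for this example only.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-residue assignments as the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is the initiator and is emitted as 'M',
    even when it is an alternative start such as GTG.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append("M" if i == 0 else aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate_cds("GTGACCTCAG AAACCGTGCA TCATCCGGAC"))  # prints: MTSETVHHPD
```

The output matches the first ten residues of the protein sequence listed above.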