Gene BMA10229_0220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMA10229_0220 
Symbol 
ID4789866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei NCTC 10229 
KingdomBacteria 
Replicon accessionNC_008835 
Strand
Start bp205346 
End bp206644 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content68% 
IMG OID 
ProductAraC family transcriptional regulator 
Protein accessionYP_001024045 
Protein GI124382669 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.15358 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCCC GATATCCCGG CGATTCGAAT TATCGATTCG CACAGATTTT GTGCTTTAAC 
GCGGAAACGT ATTTTCGTGG TGCATCGCCC TGCACAAGAA AATATCGCGC ACGGCGATCC
CGATTCGCGG TGCGCGCGCC TCAGGCTCGC GCGAGCGAAC CGCGCGCGGC GGTGCCGGAC
GCCCATTTCG CATCGCCGCG GCGATTCCCA CTAGAATGGT GTCCGTCGGC GCACGGCGCG
CGAGCGGCGG CGACCGGCGC GGCCGCGCGG CACGGTATGA AACTTGCGTC TCGTGGATGG
CGCGCGCCCC GCGCACGCGT GCCGACGACA TCGAAACACC CGCCGGCGTG CGCGGCGCCC
AGCACGCGCA CGGAGCGGCC TCACCCGCCG CCGGCTGTTT CGTTTCATCG GAATCGGGGT
TCGACCGTGG CCAAGCTAGA CCATCGCAAC CAGTCGCGTT ACTGGCACTC TCCCGGCATT
TCAGGGGTCG ATCTGTTGCT CGCCGACTTC ACGACGCACG ACTACGCGCC GCACGTGCAC
GATTCGCTTG TCGTCGCCGT CACGGAAGTC GGCGGTTCGG TGTTCAAGAG CCGCGGGCAG
ACGCGCCTCG CCGAGCCGAA CGCCGTGCTC GTGTTCAATC CGTGCGAGCC GCATTCGGGG
CGCATGGGCG GCAGCAGCCG CTGGCGCTAC CGGTCGTTCT ACCTCGCGGA AGCGGGCCTT
TCCCGCGTGC TGACGTTGCT CGGCATGGCG CAGCCGCGCT TTTTCACGTC GAACGTGCTC
GACGATCCTC AGCTCGTCGA ACAGTTTCTC ACCCTGCACC GCGCGATGGA CGAGCAGGAC
GATCTGCTGC GGCAGCAGGA ACTGCTCGTC AGCAGCTTCG GCACGCTGTT TTCGCGGCAC
GGGCTCCAGG CCGGGCTCGG CGCCGGCCCC GGCTTCGGCA CGAAGGCGGG CCTGCCGGCG
CTCAAGCCCG CGCTCGATCT GATGAACGAT TGCTTCGACC ACGCGCTCAC CCTCGAGCAG
ATCGCGGCGG CGGCGGGCCT CACGTCGTTC CAGCTGATCA CCGCGTTCAA CCGCGTGATC
GGCCTCACAC CGCACGCGTA CCTGAACCAG TTGAGGTTGC GCGCGGCGCT GCGCGAGCTG
CAGGCCGGCC GCTCGCTCGC CGACGCCGCG CTGACATCGG GCTTCTACGA TCAAAGCGCG
CTTTGCAACC ACTTCAAGCG CACGTTCGGG ATGACGCCGA TGCAGTACAC GCGCGCGCTC
GCGCCCGGCA AGCGCCCGCT CGCGCCGATC GGAATCTGA
 
Protein sequence
MDARYPGDSN YRFAQILCFN AETYFRGASP CTRKYRARRS RFAVRAPQAR ASEPRAAVPD 
AHFASPRRFP LEWCPSAHGA RAAATGAAAR HGMKLASRGW RAPRARVPTT SKHPPACAAP
STRTERPHPP PAVSFHRNRG STVAKLDHRN QSRYWHSPGI SGVDLLLADF TTHDYAPHVH
DSLVVAVTEV GGSVFKSRGQ TRLAEPNAVL VFNPCEPHSG RMGGSSRWRY RSFYLAEAGL
SRVLTLLGMA QPRFFTSNVL DDPQLVEQFL TLHRAMDEQD DLLRQQELLV SSFGTLFSRH
GLQAGLGAGP GFGTKAGLPA LKPALDLMND CFDHALTLEQ IAAAAGLTSF QLITAFNRVI
GLTPHAYLNQ LRLRAALREL QAGRSLADAA LTSGFYDQSA LCNHFKRTFG MTPMQYTRAL
APGKRPLAPI GI