Gene BMASAVP1_0424 details

Gene Information

Locus tag: BMASAVP1_0424
Symbol:
ID: 4678170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei SAVP1
Kingdom: Bacteria
Replicon accession: NC_008784
Strand:
Start bp: 434886
End bp: 436184
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 68%
IMG OID: 639842951
Product: AraC family transcriptional regulator
Protein accession: YP_990034
Protein GI: 121597158
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
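
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 434886
END_BP = 436184
GENE_LENGTH_BP = 1299
PROTEIN_LENGTH_AA = 432


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_bp = span_length(START_BP, END_BP)
computed_aa = coding_length_aa(computed_bp)

print("gene length matches record:", computed_bp == GENE_LENGTH_BP)
print("protein length matches record:", computed_aa == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.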


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGCCC GATATCCCGG CGATTCGAAT TATCGATTCG CACAGATTTT GTGCTTTAAC 
GCGGAAACGT ATTTTCGTGG TGCATCGCCC TGCACAAGAA AATATCGCGC ACGGCGATCC
CGATTCGCGG TGCGCGCGCC TCAGGCTCGC GCGAGCGAAC CGCGCGCGGC GGTGCCGGAC
GCCCATTTCG CATCGCCGCG GCGATTCCCA CTAGAATGGT GTCCGTCGGC GCACGGCGCG
CGAGCGGCGG CGACCGGCGC GGCCGCGCGG CACGGTATGA AACTTGCGTC TCGTGGATGG
CGCGCGCCCC GCGCACGCGT GCCGACGACA TCGAAACACC CGCCGGCGTG CGCGGCGCCC
AGCACGCGCA CGGAGCGGCC TCACCCGCCG CCGGCTGTTT CGTTTCATCG GAATCGGGGT
TCGACCGTGG CCAAGCTAGA CCATCGCAAC CAGTCGCGTT ACTGGCACTC TCCCGGCATT
TCAGGGGTCG ATCTGTTGCT CGCCGACTTC ACGACGCACG ACTACGCGCC GCACGTGCAC
GATTCGCTTG TCGTCGCCGT CACGGAAGTC GGCGGTTCGG TGTTCAAGAG CCGCGGGCAG
ACGCGCCTCG CCGAGCCGAA CGCCGTGCTC GTGTTCAATC CGTGCGAGCC GCATTCGGGG
CGCATGGGCG GCAGCAGCCG CTGGCGCTAC CGGTCGTTCT ACCTCGCGGA AGCGGGCCTT
TCCCGCGTGC TGACGTTGCT CGGCATGGCG CAGCCGCGCT TTTTCACGTC GAACGTGCTC
GACGATCCTC AGCTCGTCGA ACAGTTTCTC ACCCTGCACC GCGCGATGGA CGAGCAGGAC
GATCTGCTGC GGCAGCAGGA ACTGCTCGTC AGCAGCTTCG GCACGCTGTT TTCGCGGCAC
GGGCTCCAGG CCGGGCTCGG CGCCGGCCCC GGCTTCGGCA CGAAGGCGGG CCTGCCGGCG
CTCAAGCCCG CGCTCGATCT GATGAACGAT TGCTTCGACC ACGCGCTCAC CCTCGAGCAG
ATCGCGGCGG CGGCGGGCCT CACGTCGTTC CAGCTGATCA CCGCGTTCAA CCGCGTGATC
GGCCTCACAC CGCACGCGTA CCTGAACCAG TTGAGGTTGC GCGCGGCGCT GCGCGAGCTG
CAGGCCGGCC GCTCGCTCGC CGACGCCGCG CTGACATCGG GCTTCTACGA TCAAAGCGCG
CTTTGCAACC ACTTCAAGCG CACGTTCGGG ATGACGCCGA TGCAGTACAC GCGCGCGCTC
GCGCCCGGCA AGCGCGCGCT CGCGCCGATC GGAATCTGA
 
Protein sequence
MDARYPGDSN YRFAQILCFN AETYFRGASP CTRKYRARRS RFAVRAPQAR ASEPRAAVPD 
AHFASPRRFP LEWCPSAHGA RAAATGAAAR HGMKLASRGW RAPRARVPTT SKHPPACAAP
STRTERPHPP PAVSFHRNRG STVAKLDHRN QSRYWHSPGI SGVDLLLADF TTHDYAPHVH
DSLVVAVTEV GGSVFKSRGQ TRLAEPNAVL VFNPCEPHSG RMGGSSRWRY RSFYLAEAGL
SRVLTLLGMA QPRFFTSNVL DDPQLVEQFL TLHRAMDEQD DLLRQQELLV SSFGTLFSRH
GLQAGLGAGP GFGTKAGLPA LKPALDLMND CFDHALTLEQ IAAAAGLTSF QLITAFNRVI
GLTPHAYLNQ LRLRAALREL QAGRSLADAA LTSGFYDQSA LCNHFKRTFG MTPMQYTRAL
APGKRALAPI GI
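
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (table 11 differs mainly in which codons may act as start codons, which this sketch does not handle). The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers, not part of any database tool. Only the first 30 bases and the final line of the gene sequence are used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate in frame 1; optionally stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence.
first_blocks = "ATGGACGCCC GATATCCCGG CGATTCGAAT"
last_line = "GCGCCCGGCA AGCGCGCGCT CGCGCCGATC GGAATCTGA"

print(translate(first_blocks))  # MDARYPGDSN
print(translate(last_line))     # APGKRALAPIGI
```

The two outputs correspond to the first block of the protein sequence and to its final line. The final line can be translated on its own because the 1260 bases preceding it are a multiple of three, so it begins in frame.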