Gene BMASAVP1_A1004 details


Gene Information

Locus tag: BMASAVP1_A1004
Symbol:
ID: 4679806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei SAVP1
Kingdom: Bacteria
Replicon accession: NC_008785
Strand:
Start bp: 985207
End bp: 986661
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 70%
IMG OID: 639845278
Product: GntR family transcriptional regulator
Protein accession: YP_992344
Protein GI: 121599547
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
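
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon. The function names are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information fields above.
START_BP = 985207
END_BP = 986661
GENE_LENGTH_BP = 1455
PROTEIN_LENGTH_AA = 484


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming a complete CDS ending in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


print(span_length(START_BP, END_BP))            # 1455
print(expected_protein_length(GENE_LENGTH_BP))  # 484
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields in the record.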


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.315502
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCACCG TCCCGCTTGC GCAGATCCCC GCGCCGCACG ATACCGCGAC GCTCACGCTC 
GTCGATCAGC TCGTGCAATG GGCGCGCCGC CGGATCGACG AGCGCGTGTT CCGGCCCGGC
ATGCGGATGC CGTCGATCCG CAAGCTCGCG CTCGACAAGA GCGTGTCGCG CTTCACGGTC
GTCGAGGCGT ACGAGCGGCT CGTCGCGCAG GGCTATCTCG ATTCGCGGCG CGGCTCCGGC
TTCTACGTGC GCGAGCGCGC GCCCGGGCAG CAGCCGGTGG GCGCATCGGG CGGCGCGCGC
GCGCAGCCCG TGCACAACAC GATCGACGTC GTCTGGCTGC TGCGCAACAT GCTGCACACG
GTCAGCCCGG AAAAGGGGCC GGGGCTCGGC TATCTGCCGA GCCGCTGGCT CGACGGCGAA
CTGATCACGA GCGCGTTGCG CGCGCTCGGC CGGCAATCGG GCGCGCAGAT GCTCGGCTTC
GGCAGCGCGC AGGGCTTCCT GCCGCTGCGG CAGCAACTGC AGACGCGCCT CGCCGAATTC
GAGATCGGCG CGACGCCCGA TCAGCTCGTG CTCGTGTCCG GCATCACGCA GGCGATCGAT
CTGATCGCGC GCCACTGCGT GCGCCCGGGC GACGCGGTGA TCGTCGGCGA TCCGGCCTGG
TTCCAGATGT TCGGGCGCTT CGCGTCGCAG GGCGCGCAGC TCGTCGGGAT GCCGTACACG
CCGGACGGCC CCGATCTCGA CGCGCTCGAG AACCTCGTGC AGATGTGGCG CCCGAAGATG
CTCGTGATCA ACTCGGTGCT GCACAATCCG ACGGGCACGT CGCTGTCGGC CGCGCAGGCG
TTCCGGATCC TGAAGCTCGC GGAGGCGTAC GATTTCCTCG TCGTCGAGGA CGACGTCTAC
GGCGACCTGT GCCCGCCGAG CTATCCGGCG ACGCGCCTGG CGAGCCTCGA CCAGTTAAGG
CGCGTGATCT TCCTCGGCAG CTTCTCGAAG ACGCTCGCCG CGAACCTGCG GGTCGGCTAC
ATCGCGTGCG CGCCGGAACT CGCGAAGGCG CTGACGGATC AGAAAATGCT CGTCGGGATG
ACGACGCCCG AGCTCAACGA GCGCGTGCTG TACAAGGTGC TCACGGAAGG GCACTACCGG
CGCCACGTCG AGCGGTTGCG CGCGCGGCTC GACGGCGTGC GCGACAAGAC CGCGCGGATG
CTCGAGCGCA CCGGCATGCG GCTCTTCACG ATGCCGGCGG CGGGGATGTT CCTGTGGGCC
GACACGGGCG TCGATTCGGA CGCGCTCGCC GCGGCCGCGC ACGAGGAAGG TTTCCTGCTC
ACGCCGGGGA GCCTCTTCTC GCCGCAGCAG TCGCCTTCGA CGTGGACGCG CTTTAACGTC
GCGAACTGCG GCGATCCGGC GCTGCCCGCG TTCCTCGGCC GCTATCTCGA CAGCGTGAAC
CGCCGCGCCT CTTGA
 
Protein sequence
MSTVPLAQIP APHDTATLTL VDQLVQWARR RIDERVFRPG MRMPSIRKLA LDKSVSRFTV 
VEAYERLVAQ GYLDSRRGSG FYVRERAPGQ QPVGASGGAR AQPVHNTIDV VWLLRNMLHT
VSPEKGPGLG YLPSRWLDGE LITSALRALG RQSGAQMLGF GSAQGFLPLR QQLQTRLAEF
EIGATPDQLV LVSGITQAID LIARHCVRPG DAVIVGDPAW FQMFGRFASQ GAQLVGMPYT
PDGPDLDALE NLVQMWRPKM LVINSVLHNP TGTSLSAAQA FRILKLAEAY DFLVVEDDVY
GDLCPPSYPA TRLASLDQLR RVIFLGSFSK TLAANLRVGY IACAPELAKA LTDQKMLVGM
TTPELNERVL YKVLTEGHYR RHVERLRARL DGVRDKTARM LERTGMRLFT MPAAGMFLWA
DTGVDSDALA AAAHEEGFLL TPGSLFSPQQ SPSTWTRFNV ANCGDPALPA FLGRYLDSVN
RRAS
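
The relationship between the gene and protein sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: the helper names are hypothetical, and the codon table is built from the standard NCBI ordering, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in which start codons are permitted). Only the first 30 and last 15 bases of the gene sequence are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon; '*' marks a stop codon."""
    dna = clean(seq)
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3)
    )


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks and final 15 bases of the gene sequence above.
first_30 = "ATGTCCACCG TCCCGCTTGC GCAGATCCCC"
last_15 = "CGCCGCGCCT CTTGA"

print(translate(first_30))  # MSTVPLAQIP
print(translate(last_15))   # RRAS*
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above, followed by the stop codon. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.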