Gene BURPS1106A_A1931 details

Gene Information

Locus tag: BURPS1106A_A1931
Symbol:
ID: 4904145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 1891054
End bp: 1892052
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 71%
IMG OID: 640145037
Product: AraC family transcriptional regulator
Protein accession: YP_001075965
Protein GI: 126456291
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
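
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a single terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(1891054, 1892052))  # (999, 332)
```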


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.746291
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACGTCCG CCGCCGCTGC CGCACCCGCC GCCTGCGCCG AGCCGGTTCC CTCGGTCGCG 
CATTTCGGGT TCCTGACGTT GCCGAATTTC TCGATGATCG CGTTCACGAG CGCGGTCGAG
GTGCTGCGCA TGGCGAACTA CGTCGCGCGC GCGGACCATT ACCGCTGGTC GATCTTCTCG
CTCGACGGCG CGCCCGTGCG CGCGAGCAAC GGCATCGCGG TGCGGCCGAC GCAGCCGCTC
GACGTCGACG ATCCGCCGGA CGTGGTGATC GTCTGCGGCG GCATCCGGAT TCGCGAGGCG
GTGGACGAGC GGGTGCGCGA CGCGCTCGGC GCGCTCGCCG CGCGCGATGT GCCGCTCGGC
GGCATCTGCA CGGGCGCGTA TGCGCTGATG GCGTGCGGGC TGCTCGACGG CTACCGCTGC
GCGGTGCACT GGGAGAACCT GTCCGCGCTG CACGCGGAGT TTCCGCGGGT GCGCTTCGCC
GACGAGCTGT TCGCCGTCGA TCGCGACCGG CTCACCTGCA CGGGCGGCAC CGCGCCGCTC
GACCTGATGC TGAACCTCGT CGGCGCGCGG CTCGGGCAGC CGCTCGCCGC GCAGGTCTCC
GAGCAGTTCA TTCTCGAGCG CATCCGCGGC GCGACCGATC CGCAGCCGAT TCCGGTCGAC
GCGCGCGTCG GCTTCTCGCG CGCGGAGCTG ATCGAGGTCG TGCGGCTGAT GGAGGCGAAC
ATCGAGGAGC CGCTGTCGCT CGAGGAACTC GCGCGGCTCG TGCGGCTGTC GCAGCGGCAC
CTGCAGCGGA TGTTCAAGAT CTATCTGAAC GTATCGCCCA CGCACTACTA CCTGACGCTG
CGCCTGAAGC GCGCGCGCGA CCTGCTGCGC ACCACCGACG CATCGATCGC GCGCGTGACG
GCGGTCTGCG GCTTTCATTC GCCGTGCCAT TTCAGCAAGG CGTACCGCGC GCAGTTCGGC
CATGCGCCGA GCCACGAGCG GCGCGTATCG GCGCGCTGA
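
The GC content reported in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text in the blocked layout shown above (groups of ten bases separated by spaces and line breaks); the helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Return the G+C fraction of a nucleotide sequence.

    Whitespace (the spaces and line breaks of the blocked layout) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first row of the gene sequence only; pass the full
# sequence text to obtain the gene-wide value.
first_row = "GTGACGTCCG CCGCCGCTGC CGCACCCGCC GCCTGCGCCG AGCCGGTTCC CTCGGTCGCG"
print(f"{gc_content(first_row):.0%}")
```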
 
Protein sequence
MTSAAAAAPA ACAEPVPSVA HFGFLTLPNF SMIAFTSAVE VLRMANYVAR ADHYRWSIFS 
LDGAPVRASN GIAVRPTQPL DVDDPPDVVI VCGGIRIREA VDERVRDALG ALAARDVPLG
GICTGAYALM ACGLLDGYRC AVHWENLSAL HAEFPRVRFA DELFAVDRDR LTCTGGTAPL
DLMLNLVGAR LGQPLAAQVS EQFILERIRG ATDPQPIPVD ARVGFSRAEL IEVVRLMEAN
IEEPLSLEEL ARLVRLSQRH LQRMFKIYLN VSPTHYYLTL RLKRARDLLR TTDASIARVT
AVCGFHSPCH FSKAYRAQFG HAPSHERRVS AR
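
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, not the database's own pipeline. It assumes the first codon is a valid start codon and is therefore read as methionine (GTG is one of the alternative start codons in table 11), and that translation stops at the first stop codon. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code; it differs in
# which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a CDS given as text, ignoring whitespace between blocks.

    Assumption: the first codon is a start codon and is emitted as 'M'.
    Translation ends at the first stop codon, which is not emitted.
    """
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACGTCCG CCGCCGCTGC CGCACCCGCC"))  # MTSAAAAAPA
```

The printed residues match the first block of the protein sequence, where the leading GTG codon appears as M.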