Gene BURPS668_A2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2025 
Symbol 
ID4888606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1956992 
End bp1957990 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content71% 
IMG OID640131963 
ProductAraC family transcriptional regulator 
Protein accessionYP_001063020 
Protein GI126443878 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGTCCG CCGCCGCTGC CGCACCCGCC GCCTGCGCCG AGCCGGTTCC CTCGGTCGCG 
CATTTCGGGT TCCTGACGTT GCCGAATTTC TCGATGATCG CGTTCACGAG CGCGGTCGAG
GTGCTGCGCA TGGCGAACTA CGTCGCGCGC GCGGACCATT ACCGCTGGTC GATCTTCTCG
CTCGACGGCG CGCCCGTGCG CGCGAGCAAC GGCATCGCGG TGCGGCCGAC GCGGCCGCTC
GACGTCGACG ATCCGCCGGA CGTGGTGATC GTCTGCGGCG GCATCCGGAT TCGCGAGGCG
GTGGACGAGC GGGTGCGCGA CGCGCTCGGC GCGCTCGCCG CGCGCGATGT GCCGCTCGGC
GGCATCTGCA CGGGCGCGTA TGCGCTGATG GCGTGCGGGC TGCTCGACGG CTACCGCTGC
GCGGTGCACT GGGAGAACCT GTCCGCGCTG CACGCGGAGT TTCCGCGGGT GCGCTTCGCC
GACGAGCTGT TCGCCGTCGA TCGCGACCGG CTCACCTGCA CGGGCGGCAC CGCGCCGCTC
GACCTGATGC TGAACCTCGT CGGCGCGCGG CTCGGGCAGC CGCTCGCCGC GCAGGTCTCC
GAGCAGTTCA TTCTCGAGCG CATCCGCGGC GCGACCGATC CGCAGCCGAT TCCGGTCGAC
GCGCGCGTCG GCTTCTCGCG CGCGGAGCTG ATCGAGGTCG TGCGGCTGAT GGAGGCGAAC
ATCGAGGAGC CGCTGTCGCT CGAGGAACTC GCGCGGCTCG TGCGGCTGTC GCAGCGGCAC
CTGCAGCGGA TGTTCAAGAT CTATCTGAAC GTATCGCCCA CGCACTACTA CCTGACGCTG
CGCCTGAAGC GCGCGCGCGA CCTGCTGCGC ACCACCGACG CATCGATCGC GCGCGTGACG
GCGGTCTGCG GCTTTCATTC GCCGTGCCAT TTCAGCAAGG CGTACCGCGC GCAGTTCGGC
CATGCGCCGA GCCACGAGCG GCGCGTATCG GCGCGCTGA
 
Protein sequence
MTSAAAAAPA ACAEPVPSVA HFGFLTLPNF SMIAFTSAVE VLRMANYVAR ADHYRWSIFS 
LDGAPVRASN GIAVRPTRPL DVDDPPDVVI VCGGIRIREA VDERVRDALG ALAARDVPLG
GICTGAYALM ACGLLDGYRC AVHWENLSAL HAEFPRVRFA DELFAVDRDR LTCTGGTAPL
DLMLNLVGAR LGQPLAAQVS EQFILERIRG ATDPQPIPVD ARVGFSRAEL IEVVRLMEAN
IEEPLSLEEL ARLVRLSQRH LQRMFKIYLN VSPTHYYLTL RLKRARDLLR TTDASIARVT
AVCGFHSPCH FSKAYRAQFG HAPSHERRVS AR