Gene BURPS668_A2667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2667 
Symbol 
ID4886238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2559078 
End bp2560202 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content74% 
IMG OID640132603 
ProductDarR 
Protein accessionYP_001063659 
Protein GI126442538 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTCG CGCCGGCTCG CGCCGACCGC ACCGGCGCGA TGGACAAACC GACTGTACGC 
AATTACCGTA TGGCGGATAT GCCAAAGTCC CCACGGTTCC CGCCAACGCC ATCGGCGGCA
GCGCCGACGG CGGCCGCGCC CGCCGCCGCG CGCCGCGCGG TGCACGTGCT CGCGTTCGAC
GATGTGCAGT TGCTCGACGT CACCGGGCCG CTGCAAGTGT TCGCGAGCGC GAACGATTTC
GCCGCGCGCC GCGGGCTCGC GATTCCGTAC GCGCCGCGCG TCGTCGCCGC CCACGCGCCT
TCGGTGATGT CGTCGGCCGG GCTCGCGTTC GCCGCCGCGC CGCTGCCCGC CGCGCGCGAG
CCGTCCGATA CGCTGATCGT CGCGGGCGGC TGCGGCGTCC ACGGCGCGGC GCGCGATCCG
CGGCTCGTCG ACTGGGTGCG CCGGCGCGCG GCGCACGCGC GGCGCATCGC GTCGGTGTGC
TCGGGCGCGT TCGTGCTCGC GGCGGCGGGG CTGCTGGGCG GACGCCGCGT CGCCACGCAC
TGGTCGCGCT GCGACGAGCT CGCGCAACGC TATCCCGACG TGCGCGTCGA GCCCGATCCC
ATTTTCATCC GCGACGGCAA CGTCTGGACG TCGGCAGGCG TCACGGCCGG CATCGATCTC
GCGCTCGCGC TCGTCGAGGA CGACCTCGGC CGCGCGCTGG CGCTCGACGT CGCGCGGTAT
CTCGTCGTGT TTCTGAAGCG CCCGGGCGGC CAGGCGCAAT TCAGCGCCGC GCTGTCGCTG
CAGCACGAGG GCGGCTGCTT CGACGAACTG CACGCATGGG CGGCCGCGAA TCTCGGCGCG
GACTTGTCGG TCGCGGCGCT CGCCGCGCGC GCCGGCATGA GCGAGCGCAG TTTCATGCGC
CGCTACCGCG AAGCGACCGG CAGGACGCCC GCGCGGGCGA TCGAGCAGAT GCGCGTCGAA
GCCGCGCGCA ACCTGCTCGC CGACGCACCG CTGCCGATCA AGCGGATCGC CGCGCGCTGC
GGATTCGGCA GCGAGGAAAC GATGCGCCGC AGTTTCCTGC GCATGCTCGG CGTGGCACCG
CAGGCCTATC GCGAGCGGTT CGCGACGAAT CGGCGAGGCG TCTGA
 
Protein sequence
MSVAPARADR TGAMDKPTVR NYRMADMPKS PRFPPTPSAA APTAAAPAAA RRAVHVLAFD 
DVQLLDVTGP LQVFASANDF AARRGLAIPY APRVVAAHAP SVMSSAGLAF AAAPLPAARE
PSDTLIVAGG CGVHGAARDP RLVDWVRRRA AHARRIASVC SGAFVLAAAG LLGGRRVATH
WSRCDELAQR YPDVRVEPDP IFIRDGNVWT SAGVTAGIDL ALALVEDDLG RALALDVARY
LVVFLKRPGG QAQFSAALSL QHEGGCFDEL HAWAAANLGA DLSVAALAAR AGMSERSFMR
RYREATGRTP ARAIEQMRVE AARNLLADAP LPIKRIAARC GFGSEETMRR SFLRMLGVAP
QAYRERFATN RRGV