Gene BURPS1710b_A0236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0236 
Symbol 
ID3694216 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp350827 
End bp351846 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content69% 
IMG OID637730490 
ProductAraC family transcriptional regulator 
Protein accessionYP_335395 
Protein GI76817450 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCAG ACCTCGAGAT CGTCCCCACC CGCCGCGACG AATCGTTTCG CGCATGGTCG 
CACGACTATC CGCACACGGT CGCGAAATGG CATTTTCATC CGGAGTACGA AATCCACCTG
ATTCAGGGTT CGCGCGGCAA GTTCTTCGTC GGCGACCATA TCGGCGATTT CGCGCCCGGC
AACCTCGTCG TCACCGGGCC GAACCTGCCG CACAACTGGA TCAGCGAGCT CGGCCCCGGC
GAGCGCGTGC CGTCGCGCGA CGTCGTGCTG CAGTTCTCGC GCGACGCGGC CGAGAAGATG
GTGGCCGCGT TCGCCGAGCT GCAGCCGGTG CTCGACCTGA TCGACGAAGC GTCGCGCGGC
GTGCAGTTTC CGGACGAGAT CGGGCTCGCC GTCGCGCCGC TGATGCTCGA GCTCGCGAGC
GCGCACGGCT GCCGGCGCGT CGAGGTGCTG ATGGCGCTGT TCGACCGGCT GGCGTCGTGC
GCCGCGCGTC GCACGCTCGC CGGCCCCGGC TACCGGATCG ACGCGCAGCA CTACATGTCG
TCGACGATCA ACCAGGTGCT CGCGTACCTG CGGCAGAACC TGCCGGGCGC GCTACGCGAG
GCGGACGTCG CCGAATTCGC CGGCATGAGC GTGAGCACGT TCACGCGCTT CTTCCGCCGG
CACACGGGCT CGACGTTCGT CCAGTATCTG AACCGGCTGC GGATCAACGA AGCGTGCGAG
CTGCTGATGT GCTCGGCGCT CAGCGTCACC GACATCTGCT ACCGCATCGG CTTCAACAAC
CTGTCGAACT TCAACCGGCA ATTCCTCGCG ATGAAGGGGA TGCCGCCGTC GCGCTTTCGC
GCGCTGCATC GGTTGAACGA GCCGCATGAC GCGCCCGAAC CGCACGAGCC GCACGAGCCG
CACGCGTCGC TCGCGCCGGC CGCCGCGCCC GCGGCCCCGG GCGCGGCGGC CCGGCCCCCC
GAGCGCGCCG CACCCACCGC GCGCGCCGTC ATCCATTCGC ACCGGAGCCT CCACCCGTGA
 
Protein sequence
MNPDLEIVPT RRDESFRAWS HDYPHTVAKW HFHPEYEIHL IQGSRGKFFV GDHIGDFAPG 
NLVVTGPNLP HNWISELGPG ERVPSRDVVL QFSRDAAEKM VAAFAELQPV LDLIDEASRG
VQFPDEIGLA VAPLMLELAS AHGCRRVEVL MALFDRLASC AARRTLAGPG YRIDAQHYMS
STINQVLAYL RQNLPGALRE ADVAEFAGMS VSTFTRFFRR HTGSTFVQYL NRLRINEACE
LLMCSALSVT DICYRIGFNN LSNFNRQFLA MKGMPPSRFR ALHRLNEPHD APEPHEPHEP
HASLAPAAAP AAPGAAARPP ERAAPTARAV IHSHRSLHP