Gene BTH_II1174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1174 
Symbol 
ID3844996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp1372376 
End bp1373368 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content69% 
IMG OID637838477 
ProductAraC family transcriptional regulator 
Protein accessionYP_439371 
Protein GI83716412 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0789379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCAG ACCTCGAAAT CGTCCCCACC CGCCGCGACG AATCGTTTCG CGCGTGGTCG 
CACGACTATC CGCACACGGT CGCGAAGTGG CATTTCCATC CGGAGTACGA AATCCACCTG
ATCCAGGGCT CGCGCGGCAA GTTCTTCGTC GGCGACTACA TCGGCGATTT CGCGCCCGGC
AACCTCGTCG TCACCGGGCC GAACCTGCCG CACAACTGGA TCAGCGAACT CGGCCCCGGC
GAGCGCGTGC CGTCGCGCGA CGTCGTGCTG CAGTTCTCGC GCGACGCGGC CGAGAAGATG
GTCGCCGCGT TCGCCGAGCT GCAGCCGGTG CTCGACCTGA TCGACGAAGC GTCGCGCGGC
GTGCAGTTCC CGGACGAAGT CGGGCTCGCT GTCGCGCCGC TGATGGTCGA GCTCGCGAGC
GCGCACGGCT GCCGGCGCGT CGAGGTGCTG ATGGCGCTGT TCGACCGGCT GGCGTCGTGC
GCCGCGCGCC GCCCGCTCGC GGGCCCCGGC TACCGGATCG ACGCGCAGCA CTACATGTCG
TCGACGATCA ACCAGGTGCT CGCGTATCTG CGGCAGAACC TGCCGGGCGC GCTGCGCGAG
GCGGACGTCG CCGAATTCGC CGGCATGAGC GTGAGCACGT TCACGCGCTT CTTTCGCCGG
CATACCGGCT CGACGTTCGT CCAGTACCTG AACCGGCTGC GGATCAACGA AGCGTGCGAA
TTGCTGATGT GCTCGGCGCT CAACGTCACC GACATCTGCT ATCGCGTCGG CTTCAACAAC
CTGTCGAACT TCAACCGGCA ATTCCTCGCG ATGAAGGGGA TGCCGCCGTC ACGCTTTCGC
GCGCTGCACC GGTTGAACGA GCCGCGCGAG CAGGACGCGG CGCCCGCTGC CGCGGCATCG
GCATTGGCAT CGGCCACGGC GGCCTTCGCG GCCACGGCCC CCGCCCCCAT CGCGCGCACC
GCCCCCCACT CGCACCGGAG CCTCCACCCG TGA
 
Protein sequence
MNPDLEIVPT RRDESFRAWS HDYPHTVAKW HFHPEYEIHL IQGSRGKFFV GDYIGDFAPG 
NLVVTGPNLP HNWISELGPG ERVPSRDVVL QFSRDAAEKM VAAFAELQPV LDLIDEASRG
VQFPDEVGLA VAPLMVELAS AHGCRRVEVL MALFDRLASC AARRPLAGPG YRIDAQHYMS
STINQVLAYL RQNLPGALRE ADVAEFAGMS VSTFTRFFRR HTGSTFVQYL NRLRINEACE
LLMCSALNVT DICYRVGFNN LSNFNRQFLA MKGMPPSRFR ALHRLNEPRE QDAAPAAAAS
ALASATAAFA ATAPAPIART APHSHRSLHP