Gene BTH_II0853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II0853 
Symbol 
ID3844856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp995098 
End bp996285 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content73% 
IMG OID637838156 
ProductAraC family transcriptional regulator 
Protein accessionYP_439050 
Protein GI83717752 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0449934 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTGCC TGAGCGGGCG GCCGCGCCTG CGTCGCGCGT TACGCAACGC TCGCCTGCGC 
GACAGGCGGC CCGCACGGAC GCTTCGCAAA CGCGCTTCCA AGTCGCACCT TTCACGGGCC
TTCAACATGG CGAAGACGCA CGCGCCCGGC AGCGGCACGC TGCTGCGGTT CTTTTCGACC
GACGACATGC CGCTCGCGCG CGCAGCGGCG TTCTGGAGCG CGCACGTGTT CCACTGCGAG
GATGTGCGCG CGGAGCAGGC GCGCGCGTTT CACGGGCACG GCTTTCTCTG CCGGTGCGAG
CGCGGCCGGT TCGTTCGTTT CCGCGGCGCG TCGCTCGATG CGCGGATCAG CGACGCGTGG
CTGAGCGCCG CGACGGCCGA CGCGCACGTG ACGATCTGCG CGCTGCACGC GGGCGAGTGC
ACGGTCGAGG CGCCCGGCTT GCCGGATGCG CGCTTTCGCG CGAACGATCT GTTCCTGCTC
GACGGCGGCC GGCCGATGCG CGTGCGCTGG GACGAGCCGT GCTTCAGCGC GCTCAGACTG
CCGCGCGCGT CGGTGGCGCG CACGCTCGGG CAGGCGGCGA TGGATGCGTC GCCGAGCTCG
GCTTCGTTGC AGGCGGCGCG GCTCGCGCCG TTTCTCGCGG CCGAGCTCGC GCTCATCGGC
GGCCGCGGCC CGGCGCTGTC GTCCGACGAG CTCGATTACG TGCTCGCGCG CGCGGCGGAC
CTCGGCCGCG CGCTGCTTCA GGCGGCGCTG TCGGCGCGCG TGCGGCGCGG CGCGCCCGCG
CGCGCCGACC GGCTGCAGGC CGCGTATCGC TACATCGAAC AGCATCTCCA TCTGCCCACG
CTCACGCCCG AGCGGATCGC CGATGCGATC CATTGCTCGC GCACGCAGCT CTATCGCCTG
TTTCGCCACG AATCGCAGAC GGTGAAGGCC GCGTTGCGCG ACGCGCGGCT GAACCGCAGC
CTCGGCTACC TCGAGCGGCC CGAGCTCGCG CTTAGCATCG GCGAGATCGC GCACGCGTGC
GGTTTTCCCG ATCAGTCGAC GTTCGGCAAG CTGTTTCGCC GGCGCTTCGG AAGGACGCCG
GGCGAGGTGC GGCGCGCCGC GCGGGGGCGC CGCGATGAAG CCGAGCCGCC CGACACCGCG
CAAGGCGGCG ACGCGGCGCA AGCACAGGCG CAGACGCTTC AACGATAG
 
Protein sequence
MRCLSGRPRL RRALRNARLR DRRPARTLRK RASKSHLSRA FNMAKTHAPG SGTLLRFFST 
DDMPLARAAA FWSAHVFHCE DVRAEQARAF HGHGFLCRCE RGRFVRFRGA SLDARISDAW
LSAATADAHV TICALHAGEC TVEAPGLPDA RFRANDLFLL DGGRPMRVRW DEPCFSALRL
PRASVARTLG QAAMDASPSS ASLQAARLAP FLAAELALIG GRGPALSSDE LDYVLARAAD
LGRALLQAAL SARVRRGAPA RADRLQAAYR YIEQHLHLPT LTPERIADAI HCSRTQLYRL
FRHESQTVKA ALRDARLNRS LGYLERPELA LSIGEIAHAC GFPDQSTFGK LFRRRFGRTP
GEVRRAARGR RDEAEPPDTA QGGDAAQAQA QTLQR