Gene BTH_I2784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I2784 
Symbol 
ID3849515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp3198704 
End bp3199900 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content68% 
IMG OID637842452 
ProductAraC family transcriptional regulator 
Protein accessionYP_443296 
Protein GI83719900 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00366686 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGCCGG CGCCGCGGGC GTGCGCGGGC CGCCGATGTC ACGGCGGATC ATCTGGCCTT 
CCGGTCGCGG CAAGCCGGCG GCGATCGCCC ATGATCCGGC GTGCGGGAAA GATCGCATTG
AGCATTACAT TGAGCAATGG CATATTGGGC GTGCTCCGAA CATTTGAGGC CATCTTGATC
GCCAAGCTCG CCCATTGGGA TTTCGCCCGG CCCGTCGGCA CGACGCGCGT GCTCGTCGAA
GTCGGCGTCG AGCAGGGGTT GACCGTCGAC CGATGCCTCG ACGGCAGCGG CGTTGCCCCT
GAACGGCTCG ACGAGCCGGA CGCCACCGTC GCCGCAGCGC AGGAACTGCG CATCATCCGC
AACCTGATGC GGCTGCTGGG GCCGGCGTTT CCGCTCGGCA TCGAAGTCGG CCGCCGCTAC
CACGCGACGA CTTACGGAAT CTGGGGATTC GCGCTCATGA GCAGCGCGAC GTTCGGCGAT
GCCGTGTCGG TCGGATTGCG CTACCTTCAA CTGACGTCGA CCTTCTGCGA CATCCGGCCG
ACCGTGCGCG GCGAGGACGC GACGCTCGTG ATCGACGATC GCGACCTGCC CGGCGACGTG
CGCGACGTGC TGGTCGAAAT CACGGTGGCC GCGTTGATCA CGCTGCAGTT CGATCTCGAT
TCTGCGAACT TGCCGGTCAA GCGTCTTGCG CTCAAGATGA AGCCGCCGGC GTACGCCGGC
CGCTTTCGGA CGCTGTTCGA TGCGTCGCCC GAATTCGGCG CGGCGCACAA TGCGCTGACG
GTCGACGCGC ATTGTTTGGC TCTGAAGTTG CCGCAGCGCA ACGCGCTGAC GCGGCGGCAA
TGCGAGGACG AGTGCCGCCG CGTGCTCGAG CGCCGCCGTC GCAGCGAAGG CTGGGCGGGG
CGTGTGCGCC GGCATCTCGC CGGCGATCCG GCGCGCGGCC CGACGATGGA CGTGCTCGCG
GCCGAGTTGG GCGTGAGCGT GCGCACGTTG CGGCGGCGGC TTGCGGATGA GGGGACGGAT
TACGAGACGG TCGTCGACGA GATTCGCGAG GCGCTGGCCG AAGCGCTGCT TGCGACCACG
ACGCTGACCG TCGCGGAAGT GTCGGAGCGC CTCGGTTATT CGGAACCTTC CGCGTTCGCG
CGCGCGTTCA GGCGCTGGAA GGCGATGTCG CCGAATGAAT ACCGGCGGTC CGCGTGA
 
Protein sequence
MSPAPRACAG RRCHGGSSGL PVAASRRRSP MIRRAGKIAL SITLSNGILG VLRTFEAILI 
AKLAHWDFAR PVGTTRVLVE VGVEQGLTVD RCLDGSGVAP ERLDEPDATV AAAQELRIIR
NLMRLLGPAF PLGIEVGRRY HATTYGIWGF ALMSSATFGD AVSVGLRYLQ LTSTFCDIRP
TVRGEDATLV IDDRDLPGDV RDVLVEITVA ALITLQFDLD SANLPVKRLA LKMKPPAYAG
RFRTLFDASP EFGAAHNALT VDAHCLALKL PQRNALTRRQ CEDECRRVLE RRRRSEGWAG
RVRRHLAGDP ARGPTMDVLA AELGVSVRTL RRRLADEGTD YETVVDEIRE ALAEALLATT
TLTVAEVSER LGYSEPSAFA RAFRRWKAMS PNEYRRSA