Gene BTH_II1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1804 
Symbol 
ID3846420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp2178793 
End bp2179920 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content70% 
IMG OID637839105 
ProductAraC family transcription regulator 
Protein accessionYP_439998 
Protein GI83717593 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCTC CGCTCAATTA CACTGATCGT TTTTGCCATA CCGCACCGCG GCACGCTCCT 
GCGATGAAGC ACGAAGAAAA GAAAGGCACC GTTTCGATCG AACTCGTCGA GTCGAGCCTC
GCGCTGTCGC GGCGGCGCGG CGTCGACGAC GCGCCGCTCC TCGCGCAGGC GGGCATTGCG
GCCGCGTTGC TCGCGCAGCG CAACGCACGC GTGTCCGCGC GGCAGTACGG CGCGCTGTGG
AACGCGATCG CGCGCGCGCT CGACGACGAA TTCTTCGGCC AGGACTCGCA CCCGATGCGC
TGCGGCAGCT TCATCGCGAT GAGCCAGGCG GCGCTCGGCG CGCGCAACGG GCTGCGCGCG
CTCGCCCGCG CGGTCAACTT CATGCACTGC GTGCTCGACG ATCTGCACGC CGAGATCGAC
GCGAACGCCG AGCGCGTGCG CCTGCGCTTC GTGCACCGCA ACAGCGCGAA TCCGCCGGAG
ATGTTCGCGT ACGCAACCTA TTTCATCATC GTCTACGGCC TCACGTGCTG GCTCATCGGA
CGGCGCATTC CGCTGCTGCA CGCGGGCTTT CGCTGCGGCG AGCCTCGCGC GGTCCACGAA
TATCAGTTGA TGTTCTGCGA CGACATGCGC TTCGGCGAAT CCGAATCGTA TGTCGATTTC
GATCCGGCGT TCGCCGCGCT GCCCGTCGTG CAGACGGCGA AGACGCTCAA GCCGTTCCTG
CGCGACGCGC CCGCGAGCTT CATCGTCAAG TACCGCAACC CGCACGCGCT CGGCGGGCGC
GTGCGCGCGG CGCTGCGCGC GCTGCCGCCC GCCGCTTGGC CCACCGCGCG GGCGCTCGCC
GCGCGGCTGC ATGTAGCCGA GGCGACGCTG CGCCGCAAGC TGAAGCAGGA AGGCCACTCG
TACCAGACGA TCAAGGACGC GCTGCGCCTC GATCTCGCGT GCGAGGCGCT CGCCGACCCG
GCCCGCACGG TCGCCGACGT CGCCGCGGCG ACCGGCTTCG CCGAGCCGAG CGCGTTCTAC
CGCGCGTTCC GCAAGTGGCG CGGGATGAGC CCCGCCGACT ACCGCGACGC CGCGCTCGCC
GCGCGCGCGG CCGCTTCGCG CTTTCGCCGG AAACCGCCTA CTCTTTAA
 
Protein sequence
MLAPLNYTDR FCHTAPRHAP AMKHEEKKGT VSIELVESSL ALSRRRGVDD APLLAQAGIA 
AALLAQRNAR VSARQYGALW NAIARALDDE FFGQDSHPMR CGSFIAMSQA ALGARNGLRA
LARAVNFMHC VLDDLHAEID ANAERVRLRF VHRNSANPPE MFAYATYFII VYGLTCWLIG
RRIPLLHAGF RCGEPRAVHE YQLMFCDDMR FGESESYVDF DPAFAALPVV QTAKTLKPFL
RDAPASFIVK YRNPHALGGR VRAALRALPP AAWPTARALA ARLHVAEATL RRKLKQEGHS
YQTIKDALRL DLACEALADP ARTVADVAAA TGFAEPSAFY RAFRKWRGMS PADYRDAALA
ARAAASRFRR KPPTL