Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I2784 |
Symbol | |
ID | 3849515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 3198704 |
End bp | 3199900 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637842452 |
Product | AraC family transcriptional regulator |
Protein accession | YP_443296 |
Protein GI | 83719900 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00366686 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGCCGG CGCCGCGGGC GTGCGCGGGC CGCCGATGTC ACGGCGGATC ATCTGGCCTT CCGGTCGCGG CAAGCCGGCG GCGATCGCCC ATGATCCGGC GTGCGGGAAA GATCGCATTG AGCATTACAT TGAGCAATGG CATATTGGGC GTGCTCCGAA CATTTGAGGC CATCTTGATC GCCAAGCTCG CCCATTGGGA TTTCGCCCGG CCCGTCGGCA CGACGCGCGT GCTCGTCGAA GTCGGCGTCG AGCAGGGGTT GACCGTCGAC CGATGCCTCG ACGGCAGCGG CGTTGCCCCT GAACGGCTCG ACGAGCCGGA CGCCACCGTC GCCGCAGCGC AGGAACTGCG CATCATCCGC AACCTGATGC GGCTGCTGGG GCCGGCGTTT CCGCTCGGCA TCGAAGTCGG CCGCCGCTAC CACGCGACGA CTTACGGAAT CTGGGGATTC GCGCTCATGA GCAGCGCGAC GTTCGGCGAT GCCGTGTCGG TCGGATTGCG CTACCTTCAA CTGACGTCGA CCTTCTGCGA CATCCGGCCG ACCGTGCGCG GCGAGGACGC GACGCTCGTG ATCGACGATC GCGACCTGCC CGGCGACGTG CGCGACGTGC TGGTCGAAAT CACGGTGGCC GCGTTGATCA CGCTGCAGTT CGATCTCGAT TCTGCGAACT TGCCGGTCAA GCGTCTTGCG CTCAAGATGA AGCCGCCGGC GTACGCCGGC CGCTTTCGGA CGCTGTTCGA TGCGTCGCCC GAATTCGGCG CGGCGCACAA TGCGCTGACG GTCGACGCGC ATTGTTTGGC TCTGAAGTTG CCGCAGCGCA ACGCGCTGAC GCGGCGGCAA TGCGAGGACG AGTGCCGCCG CGTGCTCGAG CGCCGCCGTC GCAGCGAAGG CTGGGCGGGG CGTGTGCGCC GGCATCTCGC CGGCGATCCG GCGCGCGGCC CGACGATGGA CGTGCTCGCG GCCGAGTTGG GCGTGAGCGT GCGCACGTTG CGGCGGCGGC TTGCGGATGA GGGGACGGAT TACGAGACGG TCGTCGACGA GATTCGCGAG GCGCTGGCCG AAGCGCTGCT TGCGACCACG ACGCTGACCG TCGCGGAAGT GTCGGAGCGC CTCGGTTATT CGGAACCTTC CGCGTTCGCG CGCGCGTTCA GGCGCTGGAA GGCGATGTCG CCGAATGAAT ACCGGCGGTC CGCGTGA
|
Protein sequence | MSPAPRACAG RRCHGGSSGL PVAASRRRSP MIRRAGKIAL SITLSNGILG VLRTFEAILI AKLAHWDFAR PVGTTRVLVE VGVEQGLTVD RCLDGSGVAP ERLDEPDATV AAAQELRIIR NLMRLLGPAF PLGIEVGRRY HATTYGIWGF ALMSSATFGD AVSVGLRYLQ LTSTFCDIRP TVRGEDATLV IDDRDLPGDV RDVLVEITVA ALITLQFDLD SANLPVKRLA LKMKPPAYAG RFRTLFDASP EFGAAHNALT VDAHCLALKL PQRNALTRRQ CEDECRRVLE RRRRSEGWAG RVRRHLAGDP ARGPTMDVLA AELGVSVRTL RRRLADEGTD YETVVDEIRE ALAEALLATT TLTVAEVSER LGYSEPSAFA RAFRRWKAMS PNEYRRSA
|
| |