Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0762 |
Symbol | |
ID | 3845197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 888623 |
End bp | 890059 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637838065 |
Product | regulatory protein HrpB |
Protein accession | YP_438959 |
Protein GI | 83716867 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTCGA CACTCTATTT CGCCGCCGCC GCCGCGCTCG CCAATCCGTC GTCGAACGAC GCCTACGCCG CGCGCGTGCT CGACGGCCGC TTCGCCGAAG CGGCCGCGCT CGTCGCCGCG CGCTGCGACG CCGAGCCGGG CGACGCGGCC GCCGACGTGC GCGACATCCA CGCGTATGCC GACATCCAGC TCGTGCTCGG CCGCTTCGAG GAAGCCGAGG AAACGTTCCG CCGCGCGCAG AAGCTGCTGC GCGAATCGCG CGACCAGACG CGCGTGATGA CCTGCCGCAA CGCGAGCTGG CAGGCGTTCT TCCAGAACCG TTTCGGCATC GCGCAGGCCG GCTTCCGGCG GATCGCCGAC GATCGCGCGG CGAGCGTCGC GCAGAAGCTC GAAAGCCTCG TCGGCGCATG CGTCGTCATG CACCATCTCG GCCGCGCCGA GGATGCGATG GCGGCGCTCG ACGTGCTGGA CGAGGCCGCC CGCGGCGCCG ATCCGCGCTG GCCGCAGCTC GCGCTCGCGC TGCGCACGGA CTTGCTGCTG CAATACCAGA TCGGTCGCGC CGATGCGCTC GGCGACCACG TCTACTGGCG CTCGGCGCTC GCCGACCTGC CGTTGTCGGC GTTCGCGGAA ACGGGCGCGC CCGCGGCCGG CGCGGATGCG CCGCCGCTCT TGCGGATGCG CGTCGACTAC ATGCGGCAGG CGCGCGCGCT CGCGGCCGGC GACCACGCGG CGCTCGCGCG CATCGATGCG CACGTGAGCT GGTGCGCGAG GACGGGCCTC GCCGACTATC AGCGCAGCGT GCGCCTCGAA GTGGCGCTCG CGGCGCTCGC GGGCTGCGCG CCGAACGTCG CCGACACGAT GATCGCGCCG TATCGCGACG CGGCCGGGCG CTGCGGCTCG CATGTGCGCT GGACGCTCGA CTATCTGTAT TGCGCGGCGA AGGTGCGCGA GCAGCAGGGG CGGATTCGCG AGTCGTCGTC GCTGTATGCG CGCTATGCGC TCGCGTCGGT GCGGCACGTG CGCTTGGACG GCGCGGCGCT CGCGCCCGCG CTCGCGATCG CGCGCTGCGG CACGCACGCG GGCGCGCCGA GCGACGACGT CAGCGCGCGG CTGCCGGGCA AGTACCGGCG CGCGTATCGC TACCTGATGG AGCGGCTCGA CCAGCGCGAC CTGTCGGTGC GCGAGATCGC CGCGCACATC GACGTGACCG AGCGCGCGTT GCAGGCGGCG TTCAAGACGT ATCTCGGGCT GTCGCCGAGC GAGCTGATCC GGCGGCAGCG GATGGAGCGG ATTCGCGGCG AGCTGCTGTC CGACGCGCCG CGCGCGGCGA GCGTGCTCGA GATCGCGAGC CGCTGGGGCA TCCAGCATCG GTCGACGCTC GTCAACGGCT ATCGGCGGAT GTTCGACGAG GCGCCTTCGC AGACGGCCGA CCGGTGA
|
Protein sequence | MFSTLYFAAA AALANPSSND AYAARVLDGR FAEAAALVAA RCDAEPGDAA ADVRDIHAYA DIQLVLGRFE EAEETFRRAQ KLLRESRDQT RVMTCRNASW QAFFQNRFGI AQAGFRRIAD DRAASVAQKL ESLVGACVVM HHLGRAEDAM AALDVLDEAA RGADPRWPQL ALALRTDLLL QYQIGRADAL GDHVYWRSAL ADLPLSAFAE TGAPAAGADA PPLLRMRVDY MRQARALAAG DHAALARIDA HVSWCARTGL ADYQRSVRLE VALAALAGCA PNVADTMIAP YRDAAGRCGS HVRWTLDYLY CAAKVREQQG RIRESSSLYA RYALASVRHV RLDGAALAPA LAIARCGTHA GAPSDDVSAR LPGKYRRAYR YLMERLDQRD LSVREIAAHI DVTERALQAA FKTYLGLSPS ELIRRQRMER IRGELLSDAP RAASVLEIAS RWGIQHRSTL VNGYRRMFDE APSQTADR
|
| |