Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II0536 |
Symbol | |
ID | 3846657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 635826 |
End bp | 637094 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637837841 |
Product | ECF subfamily RNA polymerase sigma factor |
Protein accession | YP_438736 |
Protein GI | 83717364 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.726292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACG CCGTCGTCCA TCGCGCGATC GACGCCGTCT GGCGGATCGA GGCCGCGAGA ATCATCGCGC ATGTCGCGCG GCTCGTGCGC GACGTCGGCG TGGCCGAGGA ACTCGCGCAG GACGCGCTCG TCGCGGCCCT CGAGCACTGG CCGAGCGACG GCGTACCGGA GAATCCGGGC GCGTGGCTGA TGACGGCCGC GAAGCGCCGC GCGCTCGATC ATCTGCGGCA GAACGTGCTG CATGCGCGCA AGCGCGAGCA GATCGGCCTC GATCTTGATG CGCTCGGCGC GCACGTCGCG CCGGACGTCG CCGATGTGTT CGAGGCGGCG CGCGACGACG ACATCGGCGA CGATCTGCTG AGGCTCGTGT TCACCGCGTG CCATCCGGTG CTGTCGACCG ACGCGCGCGT GGCACTGACG CTGCGGCTGC TCGGCGGGCT GACGACGGGC GAGATCGCGC GCGCGTTTCT CACGCCCGAG CCGACGATCG CGCAGCGGAT CGTGCGCGCG AAGCGCACGC TGTCGGCGGC GAAGGTGCCG TTCGAGGTGC CGCGCGCGCC GGAGCGCGCG GCGCGGCTTG CTTCGGTGCT CGAAGTGATT TATCTGATCT TCAACGAAGG CCATTCGGCG ACGGCGGGCG ACGACTGGAT GCGTCCCGCT CTGACCGACG AAGCGCTGCG GCTCGGGCGC GTGCTCGCCG GGCTCGCGCC CGACGAGAGC GAGGTGCACG GGCTCGTCGC GCTGATGGAG ATCCAGGCGT CGCGGATGCA CGCGCGCGTC GACGCGCAAG GGCGTCCCGT GCTGCTGCTC GATCAGGACC GCAGCCGCTG GGACCCGCTG CTGATCCGGC GCGGGCTCGC CGCGCTCGCG CGTTCGGAGG CGCTCGGCGG CGCGAGCGGG CCGTATGCGC TGCAGGCCGC GCTCGCCGCG TGCCACGCAC GCGCGCGCAG CGCCGACGAC ACGGACTGGG AGCAGATCGT CGCGCTCTAC GACGCGCTCG CGCAGGTCGC GCCTTCGCCC GTCGTCGAGC TGAATCGCGC GGTCGCGGTC GGCATGGCGT TCGGGCCGGC CGCGGGGCTC GAGATCGTCG ACGCGCTCGC GGCCGATCCC GCGCTCGCGC GTTATCACTG GCTGCCGAGC GTGCGAGGCG ATCTGCTCGC GAAGCTCGGG CGGCGCGACG AGGCGCAGGC CGAGTTCAGG CGCGCGGCCG ACATGACGCT CAATGCGCGC GAGCGCGAGA TGCTGCTTGC GCGCGCGATG CAGCGGTGA
|
Protein sequence | MTDAVVHRAI DAVWRIEAAR IIAHVARLVR DVGVAEELAQ DALVAALEHW PSDGVPENPG AWLMTAAKRR ALDHLRQNVL HARKREQIGL DLDALGAHVA PDVADVFEAA RDDDIGDDLL RLVFTACHPV LSTDARVALT LRLLGGLTTG EIARAFLTPE PTIAQRIVRA KRTLSAAKVP FEVPRAPERA ARLASVLEVI YLIFNEGHSA TAGDDWMRPA LTDEALRLGR VLAGLAPDES EVHGLVALME IQASRMHARV DAQGRPVLLL DQDRSRWDPL LIRRGLAALA RSEALGGASG PYALQAALAA CHARARSADD TDWEQIVALY DALAQVAPSP VVELNRAVAV GMAFGPAAGL EIVDALAADP ALARYHWLPS VRGDLLAKLG RRDEAQAEFR RAADMTLNAR EREMLLARAM QR
|
| |