Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2632 |
Symbol | |
ID | 4886205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2531045 |
End bp | 2532313 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640132569 |
Product | sigma-70 family RNA polymerase sigma factor |
Protein accession | YP_001063625 |
Protein GI | 126442967 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGACG CCGCCGTTTA CCGTGCGATC GATGCCGTCT GGCGGATCGA GGCCGCGAGA ATCATCGCGC ACGTCGCGCG GCTCGTGCGC GACGTCGGCG TGGCCGAAGA GCTCGCGCAG GACGCGCTCG TCGCGGCGCT CGAGCACTGG CCGAGCGGCG GCGTGCCGGA CAATCCGGGC GCATGGCTGA TGACGGCCGC GAAGCGCCGC GCGCTCGATC ATCTGCGGCA GCACGCGCTG CACGCGCGCA AGCGCGAGCA GATCGGCCTC GATCTCGATG CGCTCGGCGC GCACGTCGCG CCTGACGTCG CCGACGTGTT CGAAGCGGCG CGCGACGACG ACATCGGCGA CGATCTGCTG CGGCTCGTGT TCACCGCTTG TCATCCGGTG CTGTCGACCG ACGCGCGCGT CGCGCTGACG CTGCGGCTGC TCGGGGGGCT GACGACGGGC GAGATCGCGC GCGCGTTTCT CACGCCGGAG CCGACGATCG CGCAGCGGAT CGTGCGCGCG AAGCGCACGC TATCGGCGGC GAAGGTGCCG TTCGAGGTGC CGCGCGCACC GGAGCGCGCG GCGCGGCTGG CATCGGTGCT CGAAGTGATC TATCTGGTTT TCAACGAAGG CTATTCGGCG ACGGCGGGCG ACGACTGGAT GCGCCCCGCG CTGACCGACG AGGCGCTTCG GCTCGGGCGC GTGCTCGCCG GGCTCGCGCC CGACGAGAGC GAGGTGCACG GGCTTGTCGC GCTGATGGAG ATCCAGGCGT CGCGGATGCA TGCGCGGGTC GATGCGCAGG GCCGCCCCGT GCTGCTGCTC GATCAGGATC GCAGCCGCTG GGATCCGCTG CTGATCCGGC GCGGGCTTGC CGCGCTCGCA CGCTCGGAGG CGCTCGGCGG CGCGAGCGGG CCCTATGCGC TGCAGGCGGC GCTCGCCGCA TGTCACGCGC GTGCGCGCCA TGCCGACGAT ACCGACTGGG AGCAGATCGT CGCGCTCTAC GACGCGCTCG CGCAGGTCGC GCCCTCGCCC GTCGTCGAGC TGAATCGCGC GGTTGCGGTC GGCATGGCGT TCGGGCCAGC GGCGGGGCTC GAGATCGTCG ACGCGCTCGC GGCCGACCCG GCGCTCGCGC GCTATCACTG GCTGCCGAGC GTGCGCGGCG ATCTGCTCGC GAAGCTCGGG CGGCGCGCCG AGGCGCAGGC CGAATTCCAG CGTGCGGCCG ACATGACGCT CAATGCGCGC GAGCGTGAGA TGCTGCTTGC GCGCGCGACG CAGCGGTGA
|
Protein sequence | MMDAAVYRAI DAVWRIEAAR IIAHVARLVR DVGVAEELAQ DALVAALEHW PSGGVPDNPG AWLMTAAKRR ALDHLRQHAL HARKREQIGL DLDALGAHVA PDVADVFEAA RDDDIGDDLL RLVFTACHPV LSTDARVALT LRLLGGLTTG EIARAFLTPE PTIAQRIVRA KRTLSAAKVP FEVPRAPERA ARLASVLEVI YLVFNEGYSA TAGDDWMRPA LTDEALRLGR VLAGLAPDES EVHGLVALME IQASRMHARV DAQGRPVLLL DQDRSRWDPL LIRRGLAALA RSEALGGASG PYALQAALAA CHARARHADD TDWEQIVALY DALAQVAPSP VVELNRAVAV GMAFGPAAGL EIVDALAADP ALARYHWLPS VRGDLLAKLG RRAEAQAEFQ RAADMTLNAR EREMLLARAT QR
|
| |