Gene Information |
Locus tag | BURPS668_2198 |
Symbol | rpoS |
ID | 4884023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2191808 |
End bp | 2192887 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640128126 |
Product | RNA polymerase sigma factor RpoS |
Protein accession | YP_001059233 |
Protein GI | 126440772 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02394] RNA polymerase sigma factor RpoS; [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
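The start, end, gene length and protein length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the Gene Information fields; variable names are illustrative.
start_bp = 2191808
end_bp = 2192887
protein_length_aa = 359

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should consist of whole codons; the final codon is the stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(gene_length_bp, codons, coding_codons)  # prints: 1080 360 359
```

Under that assumption the 1080 bp gene holds 360 codons, one of which is the stop codon, matching the listed protein length of 359 aa.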
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.524707 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAAAT CGAAGCGCCA CGATCCGCAA GCCGAGTCTG AGAAGATCAG TCGAGCCAAG CAAGCATCGG TAGAGCGAAC TGGTGCTTCA GCGGACGAGG ACGAAGACGC CGCCGACAAC GAACGCGACT ACGAGTCGCG CGATGCGGAT CCGGACGAAT CGGGCGAGGG GCGCGGCGAC GCGCAGCCCG ATCTGGATGA CTTCCGGGCG CTTCTGCAGG CCGAGCTCAC CGCCGATACG ATCCAGCACT ACCTGAACCG CATCAGCGTG AAGCCGCTGT TGACGGTCGA GGAAGAGCAG CGCTACTCGC GGCTCGCGAA GGCGGGCGAA TTCGAGGCGC GGCAGGTGAT GATCGAGCGC AACCTGCGGC TTGTCGTCAG CATCGCGAAA GGCTATCTGA ATCGCGGCGT GCCGCTCCTC GATCTGATCG AAGAAGGCAA TCTTGGGCTC ATGCACGCGA TCGAGAAATT CGATCCGACG CGTGGCTTTC GCTTCTCGAC GTACGCGACC TGGTGGATCC GCCAGAGCAT CGAGCGCGCG ATCATGAATC AGGCGCGGAC GGTACGGCTG CCGGTGCACG TGATCCGCGA ACTGAACCAG GTGCTGCGCG CGAAGCGCCA TTTGGAGAAG AACTCGATGT CCACGGGCGA GGCCGCCGAG CGCCGCGAGG CGAGCATCGA CGACATCGCG TATCTGACCG GCAAGACGGC CGAGGAGGTC ACGGACATCC TCGCGCTGAA CGAGCATACG GCGTCGCTCG ACGCGCCGCT CGATCTCGAC CCGGCGAGCA GCCTGCTCGA TCTGCTGCCC GACGATCAGA GCCAGTCGCC GGACGCGGAG GTTCAGCACC GCGAGCTGGA GACGCTCACG CGCGCCTGGT TGTCGCGCCT GTCCGACAAG CACCGCCATG TGATCGAGCG GCGCTTCGGC CTGAACCATA TCGAACCCGC GACGCTCGAG GAGCTTGCCG ACGAGATGGG GCTGACCCGC GAACGGGTTC GCCAGATCCA GCAGGAAGCG CTCGTGCGGC TCAAGCGGTT TTTCGCATCC AACGGCGTGC GCAAGGACGC CGTTCTGTAA
|
Protein sequence | MPKSKRHDPQ AESEKISRAK QASVERTGAS ADEDEDAADN ERDYESRDAD PDESGEGRGD AQPDLDDFRA LLQAELTADT IQHYLNRISV KPLLTVEEEQ RYSRLAKAGE FEARQVMIER NLRLVVSIAK GYLNRGVPLL DLIEEGNLGL MHAIEKFDPT RGFRFSTYAT WWIRQSIERA IMNQARTVRL PVHVIRELNQ VLRAKRHLEK NSMSTGEAAE RREASIDDIA YLTGKTAEEV TDILALNEHT ASLDAPLDLD PASSLLDLLP DDQSQSPDAE VQHRELETLT RAWLSRLSDK HRHVIERRFG LNHIEPATLE ELADEMGLTR ERVRQIQQEA LVRLKRFFAS NGVRKDAVL
|
| |
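The record lists translation table 11 and a GC content of 65%. As a rough sketch of how those fields relate to the sequences above, the Python below computes a GC fraction and translates a coding sequence. The function names are hypothetical and do not belong to any database API. It assumes the gene sequence is shown in coding orientation (it starts with ATG and ends with TAA even though the gene lies on the minus strand). It also relies on table 11 assigning the same amino acids to codons as the standard code, differing only in the permitted start codons.

```python
# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing anything other than A, C, G, T.
    A trailing partial codon is ignored.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two 10-base blocks of the gene sequence shown above.
prefix = "ATGCCGAAAT CGAAGCGCCA"
print(translate(prefix))  # prints: MPKSKR
```

The six residues printed match the start of the listed protein sequence. Applied to the full gene sequence, `translate` would be expected to return the whole 359 aa protein, and `gc_content` a value near the listed 65%, provided the assumptions above hold.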