Gene Information |
Locus tag | Cpha266_2028 |
Symbol | |
ID | 4569148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2343961 |
End bp | 2344860 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639766609 |
Product | RNA polymerase, sigma 32 subunit, RpoH |
Protein accession | YP_912464 |
Protein GI | 119357820 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
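The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical and is not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive interval
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above (Start bp, End bp).
gene_len, protein_len = expected_lengths(2343961, 2344860)
print(gene_len, protein_len)
```

With the values from this record, the sketch yields 900 bp and 299 aa, matching the Gene Length and Protein Length fields.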
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.738085 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCAAC TTAAAATAAG CAAGCAGATT ACCAATCGTG AGAGCCTGTC GCTTGATCGG TATCTGCAGG AGATAGGAAA GTATGATTTA CTGACCGCCG AAGATGAGGT GAAACTGACC AAGGCGATCA AGGAGGGTTA TGATACACCG GTTGATACCG TCGAATACAG AAGGGCCAAG CGTGCGCTTG ACAAGTTGAT CAAGGGAAAC CTGAGGTTTG TTGTTTCTGT TGCCAAACAG TACCAGAATC AGGGGCTTAC GCTCGGCGAT CTTATTAATG AAGGGAATCT TGGTTTGATC AAGGCAGCCA AACGCTTCGA TGAAACGAGG GGATTCAAGT TTATCTCCTA TGCGGTCTGG TGGATTCGTC AGTCTATTCT TCAGGCGCTT GCCGAACAGT CGAGGATTGT GAGGCTGCCG CTGAACAGGG TCGGAACCCT GAACAAGATC AGCAAGGCTT ACAGCCAGTT GGAACAGGAG TTCGAACGCG ATCCGAATAC GCGGGAACTT GCCAATCTTC TCGATATGGA TTCCCAGGAT GTTGCCGATA CGCTCAAGAT TGCCGGAAGG CATGTTTCTG TTGATGCTCC GTTTGCGCAG GGTGATGATA ATCGCCTTCT CGATGTTCTT CAGAATGACG GTCATCTTCC TGACCATGGG CTCAACAAGG ACTCTCTCAC TCTTGAAGTT GAACGATCTC TCTCCGTGCT TGCTCCGAGA GAAGCGGACG TGATCCGTTC CTATTTCGGC ATAGGGATGG ATAATCCACT GACCCTTGAG GAGATTGGCG AAAAATTCAA GCTGACCCGC GAGCGTGTTC GCCAGATCAA GGAAAAAGCG ATACGCAGGT TGCGCCAGTC GGCATACAGC AAGATTCTTA AGGAGTATAT CGGCAGTTAA
|
Protein sequence | MRQLKISKQI TNRESLSLDR YLQEIGKYDL LTAEDEVKLT KAIKEGYDTP VDTVEYRRAK RALDKLIKGN LRFVVSVAKQ YQNQGLTLGD LINEGNLGLI KAAKRFDETR GFKFISYAVW WIRQSILQAL AEQSRIVRLP LNRVGTLNKI SKAYSQLEQE FERDPNTREL ANLLDMDSQD VADTLKIAGR HVSVDAPFAQ GDDNRLLDVL QNDGHLPDHG LNKDSLTLEV ERSLSVLAPR EADVIRSYFG IGMDNPLTLE EIGEKFKLTR ERVRQIKEKA IRRLRQSAYS KILKEYIGS
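The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes that the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the spaces are only display grouping. The codon-to-amino-acid mapping is built from the compact amino acid string for NCBI translation table 11; the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first and last few codons of the record are used here, so the GC fraction of the fragment is not expected to match the 49% reported for the whole gene.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino acid
# string of NCBI translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
head = "ATGAGGCAAC TTAAAATAAG CAAGCAGATT"
tail = "ATCGGCAGTTAA"
print(translate(head))   # start of the protein
print(translate(tail))   # end of the protein, before the TAA stop
print(gc_fraction(head))
```

The translated fragments correspond to the first ten residues (MRQLKISKQI) and the last three residues (IGS) of the protein sequence in the record.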