Gene Information |
Locus tag | P9303_29561 |
Symbol | |
ID | 4777820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2614605 |
End bp | 2615735 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640088480 |
Product | putative type II alternative RNA polymerase sigma factor |
Protein accession | YP_001018951 |
Protein GI | 124024644 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family; [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
|
|
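The coordinate and length fields above are consistent with each other, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length(2614605, 2615735)
print(length)                  # 1131
print(protein_length(length))  # 376
```

Under these assumptions the computed values match the Gene Length (1131 bp) and Protein Length (376 aa) fields.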
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGAGCA GCCCCCAGCA AACCAAGACT TCCCTAGTCC GCAGAAGCGG CAGCACTGAT CCTGTGCGTC TCTACTTACA GGACATTGGA CGCGTAGAGC TACTGACCCA TGAAGAGGAA GTAACCCTGG CGCGACTTGT GCAGCGCAGG GAAGCCCTAC TCAAACAGGA AAGGCAACTG GCCTCCAGCC AAGAAGCGAT CAAAGAATTA CAAAGACTGG AGGAGTTACA GCAGCGAGAA GCGAACCATT CCTGCCACTG GCCCACCAAA CAGGAATGGG CTATGGCTGC TGGGCTCACC CTGGCCGAGC TGCAAGACAA AATCGAGACT GGTTACAAAA CCTGGGGAGC CCTGACTGGT CTTGACCCCT TGGAACTCAA GCGAAGTTTG CGAGCTGGTC GGCGTGCCAA GGATCAGATG ATCCAGGCCA ACCTTCGGCT TGTGGTGGCT GTAGCCAAGA AATATCAACA ACGGGGCATA GAACTGCTTG ATCTGGTGCA AGAAGGCACC CTGGGCTTGG AACGCGCAGT AGAGAAATTT GACCCGGCTA GAGGTTTCCG CTTCAGCACC TACGCCTACT GGTGGATCCG TCAGGGCATC ACAAGGGCCA TTGCGACGCA AAGTCGGACG ATCCGACTGC CGATGCACAT CACCGAAAAA CTAAACCGCA TCAAACGGGT TCAACAGGAG ATTGCTAGCA ACCAAGGACG ATTAGCTTCG ATTGCCGATC TCGCCAAGGC ACTTGGCCTT AGTGAAGAAA CAGTGCGCCT AACCCTAATG AGGGTCCCCC GTTCGATCTC CTTGGAAACT CGAATAGGCC AAGAACAAGA CAGCCAACTA GGCGATCTGC TGGAAGACAG CAACGCGACC CCAGAGGAGA AACTCACCCG CGATCAATTG CACAACGACC TTGAAATCTT GCTGGATGAA CTAAGCAACC GCGAAGCGAC AGTGATCAGA CGACGTTTTG GACTTGAAGA CGACACTCCT CAAACCCTGA CGCAAATTGG CGAGGCAATG CATCTATCGC GAGAACGAGT TCGTCAGATC GAAAGCCATG CCCTCTTGAA ATTGCGTCAA CCACAACGTC GCTGCAAGGT ACGGGACTAC ATTCAAAATC TCGATTCCTG A
|
Protein sequence | MKSSPQQTKT SLVRRSGSTD PVRLYLQDIG RVELLTHEEE VTLARLVQRR EALLKQERQL ASSQEAIKEL QRLEELQQRE ANHSCHWPTK QEWAMAAGLT LAELQDKIET GYKTWGALTG LDPLELKRSL RAGRRAKDQM IQANLRLVVA VAKKYQQRGI ELLDLVQEGT LGLERAVEKF DPARGFRFST YAYWWIRQGI TRAIATQSRT IRLPMHITEK LNRIKRVQQE IASNQGRLAS IADLAKALGL SEETVRLTLM RVPRSISLET RIGQEQDSQL GDLLEDSNAT PEEKLTRDQL HNDLEILLDE LSNREATVIR RRFGLEDDTP QTLTQIGEAM HLSRERVRQI ESHALLKLRQ PQRRCKVRDY IQNLDS
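The gene sequence begins with TTG while the protein begins with M, which fits translation table 11, where some non-ATG codons can act as start codons and are then read as methionine. The sketch below illustrates this on the first 30 bases of the record. The `translate` helper and the `START_CODONS` set are hypothetical, and the set is only a partial list of the starts table 11 permits.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in T, C, A, G codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a partial set of start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate("TTGAAGAGCA GCCCCCAGCA AACCAAGACT"))  # MKSSPQQTKT
```

The output matches the first ten residues of the protein sequence. For real analyses, an established library with complete translation tables is the safer choice.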