Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_18881 |
Symbol | |
ID | 4912715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1616615 |
End bp | 1617634 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640161494 |
Product | type II alternative sigma-70 family RNA polymerase sigma factor |
Protein accession | YP_001092112 |
Protein GI | 126697226 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATCC CTCTGGAATC TGCAAAAAGC TCTTCAGATA ATAATTTTGA TGAGCCAAGA TTACCAAACA CTGCGGGCAA GTCTCGCAAA TCGAAATCCA GTCTTACGGC AAAACAAAGC CAAAAAAAAT CTGGCAGACT CGCTTCAGAT TCTATTGGCT ATTACTTAAG TAGCATTGGA AGAGTACCTC TTTTGACTCC AGCAGAGGAA ATAGAGTTAG CTCATCATGT TCAGAACATG AAAAAGTTGC TACAGATTCC TGAAACTGAT AGAACCCAAC GAAATCTTTA TCAAATTAAG ATTGGCAAAA GAGCAAGAGA TAGAATGATG GCAGCTAATC TAAGACTCGT TGTCTCGGTT GCAAAAAAAT ACCAAAACCA AGGGCTTGAA TTATTAGATC TTGTCCAGGA AGGAGCTATT GGCCTTGAAA GAGCTGTAGA TAAATTTGAT CCTGCTATGG GATATAAATT CTCAACTTAT GCTTACTGGT GGATTAGACA AGGAATGACG AGGGCAATTG ATAATAGTGC TAGAACGATC CGTTTGCCTA TTCACATAAG TGAAAAACTG TCCAAAATGA GAAGAGTCTC TAGAGAATTA TCACATAAAT TTGGCAGACA ACCTACAAGA TTGGAAATGG CAACTGAGAT GGGAATTGAT CAAAAAGATT TAGAAGATTT AATTTCTCAA AGTGCTCCAT GCGCCTCCCT AGATGCACAT GCAAGAGGGG AAGAAGACAG AAGTACTCTT GGTGAACTCA TACCTGATCC AAACTGTGAA GAGCCTATGG AGGGTATGGA TAGAACTATT CAAAAGGAGC ATTTAGGAAC TTGGCTTTCT CAATTAAATG AAAGAGAGCA AAAAATCATG AAGCTAAGGT TTGGGCTAGA TGGTGAAGAA CCATTAACAC TCGCAGAAAT AGGAAGACAA ATTAATGTTT CACGAGAAAG AGTAAGGCAA CTAGAAGCTA AAGCAATATT AAAGCTTCGA GTAATGACGA CTCATCAAAA AGCAGCTTAA
|
Protein sequence | MGIPLESAKS SSDNNFDEPR LPNTAGKSRK SKSSLTAKQS QKKSGRLASD SIGYYLSSIG RVPLLTPAEE IELAHHVQNM KKLLQIPETD RTQRNLYQIK IGKRARDRMM AANLRLVVSV AKKYQNQGLE LLDLVQEGAI GLERAVDKFD PAMGYKFSTY AYWWIRQGMT RAIDNSARTI RLPIHISEKL SKMRRVSREL SHKFGRQPTR LEMATEMGID QKDLEDLISQ SAPCASLDAH ARGEEDRSTL GELIPDPNCE EPMEGMDRTI QKEHLGTWLS QLNEREQKIM KLRFGLDGEE PLTLAEIGRQ INVSRERVRQ LEAKAILKLR VMTTHQKAA
|
| |