Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_19071 |
Symbol | |
ID | 4718646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1644619 |
End bp | 1645638 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640079642 |
Product | type II alternative sigma-70 family RNA polymerase sigma factor |
Protein accession | YP_001010297 |
Protein GI | 123969439 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATCC CTCTGGAATC TGCGAAAAGC TCTTCAGATA ATAATTTTGA TGAGCCAAGA TTACCAAACA CTGCGGGCAA ATCTCGCAAA TCGAAATCCA GTCTTACGGC AAAACAAAGC CAAAAAAAAT CTGGCAGACT CGCTTCAGAT TCTATTGGCT ATTACTTAAG TAGCATTGGA AGAGTACCTC TTTTGACTCC AGCAGAGGAA ATAGAGTTAG CTCATCATGT TCAGAACATG AAAAAGTTGC TACAAATTCC TGAAACTGAT AGAACCCAAC GAAATCTTTA TCAAATTAAG ATTGGCAAAA GAGCAAGAGA TAGAATGATG GCAGCTAATC TAAGGCTCGT TGTCTCAGTT GCAAAAAAAT ACCAAAACCA AGGGCTTGAA TTATTAGATC TTGTCCAGGA AGGAGCTATT GGACTTGAAA GAGCCGTAGA TAAATTTGAT CCTGCTATGG GATATAAATT CTCAACTTAT GCTTACTGGT GGATTAGACA AGGAATGACG AGGGCAATTG ATAACAGTGC TAGAACCATT CGTTTGCCTA TTCACATAAG TGAAAAACTA TCCAAAATGA GAAGAGTCTC CAGAGAATTA TCACACAAAT TTGGCAGACA ACCTACAAGA TTGGAAATGG CAACTGAGAT GGGAATTGAT CAAAAAGATT TAGAAGATTT AATTTCTCAA AGTGCTCCTT GCGCCTCCCT AGATGCACAT GCAAGAGGGG AAGAAGACAG AAGTACACTT GGTGAACTCA TACCTGATCC AAACTGTGAA GAGCCTATGG AAGGTATGGA TAGAACTATT CAAAAAGAGC ATTTAGGAAC TTGGCTTTCT CAATTAAATG AAAGAGAGCA AAAAATCATG AAGCTCAGAT TTGGGCTAGA TGGTGAAGAA CCATTAACAC TCGCAGAAAT AGGAAGACAA ATTAATGTTT CGCGAGAAAG AGTAAGGCAA CTAGAAGCTA AAGCAATATT AAAGCTTCGA GTAATGACAA CTCATCAAAA AGCAGCTTAA
|
Protein sequence | MGIPLESAKS SSDNNFDEPR LPNTAGKSRK SKSSLTAKQS QKKSGRLASD SIGYYLSSIG RVPLLTPAEE IELAHHVQNM KKLLQIPETD RTQRNLYQIK IGKRARDRMM AANLRLVVSV AKKYQNQGLE LLDLVQEGAI GLERAVDKFD PAMGYKFSTY AYWWIRQGMT RAIDNSARTI RLPIHISEKL SKMRRVSREL SHKFGRQPTR LEMATEMGID QKDLEDLISQ SAPCASLDAH ARGEEDRSTL GELIPDPNCE EPMEGMDRTI QKEHLGTWLS QLNEREQKIM KLRFGLDGEE PLTLAEIGRQ INVSRERVRQ LEAKAILKLR VMTTHQKAA
|
| |