Gene P9301_18881 details


Gene Information

Locus tag: P9301_18881
Symbol:
ID: 4912715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 1616615
End bp: 1617634
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 39%
IMG OID: 640161494
Product: type II alternative sigma-70 family RNA polymerase sigma factor
Protein accession: YP_001092112
Protein GI: 126697226
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
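
The Start bp, End bp, Gene Length and Protein Length values above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in one uncounted stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for P9301_18881.
length_bp = gene_length(1616615, 1617634)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed 1020 bp and 339 aa.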


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGATCC CTCTGGAATC TGCAAAAAGC TCTTCAGATA ATAATTTTGA TGAGCCAAGA 
TTACCAAACA CTGCGGGCAA GTCTCGCAAA TCGAAATCCA GTCTTACGGC AAAACAAAGC
CAAAAAAAAT CTGGCAGACT CGCTTCAGAT TCTATTGGCT ATTACTTAAG TAGCATTGGA
AGAGTACCTC TTTTGACTCC AGCAGAGGAA ATAGAGTTAG CTCATCATGT TCAGAACATG
AAAAAGTTGC TACAGATTCC TGAAACTGAT AGAACCCAAC GAAATCTTTA TCAAATTAAG
ATTGGCAAAA GAGCAAGAGA TAGAATGATG GCAGCTAATC TAAGACTCGT TGTCTCGGTT
GCAAAAAAAT ACCAAAACCA AGGGCTTGAA TTATTAGATC TTGTCCAGGA AGGAGCTATT
GGCCTTGAAA GAGCTGTAGA TAAATTTGAT CCTGCTATGG GATATAAATT CTCAACTTAT
GCTTACTGGT GGATTAGACA AGGAATGACG AGGGCAATTG ATAATAGTGC TAGAACGATC
CGTTTGCCTA TTCACATAAG TGAAAAACTG TCCAAAATGA GAAGAGTCTC TAGAGAATTA
TCACATAAAT TTGGCAGACA ACCTACAAGA TTGGAAATGG CAACTGAGAT GGGAATTGAT
CAAAAAGATT TAGAAGATTT AATTTCTCAA AGTGCTCCAT GCGCCTCCCT AGATGCACAT
GCAAGAGGGG AAGAAGACAG AAGTACTCTT GGTGAACTCA TACCTGATCC AAACTGTGAA
GAGCCTATGG AGGGTATGGA TAGAACTATT CAAAAGGAGC ATTTAGGAAC TTGGCTTTCT
CAATTAAATG AAAGAGAGCA AAAAATCATG AAGCTAAGGT TTGGGCTAGA TGGTGAAGAA
CCATTAACAC TCGCAGAAAT AGGAAGACAA ATTAATGTTT CACGAGAAAG AGTAAGGCAA
CTAGAAGCTA AAGCAATATT AAAGCTTCGA GTAATGACGA CTCATCAAAA AGCAGCTTAA
 
Protein sequence
MGIPLESAKS SSDNNFDEPR LPNTAGKSRK SKSSLTAKQS QKKSGRLASD SIGYYLSSIG 
RVPLLTPAEE IELAHHVQNM KKLLQIPETD RTQRNLYQIK IGKRARDRMM AANLRLVVSV
AKKYQNQGLE LLDLVQEGAI GLERAVDKFD PAMGYKFSTY AYWWIRQGMT RAIDNSARTI
RLPIHISEKL SKMRRVSREL SHKFGRQPTR LEMATEMGID QKDLEDLISQ SAPCASLDAH
ARGEEDRSTL GELIPDPNCE EPMEGMDRTI QKEHLGTWLS QLNEREQKIM KLRFGLDGEE
PLTLAEIGRQ INVSRERVRQ LEAKAILKLR VMTTHQKAA
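
The sequences above are displayed in space-separated blocks of 10 characters, 6 blocks per line. A minimal sketch of how such a display could be turned back into a contiguous string and checked for a start and stop codon follows. The helper names are hypothetical, and only the first and last lines of the gene sequence are used here. Because each full line holds 60 bases, a multiple of 3, the reading frame is preserved within a line.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for the blocked display."""
    return "".join(text.split()).upper()


def split_codons(dna: str) -> list[str]:
    """Split a sequence into consecutive 3-base codons, dropping any remainder."""
    usable = len(dna) - len(dna) % 3
    return [dna[i:i + 3] for i in range(0, usable, 3)]


# First and last display lines of the gene sequence, copied from the record.
first_line = "ATGGGGATCC CTCTGGAATC TGCAAAAAGC TCTTCAGATA ATAATTTTGA TGAGCCAAGA"
last_line = "CTAGAAGCTA AAGCAATATT AAAGCTTCGA GTAATGACGA CTCATCAAAA AGCAGCTTAA"

first_codon = split_codons(join_blocks(first_line))[0]
last_codon = split_codons(join_blocks(last_line))[-1]

print(first_codon, last_codon, last_codon in STOP_CODONS)
```

The same `join_blocks` helper applies to the protein sequence, since it uses the same blocked layout.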