Gene A9601_19071 details

Gene Information

Locus tag: A9601_19071
Symbol:
ID: 4718646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1644619
End bp: 1645638
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 39%
IMG OID: 640079642
Product: type II alternative sigma-70 family RNA polymerase sigma factor
Protein accession: YP_001010297
Protein GI: 123969439
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
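
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1644619, 1645638)
print(length)                  # 1020
print(protein_length(length))  # 339
```

Both values match the Gene Length and Protein Length fields of this record.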


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGGGATCC CTCTGGAATC TGCGAAAAGC TCTTCAGATA ATAATTTTGA TGAGCCAAGA 
TTACCAAACA CTGCGGGCAA ATCTCGCAAA TCGAAATCCA GTCTTACGGC AAAACAAAGC
CAAAAAAAAT CTGGCAGACT CGCTTCAGAT TCTATTGGCT ATTACTTAAG TAGCATTGGA
AGAGTACCTC TTTTGACTCC AGCAGAGGAA ATAGAGTTAG CTCATCATGT TCAGAACATG
AAAAAGTTGC TACAAATTCC TGAAACTGAT AGAACCCAAC GAAATCTTTA TCAAATTAAG
ATTGGCAAAA GAGCAAGAGA TAGAATGATG GCAGCTAATC TAAGGCTCGT TGTCTCAGTT
GCAAAAAAAT ACCAAAACCA AGGGCTTGAA TTATTAGATC TTGTCCAGGA AGGAGCTATT
GGACTTGAAA GAGCCGTAGA TAAATTTGAT CCTGCTATGG GATATAAATT CTCAACTTAT
GCTTACTGGT GGATTAGACA AGGAATGACG AGGGCAATTG ATAACAGTGC TAGAACCATT
CGTTTGCCTA TTCACATAAG TGAAAAACTA TCCAAAATGA GAAGAGTCTC CAGAGAATTA
TCACACAAAT TTGGCAGACA ACCTACAAGA TTGGAAATGG CAACTGAGAT GGGAATTGAT
CAAAAAGATT TAGAAGATTT AATTTCTCAA AGTGCTCCTT GCGCCTCCCT AGATGCACAT
GCAAGAGGGG AAGAAGACAG AAGTACACTT GGTGAACTCA TACCTGATCC AAACTGTGAA
GAGCCTATGG AAGGTATGGA TAGAACTATT CAAAAAGAGC ATTTAGGAAC TTGGCTTTCT
CAATTAAATG AAAGAGAGCA AAAAATCATG AAGCTCAGAT TTGGGCTAGA TGGTGAAGAA
CCATTAACAC TCGCAGAAAT AGGAAGACAA ATTAATGTTT CGCGAGAAAG AGTAAGGCAA
CTAGAAGCTA AAGCAATATT AAAGCTTCGA GTAATGACAA CTCATCAAAA AGCAGCTTAA
 
Protein sequence
MGIPLESAKS SSDNNFDEPR LPNTAGKSRK SKSSLTAKQS QKKSGRLASD SIGYYLSSIG 
RVPLLTPAEE IELAHHVQNM KKLLQIPETD RTQRNLYQIK IGKRARDRMM AANLRLVVSV
AKKYQNQGLE LLDLVQEGAI GLERAVDKFD PAMGYKFSTY AYWWIRQGMT RAIDNSARTI
RLPIHISEKL SKMRRVSREL SHKFGRQPTR LEMATEMGID QKDLEDLISQ SAPCASLDAH
ARGEEDRSTL GELIPDPNCE EPMEGMDRTI QKEHLGTWLS QLNEREQKIM KLRFGLDGEE
PLTLAEIGRQ INVSRERVRQ LEAKAILKLR VMTTHQKAA
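
The GC content field of this record can be recomputed from the gene sequence. The sequences here are printed in space-separated blocks of ten characters, so whitespace has to be stripped first. The following is a minimal sketch; `clean_sequence` and `gc_content` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence, used only as sample input.
first_blocks = "ATGGGGATCC CTCTGGAATC"
print(len(clean_sequence(first_blocks)))
print(gc_content(first_blocks))
```

Applying `gc_content` to the full gene sequence and rounding to a whole percentage gives a figure to compare against the reported GC content.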