Gene P9301_14741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_14741 
Symbol 
ID4911039 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1242742 
End bp1243683 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content36% 
IMG OID640161066 
Producttype II alternative sigma-70 family RNA polymerase sigma factor 
Protein accessionYP_001091698 
Protein GI126696812 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTCAT CATTACCAAC ACAAGCAGAA CCGCCTAAAA GGAGAAGCAA TGATCCTATT 
AGCTGGTATT TGCAAAATAT CGGTAGAGTT CCTTTATTAA CACCTGCCGA GGAGATTGAG
TTGGGTAATC AAGTCCAGAA GATGATGATT CTTACAGAAG ATGGACAATT AAATGAAAAG
ACTAATGAAT TTACAACTCA GCAAAAAAGA ACAATAAAAA TTGGTCGAAG AGCTAAAGAA
AGAATGATGA AAGCTAATTT AAGATTAGTT GTCAGTGTTG CAAAAAAATA TCAAGGTAAA
GGACTGGAAC TTCTTGATTT AGTACAAGAA GGTTCTCTTG GGTTGGAGAG GGCTGTTGAA
AAATTTGATC CGACAAGGGG GTATAAGTTT TCTACATACG CTTTTTGGTG GATTAGACAG
AGTATGACAA GAGCAATTGC TTGTCAATCA AGAACAATTC GTTTACCTGT TCACTTAAGT
GAAAGGTTAG CTTCAATTAG AAAAGTTAGT AGAGATTTGG CTCATAAACT TGGTGCTATG
CCCAGCAGGA TTGAAATTGC AGAGGCTATG GAAATTGATG TAGAAGAATT GGATTCTGTC
TTAAGACAAG CTTTATCGAC AAGTAGTTTA GATGCTCCAG TAAATGGCGA TGACGGCAGA
AGCTTTTTAG GTGATTTAAT TGCTGATAGT AATAATGAAG AACCTTTAGA TCAAGTTGAA
CAAAAAATGC ATCAAGAGCA ACTTGGTAAG TGGTTAAGTC ATTTGAGCGA GCAAGAACAA
CATGTTCTCA AATTAAGATT TGGGCTTGAT GCAAATGAGA GACACACACT TGCTGAAATT
GGAAGATTAT TAGAAGTTTC CAGAGAAAGA GTAAGACAAG TTGAACTTAA GGCGCTAAGA
AAATTAAGGA ATTTAACTAG AAAATTACCT AGCAGTATTT AA
 
Protein sequence
MISSLPTQAE PPKRRSNDPI SWYLQNIGRV PLLTPAEEIE LGNQVQKMMI LTEDGQLNEK 
TNEFTTQQKR TIKIGRRAKE RMMKANLRLV VSVAKKYQGK GLELLDLVQE GSLGLERAVE
KFDPTRGYKF STYAFWWIRQ SMTRAIACQS RTIRLPVHLS ERLASIRKVS RDLAHKLGAM
PSRIEIAEAM EIDVEELDSV LRQALSTSSL DAPVNGDDGR SFLGDLIADS NNEEPLDQVE
QKMHQEQLGK WLSHLSEQEQ HVLKLRFGLD ANERHTLAEI GRLLEVSRER VRQVELKALR
KLRNLTRKLP SSI