Gene P9303_19601 details


Gene Information

Locus tag: P9303_19601
Symbol:
ID: 4777206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1724844
End bp: 1725785
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 51%
IMG OID: 640087470
Product: Type II alternative RNA polymerase sigma factor, sigma-70 family protein
Protein accession: YP_001017967
Protein GI: 124023660
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
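
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 1724844
END_BP = 1725785
GENE_LENGTH_BP = 942
PROTEIN_LENGTH_AA = 313


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codon count, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 942
print(coding_codons(GENE_LENGTH_BP))   # 313
```

Both results agree with the Gene Length and Protein Length fields of the record.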


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0713549
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTTCAG CAGCGCCTAA ATCAGCAGAA ACACAGAGGC GTAGAAGTTC TGATCCTGTC 
AGCTGGTATC TCACAACGAT TGGGCGTATA CCTCTTCTTA CCCCTGCTGA AGAGATTGAA
CTTGGCAATC AAGTTCAGAC GATGATGAGT CTCACTCAAG ACGGCTCAGT TGCGCCTGAT
GATAAGGAGT TTACGACACA TCAGCGTCGC ATGATTCGCA TTGGCCGTCG TGCCAAAGAA
CGCATGATGA AGGCCAATCT TCGTCTTGTT GTGAGTGTTG CCAAGAAATA TCAAGGCAAA
GGACTGGAAC TCCTCGATCT CATCCAGGAG GGTTCACTTG GTTTAGAGCG TGCTGTTGAA
AAGTTTGATC CAACCCGTGG CTACAAGTTT TCGACCTATG CGTTTTGGTG GATTCGTCAG
AGCATGACAC GTGCGATTGC GTGCCAGTCG CGCACGATTC GCCTTCCTGT ACATCTCAGT
GAAAGGCTGA CCACAATTCG AAAGGTTTCT CTGGATTTGG CTCACAAGCT TGGAGCAATG
CCCAGTCGCT CCGAGATCGC TGAAGCGATG GATATCCCTG TTGATGAACT CGACTCTTTA
TTGCGTCAGG CGCTAACAAC CAGCAGTTTG GATGCGCCAG TGAATGGCGA AGAAGGACGA
AGTTTTCTTG GTGATCTGAT CGCTGATTCC TCTCTTGGTG AACCTCTCGA CAAGGTGGAG
CAGCGTATTC ATCATGAGCA GCTCGGGCGT TGGCTCAGCC ATCTCAGTGA GCAGGAGCAG
CATGTTCTTA AGCTCCGTTT TGGTTTGGAA ACCCATGATC GACACACCTT GGCTGAGATT
GGTCGCTTGA TGGAAGTCTC GCGTGAGCGT GTTCGTCAAG TGGAACTAAA GGCCTTGCGC
AAGCTGCGTA ACCTCACGCG TAGGGTGCCC AACGGGATCT GA
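
The gene sequence is printed in blocks of ten bases, so it needs cleaning before any computation. The sketch below shows one way to strip the spacing and compute a GC fraction such as the one in the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input, so the printed value is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGGTTTCAG CAGCGCCTAA ATCAGCAGAA ACACAGAGGC GTAGAAGTTC TGATCCTGTC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{100 * gc_fraction(seq):.1f}% GC in the first row")
```

Passing the full sequence text through `clean_sequence` first gives the per-gene value.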
 
Protein sequence
MVSAAPKSAE TQRRRSSDPV SWYLTTIGRI PLLTPAEEIE LGNQVQTMMS LTQDGSVAPD 
DKEFTTHQRR MIRIGRRAKE RMMKANLRLV VSVAKKYQGK GLELLDLIQE GSLGLERAVE
KFDPTRGYKF STYAFWWIRQ SMTRAIACQS RTIRLPVHLS ERLTTIRKVS LDLAHKLGAM
PSRSEIAEAM DIPVDELDSL LRQALTTSSL DAPVNGEEGR SFLGDLIADS SLGEPLDKVE
QRIHHEQLGR WLSHLSEQEQ HVLKLRFGLE THDRHTLAEI GRLMEVSRER VRQVELKALR
KLRNLTRRVP NGI
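
The protein sequence follows from the gene sequence by codon-wise translation. Below is a minimal sketch in plain Python, assuming the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGGTTTCAG CAGCGCCTAA ATCAGCAGAA ACACAGAGGC GTAGAAGTTC TGATCCTGTC"
print(translate(first_row))  # MVSAAPKSAETQRRRSSDPV
```

The output matches the first twenty residues of the protein sequence listed above.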