Gene P9303_29951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_29951 
Symbol 
ID4776731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2646199 
End bp2647233 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content53% 
IMG OID640088519 
ProductType II alternative RNA polymerase sigma factor, sigma-70 family protein 
Protein accessionYP_001018990 
Protein GI124024683 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.982291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGATCC CTCTGGAATC TGCGAAGGGT GCTTCACTCA CGCCTTCGTC AGAAGTTGTA 
TTACCGTCTA CATCTAAGCA ACCTTCAGAA AAAGCGAATC GAGCTGGCCG TAACGGGCAG
ACATCCCGTA ACCAAAATCG CCAAGGTGGT CGTTTGGGTA CTGATGCGAT CGGGTTCTAC
CTGAGCAGCA TCGGACGCGT CCCTTTGCTG ACCGCAGCTG AGGAAATTGA GCTGGCACAC
CATGTGCAAG CGATGAAGGA ATTGCTGGAG TTACCAGAGC AAGACCACAC TCCACGACAA
CGCCACAAAA TTCGCATGGG CAAACGCGCC CGTGACCGCA TGATGTCAGC CAACCTCCGG
CTCGTGGTTA GCGTTGCCAA AAAATATCAG AATCAGGGCC TTGAACTCCT TGACCTAGTC
CAAGAAGGAG CCATTGGTCT CGAACGTGCT GTCGACAAGT TCGATCCAGC CATGGGTTAT
AAGTTCTCCA CCTACGCCTA TTGGTGGATT CGTCAAGGGA TGACCAGGGC CATCGACAAC
AGCGCTCGCA CCATCCGTCT ACCCATTCAC ATCAGCGAAA AACTCTCCAA GATGCGACGC
ATCTCAAGAG AGCTTTCCCA TCGTTTCGGC CGTCAACCAA ATCGATTGGA GTTAGCCCAT
GCCATGGGCA TTCAACCCCA AGACCTTGAG GATCTCATCG CTCAAAGCGC TCCTTGCGCA
TCTCTCGATG CCCATGCCCG CGGAGAAGAA GACCGCAGCA CCCTAGGTGA ACTGATACCC
GACCCCAATG GGGCCGAACC AATGGAAGGC CTAGATCGCA GCATCCAAAA GGAACACCTA
GGAGGTTGGC TATCTCAGCT CAATGAACGT GAACAGAAAA TCCTGCGCTT GCGCTTCGGT
CTAGATGGTG AAGAACCACT GACCCTCGCT GAAATTGGTC GGCAAATCAG CGTCTCACGA
GAACGCGTAC GACAGTTGGA GGCCAAAGCC ATTCTCAAGC TACGGATGAT GACCAACCAT
CAACAAGCTG CATGA
 
Protein sequence
MGIPLESAKG ASLTPSSEVV LPSTSKQPSE KANRAGRNGQ TSRNQNRQGG RLGTDAIGFY 
LSSIGRVPLL TAAEEIELAH HVQAMKELLE LPEQDHTPRQ RHKIRMGKRA RDRMMSANLR
LVVSVAKKYQ NQGLELLDLV QEGAIGLERA VDKFDPAMGY KFSTYAYWWI RQGMTRAIDN
SARTIRLPIH ISEKLSKMRR ISRELSHRFG RQPNRLELAH AMGIQPQDLE DLIAQSAPCA
SLDAHARGEE DRSTLGELIP DPNGAEPMEG LDRSIQKEHL GGWLSQLNER EQKILRLRFG
LDGEEPLTLA EIGRQISVSR ERVRQLEAKA ILKLRMMTNH QQAA