Gene P9515_18181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_18181 
Symbol 
ID4719987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp1601446 
End bp1602447 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content36% 
IMG OID640081517 
Producttype II alternative sigma-70 family RNA polymerase sigma factor 
Protein accessionYP_001012132 
Protein GI123967051 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family
[TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATCCG AAACATTAGT TGAGAATAAA TTAGCTAATA TATCTTCTCT CAAATCTAGC 
AACGATGTAG ATTTGGTAAG GTCATATTTG AGAGATATTG GAAGAGTTCC ACTACTATCA
CATGAGCAAG AAATAACTCT AGGGAGACAA GTTCAAGAAT ACATGCAAGT TGAGAGGGCT
GAGCTTGAAA TCATAGAATT GACAGATAAT AAACCCTCAG TTGAAGAATT AGCCGCAAAA
TTAAATTTAT CTTCATCACA AATTAAAAAA AGACTCAGAG CTGGTCAACG TGCGAAAGAA
AGAATGGTTG CTGCAAATCT AAGACTTGTT GTTAGTGTTG CCAAAAAATA TACTAAGAGA
AATATGGAAC TACTTGATTT AATTCAAGAA GGAACAATTG GATTGGTAAG GGGGGTAGAA
AAATTCGATC CTGCAAGAGG ATATAAATTT TCAACTTATG CCTATTGGTG GATTAGGCAA
GGAATTACTA GAGCAATTGC GGAAAAAAGT AGAGCTATTA GATTACCTAT TCATATCACC
GAAATGCTTA ATAAGTTAAA AAAAGGGCAA CGAGAATTGA GTCAAGAAAT GTCAAGAACG
CCTACAATTA GCGAACTTGC GAAATATGTT GAATTACCTG AAGATGATGT TAAGGATTTA
ATGTGCAAAG CGGGCCAGCC AGTGAGTCTC GAAACTAAGG TTGGTGATGG CGAAGATACA
GTATTACTAG ATTTACTAGC TGGAGGAGAG GATTTGCCTG ACGAACAAAT TGAGATGGAT
TGTATGAGAG GTGATCTTCA TTCTCTACTT CATCAGCTGC CTGATCTTCA ATGTAGAGTT
TTAAGAATGA GATATGGAAT GGATGGAGAT GAACCAATGT CTCTAACTGG TATTGGACGT
GTTTTAGGTA TCAGTAGAGA TAGAGTAAGA AATTTAGAAA GAGATGGTTT GAGAGGTTTA
CGTAGACTCA GTGAGAATGT TGAAGCCTAT TTCGTTTCTT GA
 
Protein sequence
MSSETLVENK LANISSLKSS NDVDLVRSYL RDIGRVPLLS HEQEITLGRQ VQEYMQVERA 
ELEIIELTDN KPSVEELAAK LNLSSSQIKK RLRAGQRAKE RMVAANLRLV VSVAKKYTKR
NMELLDLIQE GTIGLVRGVE KFDPARGYKF STYAYWWIRQ GITRAIAEKS RAIRLPIHIT
EMLNKLKKGQ RELSQEMSRT PTISELAKYV ELPEDDVKDL MCKAGQPVSL ETKVGDGEDT
VLLDLLAGGE DLPDEQIEMD CMRGDLHSLL HQLPDLQCRV LRMRYGMDGD EPMSLTGIGR
VLGISRDRVR NLERDGLRGL RRLSENVEAY FVS