Gene P9301_07301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_07301 
Symbol 
ID4912537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp649869 
End bp651203 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content30% 
IMG OID640160312 
Productcarboxyl-terminal processing protease 
Protein accessionYP_001090954 
Protein GI126696068 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGATAA GAAAATTGCT TAAAAAAAAA TTTATATTTC TGTTTGCGAC ATCCTTTTCT 
GGACTATTCT TAAATAGTTT TGCAGAGGCA ACAGTTTTAA ATAATAGTTA TAAAGAAGTA
ATTGATCATG TTTGGCAAAT TGTATATAGA GATTTTCTTG ATTCAAGCGG CAAATTTCAA
AAGTCCAATT GGATTAATCT GAGAAAAGAA GTTTTATCAA AAACATATTC AGATAGCAAT
GAAGCATATG ATGCGATTAG AGATATGCTT TCTAACTTAG ATGATTCTTA TACAAGATTT
TTAGAACCTA AGGAATTTAA TCAAATGAGA ATCGATACCT CTGGTGAATT AACTGGAGTT
GGTATCCAAA TAGTTAAAGA TAAAGAATCT GATGATTTAA TAATTATTTC TCCCATAGAG
GGCACCCCTG CCTTTGATGC TGGAATTAAA GCTAGAGATA AAATATTATC CATAGATGAT
ATTTCTACTG AAGGTATGAA TATTGAGGAG GCCGTGAAAT TAATAAGGGG GCAAAGAGGT
ACTAAAGTAA AGCTTGAAAT TCTTAGAGGT TCTCAATCCT TTTTTAAGAC TTTATCGAGA
GAAAAAATTG AAATAAAATC TGTATCAAGT AAAGTCAATC AAACCAAAAA TGGCTTATTA
ATTGGCTATG TAAGAATTAA ACAATTTAAT GCAAATGCAT CAAAAGAAAC TAGAGATGCT
ATTAAGGATT TAGAAACAAA AAAAGTCGCA GGATATGTTC TTGACTTGAG AAGTAATCCA
GGAGGTTTAT TAGAATCAAG CATTGATATC TCAAGGCACT TTATTAACAA AGGAGTAATA
GTTAGTACAG TAAGTAAAGA TGGTTTAAAA GAAACAAAAA AAGGAAACGG ACAAGCTCTA
ACTAAAAAAC CCCTAGTTGT ACTGGTTAAT GAGGGTTCTG CTAGTGCTAG TGAAATAGTT
TCTGGTGCAA TAAAAGATAA TAAAAGAGGA AAATTAGTTG GAAAAAAAAC GTTTGGTAAA
GGTCTAGTTC AATCTATGAG GACATTAGTT GATGGTTCAG GATTAACTGT TACAGTCGCC
AAGTATTTAA CTCCGAACGG TACTGATATA AACAAATCTG GAATTATTCC AGATATAGAT
GTAAAAATGA ATATCAACCC TATTCTCCAA AGAGAGATTG GAACTAGAAA AGATAAACAA
TATAGAGCTG GTGAAAAAGA GCTAATAAAT ATAATTAATA GAAAGAATCA GATAAGCGAA
TTTAAGCCCG ACACTGCAAA CCTTAATGCT TTCCTAAAAA TTAATAAGGA AAATAAAATA
TTTTTATTAA ATTAA
 
Protein sequence
MKIRKLLKKK FIFLFATSFS GLFLNSFAEA TVLNNSYKEV IDHVWQIVYR DFLDSSGKFQ 
KSNWINLRKE VLSKTYSDSN EAYDAIRDML SNLDDSYTRF LEPKEFNQMR IDTSGELTGV
GIQIVKDKES DDLIIISPIE GTPAFDAGIK ARDKILSIDD ISTEGMNIEE AVKLIRGQRG
TKVKLEILRG SQSFFKTLSR EKIEIKSVSS KVNQTKNGLL IGYVRIKQFN ANASKETRDA
IKDLETKKVA GYVLDLRSNP GGLLESSIDI SRHFINKGVI VSTVSKDGLK ETKKGNGQAL
TKKPLVVLVN EGSASASEIV SGAIKDNKRG KLVGKKTFGK GLVQSMRTLV DGSGLTVTVA
KYLTPNGTDI NKSGIIPDID VKMNINPILQ REIGTRKDKQ YRAGEKELIN IINRKNQISE
FKPDTANLNA FLKINKENKI FLLN