Gene P9301_03501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_03501 
Symbol 
ID4912475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp318018 
End bp319304 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content36% 
IMG OID640159921 
Productcarboxyl-terminal protease 
Protein accessionYP_001090574 
Protein GI126695688 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCAT CTTTTAACAA ACTTTTAACA TTCAAAACTT TCATCACTGC ATTAATGATC 
ATCGTTTTTT CTATCAATCT CTTATTGGTT GAAAGAGTGA ATGCTCTCAG TGACAGCAGG
CAATTAGTAC TTGATGCTTG GACCTTAGTA AACGAAGGGT TTTATGATCC AGAAAAGTTT
GATGAAATCC AATGGAAAAG AATTAGACAA AAAACATTAC AGAAACAAAT TGAAACAAGT
GAAGAGGCTT ATTCCGCAAT TGAAGACATG TTAAGACCTC TAGACGATCC CTACACCAGA
GTTTTACGCC CAAAAGATTA TGAGCTACTG AAATCAAGTA ATTTTGGTAG TGAAATTAAT
GGGGTTGGGC TTCAATTAGG TGAAGATGAT GACAATAGAG TTAAAGTAAT CTCTACTCTT
GGGGGTTCCC CAGCTGAAGA AGCTGGGATA ATAAGCGGGG ACTTGATAGA GACAGTTGAC
GGAATCTCAT CAGAAAAATT AGGGCTTGCG GGCACTGCCT CTAGGTTAAG AGGTGAATCA
GGGACAAAAG TTTTAGTTGA ATTATCTTCT GAATCAGGAG AAATTAGGGA AGTTGATCTA
GAGAGGAGGT CAGTAGATCT AAGACCAGTT AGAACAAAAA GATTAAGAGA CGATTCTCAC
ACAATAGGAT ATTTAAGGAT AACTCAATTC AGCGAAAGCG TACCCAAAAA AGTTGAAGAG
GCACTTCAAG AGTTGAAAGA GAAAGATGTT GAGGGCTTAA TCTTGGATCT TAGAAATAAT
TCAGGGGGAC TAGTAAGCTC TGGTATAGCA GTTGCAGACT CATTATTAAG TGAGAAACCT
GTAGTCGAGA CAAAAGATAG AAATGGAATT AAAGATGCAA TTATTTCTCA AAAAGAGACA
TATTTTGATG GACCAATGGT GACTTTAGTA AATAAAGGTA CTGCAAGTGC CAGTGAAATA
CTTGCTGGTT CTTTACAAGA TAATGAGAGA TCTATTCTTA TGGGAGAGCA AACTTATGGC
AAAGGTTTAA TTCAATCCCT AAAAAGTTTG GGAGAAGATA GTGGTATTGC AATAACAGTA
GCCAGTTACT TAACCCCCAA AGGTAATAAT ATTCAAGGCC AGGGTATGAC TCCTGACAAA
TTACTAGATC TCCCTGATGC AAATGATTAT GGAAGTACTG ATGATAAATG GGTGAAAAAT
GCAGAATTAT TTTTGGGGTC GCTTCTAGAA AAAGAAGAAG TTTCAGTTCA AACTATTGAA
TTAAATAATG AAGAAATTAA ATCTTGA
 
Protein sequence
MNASFNKLLT FKTFITALMI IVFSINLLLV ERVNALSDSR QLVLDAWTLV NEGFYDPEKF 
DEIQWKRIRQ KTLQKQIETS EEAYSAIEDM LRPLDDPYTR VLRPKDYELL KSSNFGSEIN
GVGLQLGEDD DNRVKVISTL GGSPAEEAGI ISGDLIETVD GISSEKLGLA GTASRLRGES
GTKVLVELSS ESGEIREVDL ERRSVDLRPV RTKRLRDDSH TIGYLRITQF SESVPKKVEE
ALQELKEKDV EGLILDLRNN SGGLVSSGIA VADSLLSEKP VVETKDRNGI KDAIISQKET
YFDGPMVTLV NKGTASASEI LAGSLQDNER SILMGEQTYG KGLIQSLKSL GEDSGIAITV
ASYLTPKGNN IQGQGMTPDK LLDLPDANDY GSTDDKWVKN AELFLGSLLE KEEVSVQTIE
LNNEEIKS