Gene P9303_18561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_18561 
SymbolprlC 
ID4775991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1616422 
End bp1618569 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content54% 
IMG OID640087365 
ProductM3 family peptidase 
Protein accessionYP_001017863 
Protein GI124023556 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGTTG CTGTGAACGA AATAAAGTCG CCAGCCCTCC TCCAGGGTGA AGGAATCCCA 
AACTTCTCTG CCATTACTGC TCAACAAGTT CAGGATCACA TACCTGAACT TCTCTGTGCT
CTTAACAAAC AATTCAGCCA GCTGGAACAA GACCTAGACA AAGTTTTGGC ATCTGGCAAG
AGCATTAATT GGCAACAAGT CATGTCTCCC CTGCATCAGC TTCAAGAGCA ATTGCGTTGG
AGCTGGGGAG TTGTATCCCA TCTCAACGGC GTCTGCAATA CCTCCGAATT ACGACAAGCT
CACGCTGCAC AGGAACCAGA GGTGGTGCGC TTTGGAAACC TTATGGGCCA AAGCCAAACA
CTTCATCGTG CCCTGCGCCG GCTGAAAGAC CAGACCTCAA GGCCATTACT GGATTCGACT
CAACAGCGGA TTTTGAACGC GGAATTACTC TCCATGGATC AACGAGGCGT GGGTCTCGAT
GACCACGCAC AGCAGGCTTT TAATACCACC AGTGAGCAAC TGGCCGAGCT GTCGACCCGC
TTCAGCAATC ACGTCCTGGA TGCAACCCAG GGATGGAGCC TGCTCCTCAA TCGATCATCA
CAAGTTGACG GACTGCCGCA GAGGGCCTTG GAGGTTTTGT CCTCAGCAGC CAAGCAAGCC
GGAGATCACC GAGAAGACGG TGGAGAACCG ACAGCAGAGC AGGGTCCCTG GCGATTAGGT
CTAGATATGC CCCGCTACAT CCCCTTCATC ACCCATGCAA AAGATCGAGG GCTACGTGAA
ACGCTCTACA AGGCTCATGT GAGTAGAGCG AGTGCAGGTG AGTTAAACAA TCAACCATTG
ATTGAAGAGC TGCTAAGCCT TCGGCTTGAG CAAGCTCAAA GGCTTGGCTA CATGAACTGG
GCTGAACTCA GCCTGGCCAG CAAAATGGCC GAGGGCGTTG AGGCCGTCGA GCAGTTGCTT
GAGGAGTTGC GTGCTGCCGC TTTACCCGCC GCCCAAACGG AGTTAATTGA GCTCGAGGCC
TGTGCCAAAA AACATGGAGC TCCAGAGGCC AGCCAGCTCA AACCATGGGA TGTGAACTTC
TGGGCTGAAC GGCTGAGGCA AGAGCGCTTT GACCTTGATC AAGAAGCGCT GCGCCCTTGG
TTCCCCCTGC CACAGGTTTT GGAAGGTTTG TTCGGACTTT GTGAACGTCT TTTTGGCATT
CGCATTCAAA GTGCCGACGG CGAGGCTCCG ATCTGGCATC AAGACGTGCG TTATTTCCGG
GTGTTGGATG CCAATGGTTC AGACCTGGCA GCGTTCTACC TCGATCCCTA TAGCCGACCA
GCCAGCAAGC GAGGGGGGGC ATGGATGGAC GAATGCCTGA TACGCAGCAA AAGCCTCGAG
GGCCAATCGA TTCTTCCAGT GGCCTATTTG ATTTGTAATC AGACCCCACC GCAAGCAGAT
ACACCAAGTC TGATGAGCTT CGATGAGGTG GAGACTTTGT TCCATGAGTT CGGCCATGGT
CTTCAGCACA TGCTCACGAC CGTTGAGTAT CCACAGGCTG CAGGAATCAA CAACGTGGAA
TGGGACGCAG TGGAACTGCC TAGCCAGTTC ATGGAGAACT GGTGCCTCGA TCGCACCACG
TTGATGGGGA TGGCACGTCA CTGGCGGACT GGCGAACCGC TTCCGGAGGA GGAGTTCGCA
AAATTGCGCT CCAGCCGCAC CTTTAATGCC GGTTTGGCAA CTCTGCGCCA GGTGCATTTC
GCTCTCACTG ATCTGCGCTT GCATAGTTGC TGGACACCAG ATCTCGGCGT GACCCCAGAT
CAGCTTCGCC GTCAGATTGC TGAGACCACC ACGGTGATGC TTCCGATCGC CGAAGATAAA
TTCCTCTGTG GCTTTGGCCA CATCTTCGCC GGTGGTTATT CTGCCGGGTA CTACTCCTAC
AAGTGGGCAG AAGTTCTTAG TGCTGATGCC TTTGCTGCCT TTGAGGAAGC TGGTCTGGAA
CTTGAAGATC AAGTTCGACT CACTGGCGCT CGCTTTCGCG ACACAGTTCT CAGTTTGGGA
GGTAGCCATT CCCCAGCAGA CGTCTATGAG CAATTCCGCG GGCGACCAGC GACCACCGAG
GCATTGATTC GCCATTCCGG CTTAGCCGCA AGCGCTGCGG ATCAGTGA
 
Protein sequence
MPVAVNEIKS PALLQGEGIP NFSAITAQQV QDHIPELLCA LNKQFSQLEQ DLDKVLASGK 
SINWQQVMSP LHQLQEQLRW SWGVVSHLNG VCNTSELRQA HAAQEPEVVR FGNLMGQSQT
LHRALRRLKD QTSRPLLDST QQRILNAELL SMDQRGVGLD DHAQQAFNTT SEQLAELSTR
FSNHVLDATQ GWSLLLNRSS QVDGLPQRAL EVLSSAAKQA GDHREDGGEP TAEQGPWRLG
LDMPRYIPFI THAKDRGLRE TLYKAHVSRA SAGELNNQPL IEELLSLRLE QAQRLGYMNW
AELSLASKMA EGVEAVEQLL EELRAAALPA AQTELIELEA CAKKHGAPEA SQLKPWDVNF
WAERLRQERF DLDQEALRPW FPLPQVLEGL FGLCERLFGI RIQSADGEAP IWHQDVRYFR
VLDANGSDLA AFYLDPYSRP ASKRGGAWMD ECLIRSKSLE GQSILPVAYL ICNQTPPQAD
TPSLMSFDEV ETLFHEFGHG LQHMLTTVEY PQAAGINNVE WDAVELPSQF MENWCLDRTT
LMGMARHWRT GEPLPEEEFA KLRSSRTFNA GLATLRQVHF ALTDLRLHSC WTPDLGVTPD
QLRRQIAETT TVMLPIAEDK FLCGFGHIFA GGYSAGYYSY KWAEVLSADA FAAFEEAGLE
LEDQVRLTGA RFRDTVLSLG GSHSPADVYE QFRGRPATTE ALIRHSGLAA SAADQ