Gene Information |
Locus tag | P9303_18561 |
Symbol | prlC |
ID | 4775991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1616422 |
End bp | 1618569 |
Gene Length | 2148 bp |
Protein Length | 715 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640087365 |
Product | M3 family peptidase |
Protein accession | YP_001017863 |
Protein GI | 124023556 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
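The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table.
start_bp = 1616422
end_bp = 1618569
gene_length_bp = 2148
protein_length_aa = 715


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)      # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)  # Gene Length agrees with Protein Length
```

Under these assumptions both comparisons hold for this record: the interval spans 2148 bp, which is 716 codons, or 715 residues once the stop codon is excluded.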
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGTTG CTGTGAACGA AATAAAGTCG CCAGCCCTCC TCCAGGGTGA AGGAATCCCA AACTTCTCTG CCATTACTGC TCAACAAGTT CAGGATCACA TACCTGAACT TCTCTGTGCT CTTAACAAAC AATTCAGCCA GCTGGAACAA GACCTAGACA AAGTTTTGGC ATCTGGCAAG AGCATTAATT GGCAACAAGT CATGTCTCCC CTGCATCAGC TTCAAGAGCA ATTGCGTTGG AGCTGGGGAG TTGTATCCCA TCTCAACGGC GTCTGCAATA CCTCCGAATT ACGACAAGCT CACGCTGCAC AGGAACCAGA GGTGGTGCGC TTTGGAAACC TTATGGGCCA AAGCCAAACA CTTCATCGTG CCCTGCGCCG GCTGAAAGAC CAGACCTCAA GGCCATTACT GGATTCGACT CAACAGCGGA TTTTGAACGC GGAATTACTC TCCATGGATC AACGAGGCGT GGGTCTCGAT GACCACGCAC AGCAGGCTTT TAATACCACC AGTGAGCAAC TGGCCGAGCT GTCGACCCGC TTCAGCAATC ACGTCCTGGA TGCAACCCAG GGATGGAGCC TGCTCCTCAA TCGATCATCA CAAGTTGACG GACTGCCGCA GAGGGCCTTG GAGGTTTTGT CCTCAGCAGC CAAGCAAGCC GGAGATCACC GAGAAGACGG TGGAGAACCG ACAGCAGAGC AGGGTCCCTG GCGATTAGGT CTAGATATGC CCCGCTACAT CCCCTTCATC ACCCATGCAA AAGATCGAGG GCTACGTGAA ACGCTCTACA AGGCTCATGT GAGTAGAGCG AGTGCAGGTG AGTTAAACAA TCAACCATTG ATTGAAGAGC TGCTAAGCCT TCGGCTTGAG CAAGCTCAAA GGCTTGGCTA CATGAACTGG GCTGAACTCA GCCTGGCCAG CAAAATGGCC GAGGGCGTTG AGGCCGTCGA GCAGTTGCTT GAGGAGTTGC GTGCTGCCGC TTTACCCGCC GCCCAAACGG AGTTAATTGA GCTCGAGGCC TGTGCCAAAA AACATGGAGC TCCAGAGGCC AGCCAGCTCA AACCATGGGA TGTGAACTTC TGGGCTGAAC GGCTGAGGCA AGAGCGCTTT GACCTTGATC AAGAAGCGCT GCGCCCTTGG TTCCCCCTGC CACAGGTTTT GGAAGGTTTG TTCGGACTTT GTGAACGTCT TTTTGGCATT CGCATTCAAA GTGCCGACGG CGAGGCTCCG ATCTGGCATC AAGACGTGCG TTATTTCCGG GTGTTGGATG CCAATGGTTC AGACCTGGCA GCGTTCTACC TCGATCCCTA TAGCCGACCA GCCAGCAAGC GAGGGGGGGC ATGGATGGAC GAATGCCTGA TACGCAGCAA AAGCCTCGAG GGCCAATCGA TTCTTCCAGT GGCCTATTTG ATTTGTAATC AGACCCCACC GCAAGCAGAT ACACCAAGTC TGATGAGCTT CGATGAGGTG GAGACTTTGT TCCATGAGTT CGGCCATGGT CTTCAGCACA TGCTCACGAC CGTTGAGTAT CCACAGGCTG CAGGAATCAA CAACGTGGAA TGGGACGCAG TGGAACTGCC TAGCCAGTTC ATGGAGAACT GGTGCCTCGA TCGCACCACG TTGATGGGGA TGGCACGTCA CTGGCGGACT GGCGAACCGC TTCCGGAGGA GGAGTTCGCA AAATTGCGCT CCAGCCGCAC CTTTAATGCC GGTTTGGCAA CTCTGCGCCA GGTGCATTTC GCTCTCACTG ATCTGCGCTT GCATAGTTGC TGGACACCAG ATCTCGGCGT GACCCCAGAT 
CAGCTTCGCC GTCAGATTGC TGAGACCACC ACGGTGATGC TTCCGATCGC CGAAGATAAA TTCCTCTGTG GCTTTGGCCA CATCTTCGCC GGTGGTTATT CTGCCGGGTA CTACTCCTAC AAGTGGGCAG AAGTTCTTAG TGCTGATGCC TTTGCTGCCT TTGAGGAAGC TGGTCTGGAA CTTGAAGATC AAGTTCGACT CACTGGCGCT CGCTTTCGCG ACACAGTTCT CAGTTTGGGA GGTAGCCATT CCCCAGCAGA CGTCTATGAG CAATTCCGCG GGCGACCAGC GACCACCGAG GCATTGATTC GCCATTCCGG CTTAGCCGCA AGCGCTGCGG ATCAGTGA
|
Protein sequence | MPVAVNEIKS PALLQGEGIP NFSAITAQQV QDHIPELLCA LNKQFSQLEQ DLDKVLASGK SINWQQVMSP LHQLQEQLRW SWGVVSHLNG VCNTSELRQA HAAQEPEVVR FGNLMGQSQT LHRALRRLKD QTSRPLLDST QQRILNAELL SMDQRGVGLD DHAQQAFNTT SEQLAELSTR FSNHVLDATQ GWSLLLNRSS QVDGLPQRAL EVLSSAAKQA GDHREDGGEP TAEQGPWRLG LDMPRYIPFI THAKDRGLRE TLYKAHVSRA SAGELNNQPL IEELLSLRLE QAQRLGYMNW AELSLASKMA EGVEAVEQLL EELRAAALPA AQTELIELEA CAKKHGAPEA SQLKPWDVNF WAERLRQERF DLDQEALRPW FPLPQVLEGL FGLCERLFGI RIQSADGEAP IWHQDVRYFR VLDANGSDLA AFYLDPYSRP ASKRGGAWMD ECLIRSKSLE GQSILPVAYL ICNQTPPQAD TPSLMSFDEV ETLFHEFGHG LQHMLTTVEY PQAAGINNVE WDAVELPSQF MENWCLDRTT LMGMARHWRT GEPLPEEEFA KLRSSRTFNA GLATLRQVHF ALTDLRLHSC WTPDLGVTPD QLRRQIAETT TVMLPIAEDK FLCGFGHIFA GGYSAGYYSY KWAEVLSADA FAAFEEAGLE LEDQVRLTGA RFRDTVLSLG GSHSPADVYE QFRGRPATTE ALIRHSGLAA SAADQ
|
| |