Gene P9303_21861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21861 
Symbol 
ID4777800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1943725 
End bp1945065 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content56% 
IMG OID640087701 
Productcarboxyl-terminal protease 
Protein accessionYP_001018186 
Protein GI124023879 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.43703 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCAA TGTTGCCGAC TACTAACTCC CGATCACATC CGCTGCGGCG CCTATGGCTT 
GCGTGGCTGA GCCTGGGTCT TGGCGTGCTG CTGATGGCGA CTCCAGCCAT GGCATTGAAT
GATGCCCAAC AACTGGTCGT TGAAAGCTGG CGGCTCGTCA ACCAGGGATA TCTAGACCCA
GCCAAGTTCG ATCAGGTGCA CTGGCGAAGG CTAAGAGAGC AGGCATTAGA GAAAACAATC
AACAGTAGTA ACGATGCCTA TGAGGCCATC GAAGCGATGC TGCTTCCACT TGAAGATCCC
TACACAAGAC TATTGAGACC AGACGACTAC ACCGCCATAA AAGCCGCCAA TTTGGGCAGC
GAGATCAACG GCGTTGGTCT ACAGCTGGGG GCCCGCGCTG AAGATGGCCA GGTCGTGGTT
ATCGCTCCGC TTGAAGGATC TCCCGCGGCC GATGCGGGCG TCACAAGCGG CACGGCCCTA
CTCAGCGTGG ATGGCCAGTC TCCACAAGCC CTTGGACTCG AAGCCACAGC TGCAAGGCTG
CGAGGAGAAG TGGGCTCACA AGTTGTAGTA AAACTGCAAC CCCCAAATGG ATCTAGCGAA
GAACTCACCC TCGAGCGACG CAGTGTGGAC CTCAGGCCAG TACGGACCCG CCGACTGAGG
AGTGCTAAAC ACACCCTGGG CTACCTACGC ATCACCCAGT TCAGCGAAGG AGTGCCCGAA
CAGGTCAAAG AAGCACTTCA GGAACTGTCA GAAAAAGAGA TTGAAGGCCT AGTTCTAGAT
CTGCGCAATA ACTCCGGTGG GTTAGTGAGC TCCGGACTAG CTGTAGCGGA TGCCTTCCTG
AGTGGCTCCC CAATCGTGGA GACACGCAAC CGAGAGCGCA TTAACGAAGC AATCCCTTCT
GCAATTGAAA CCCTTTATGA CGGTCCGATG GTGACACTGG TCAACGGCGG GACTGCCAGC
GCGAGCGAAA TTCTGGCTGG TGCCCTCCAA GACAACAGCC GCTCACAGCT GCTTGGCAGC
CGCACGTTTG GCAAGGGTCT GATCCAAACA CTCACCAACC TGAGCGACGG CAGTGGCCTG
GCCGTGACGG TAGCCGGATA CATGACTCCA AGCGGCCGAG ACATTCAAAA CCAGGGCATC
GAGCCGGATC GGATTCTGGA TCCTCCTGAA CCCCTCAATC CTGGAGGGGA AGAAGACCGT
TGGTTGCATG ATGCTGAACT CTGGATGGAG GCCCAAATCG ACCGCGATCA GGATGCCCAG
TTAGAGACCA CAGAAGATCT TCAGCTCGAT AGTGCTGAAG ATGTTGAATT CAAAACTGAG
CAGAATCGTG ATGATCCATG A
 
Protein sequence
MKPMLPTTNS RSHPLRRLWL AWLSLGLGVL LMATPAMALN DAQQLVVESW RLVNQGYLDP 
AKFDQVHWRR LREQALEKTI NSSNDAYEAI EAMLLPLEDP YTRLLRPDDY TAIKAANLGS
EINGVGLQLG ARAEDGQVVV IAPLEGSPAA DAGVTSGTAL LSVDGQSPQA LGLEATAARL
RGEVGSQVVV KLQPPNGSSE ELTLERRSVD LRPVRTRRLR SAKHTLGYLR ITQFSEGVPE
QVKEALQELS EKEIEGLVLD LRNNSGGLVS SGLAVADAFL SGSPIVETRN RERINEAIPS
AIETLYDGPM VTLVNGGTAS ASEILAGALQ DNSRSQLLGS RTFGKGLIQT LTNLSDGSGL
AVTVAGYMTP SGRDIQNQGI EPDRILDPPE PLNPGGEEDR WLHDAELWME AQIDRDQDAQ
LETTEDLQLD SAEDVEFKTE QNRDDP