Gene P9303_08291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_08291 
Symbol 
ID4776690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp754100 
End bp755254 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content56% 
IMG OID640086338 
Producttrypsin-like serine protease 
Protein accessionYP_001016845 
Protein GI124022538 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTTG CCCTATCTGC CCACAGGCTG CATCCGATCA GATGGCTTGG CTTGGTCCTC 
ATCAGCATCA ACCTAAGTGG CTGCAACGAA GGGTTACGTC AACGCATCGG CATTGGTTCT
AAGACAAGCC CAGACAACAC CCCTGTCGTC AGTGATCCAC CCAATTCAGC GCCTCTGCAA
CCTGGCACCA ATGTGATCGT GACTGCTGTG GAACAAGTAG GTCCGGCAGT GGTGCGCATC
GACACGGTGA AACGGATCGC AAACCCCCTT GGCAACCTCT TCGGCGGCGG ACCTCCCATC
CAACGGCAAG CAGGCCAGGG GTCAGGTTTC ATCACACGCT CTGACGGGCT GATCTTCACC
AATGCTCATG TGGTTGATGG GGCAGAACAG GTATCGGTAA CCCTTCCAGA TGGCCGCAGT
TACAGCGGCA AAGTGCTTGG TGGTGATCCC CTTACAGATG TCGCCGTGGT CAAAGTCGTG
GCGAAGAAGC TTCCCGTGGC CCCCCTCGGC AATTCCAACA ACATCAAGCC TGGGCAATGG
GCAATCGCTA TCGGCAATCC TCTTGGACTC AACAACACCG TGACTGCAGG CATCATCAGC
TCCGTCGACC GCACCAACGC CTTAGGGGGG GGGCAACGAG TTCCTTACAT CCAAACTGAC
GCCGCCGTAA ACCCTGGCAA TAGCGGAGGA CCACTCATCA ATGCCTCAGG ACAGGTGATC
GGAATCAATA CTGCCATCAA AGTTGCACCG GGAGGCGGGC TGAGTTTTGC AGTACCGATC
AACCTGGCCA AACGCATTGC CCAACAAATC GTGGGGAGAG GGCAAGCTTC TCATCCCTAT
ATCGGGGTAA GGCTTCAGAG CCTTACCCCC CAGCTAGCCA AAGAAATCAA CGCAACAGGA
GGGCAATGCC AGGTGCCTGA AGTCAATGCT GTTCTCGTTG TCGAAGTGAT GTCTCGCAGC
CCTGCAGACA AAGCCGGCGT GCGCCAATGC GACTTAATTA GTGAGGTCAA TGGTGAGGTC
GTCCGCGACC CTTCGCAAGT ACAACTTGCC GTTGATCGTG GGGAGGTTGG CAAGCCCATG
CCGCTCACCC TTGAACGAAA CGACAAGACG ATCGAATTAA TTGTGAAACC AGCAGAGCTA
CCCCGGCAGG GGTGA
 
Protein sequence
MTLALSAHRL HPIRWLGLVL ISINLSGCNE GLRQRIGIGS KTSPDNTPVV SDPPNSAPLQ 
PGTNVIVTAV EQVGPAVVRI DTVKRIANPL GNLFGGGPPI QRQAGQGSGF ITRSDGLIFT
NAHVVDGAEQ VSVTLPDGRS YSGKVLGGDP LTDVAVVKVV AKKLPVAPLG NSNNIKPGQW
AIAIGNPLGL NNTVTAGIIS SVDRTNALGG GQRVPYIQTD AAVNPGNSGG PLINASGQVI
GINTAIKVAP GGGLSFAVPI NLAKRIAQQI VGRGQASHPY IGVRLQSLTP QLAKEINATG
GQCQVPEVNA VLVVEVMSRS PADKAGVRQC DLISEVNGEV VRDPSQVQLA VDRGEVGKPM
PLTLERNDKT IELIVKPAEL PRQG