Gene P9303_02461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_02461 
Symbol 
ID4778226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp259014 
End bp260201 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content57% 
IMG OID640085750 
Productserine protease 
Protein accessionYP_001016266 
Protein GI124021959 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCGT TGCTTGAGGA ATCCTCCGCA GAGTTGCGGT TATCCAGCCA ATCTTCAGGT 
TTGAGCAGGG TTTTCCGTGG TGGTCTTCTG CTTGGCACTG GATTGGTTTG TCTGCCTGTG
TTGTCGGGAC TTTCCCCTCA ACCGCTGCAG GCGGCATCAG CCGCGACAGC CTTATCCCGG
CAGTCGTTTG TAGCGGATGC GGTGGCTCGC AGCGGCCCTG CTGTGGTGAC TCTTGAGACG
AGTCGGACGG TCCGCTCTAT GGGCATGGCT GGTCTGCCTC AGGGACTGTT AGCAGACCCG
CTGTTCCAAC ATTTCTTTGG TTTACCTGGC AGGGTCGCTC CCCGCGCGCG GATAGAACGA
GGGCAGGGAA GTGGTGTGAT TTTTTCGGCG GAAGGATTGG TTCTCACCAA TGCTCATGTC
GTGGAGAAAA CCGATCAGCT GATGGTGGGA TTGCCAGATG GTCGAAGGGT GTCTGGGCGA
CTGGTTGGTC AGGACACGAT TACGGATCTG GCTGTAGTGC AGTTGGATGG CTCTGGGCCT
TGGCCCACAG CTCCATTGGG AGACTCGGAC CAGCTTCGGG TCGGGGATTG GGCCATAGCC
GTTGGCAACC CCTTTGGCCT TGAAAATACG GTGACGTTGG GGATCGTCAG CAACCTCAAC
CGCAACGTCT CTCAGCTAGG GATCTCTGGC AAGCGGTTGG ATTTGATTCA GACCGATGCT
GCTATCAATC CAGGCAACTC AGGAGGACCG TTGTTGAATT CTGAGGGCAA TGTGGTGGGT
ATCAATACAC TCGTTCGCTC TGGGCCAGGC GCTGGGCTGG GTTTTGCTAT TCCTATTAAT
CGGGCCAGGA CCATCGCCCA GCAGCTGGTG GAGCGAGGAC GGGCCAGTCA TCCGATGGTG
GGAGTAGGTC TCTCGCCGGT GCCATCTGCT CGTTCTGGGG AAGCCAATTC TCCTGGTGCT
GTGATTCGTT CCGTGGTGCC GGGTGGGCCT GCAGCAAGTG CCGGTTTGAA GGTTGATGAT
GTGATCGTTT CGGTTGAAGG GTTACCGATT GATGGGCCTG CTGAGGTAGT GAGCGCGATT
GATCGTCATG GAGTTGGGAG TCCAATCACT CTTGGATTAA TCCGGGGCGA CAGTCGGATT
GAGCTGGCGG TTACACCAGT GGAGCTGACG GCGATGCAGG CACCTTGA
 
Protein sequence
MSALLEESSA ELRLSSQSSG LSRVFRGGLL LGTGLVCLPV LSGLSPQPLQ AASAATALSR 
QSFVADAVAR SGPAVVTLET SRTVRSMGMA GLPQGLLADP LFQHFFGLPG RVAPRARIER
GQGSGVIFSA EGLVLTNAHV VEKTDQLMVG LPDGRRVSGR LVGQDTITDL AVVQLDGSGP
WPTAPLGDSD QLRVGDWAIA VGNPFGLENT VTLGIVSNLN RNVSQLGISG KRLDLIQTDA
AINPGNSGGP LLNSEGNVVG INTLVRSGPG AGLGFAIPIN RARTIAQQLV ERGRASHPMV
GVGLSPVPSA RSGEANSPGA VIRSVVPGGP AASAGLKVDD VIVSVEGLPI DGPAEVVSAI
DRHGVGSPIT LGLIRGDSRI ELAVTPVELT AMQAP