Gene A9601_03481 details


Gene Information

Locus tag: A9601_03481
Symbol:
ID: 4717037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 318414
End bp: 319709
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 36%
IMG OID: 640078052
Product: carboxyl-terminal protease
Protein accession: YP_001008743
Protein GI: 123967885
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
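
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (which the reported gene length implies) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(318414, 319709)
print(length)                  # 1296
print(protein_length(length))  # 431
```

Both values agree with the Gene Length and Protein Length fields of this record.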


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTCAT CTTTTAACAA ACTTTTAACA TTCAAAAATT TGATCACTGC ATCAATGATC 
ATCGTTTTTT CTATCAATCT TTTGTTGATG GAAAGAGTGG ATGCTCTCAG TGATAGCAGG
CAATTAGTAC TTGATGCTTG GACCTTGGTA AACGAAGGTT TTTATGATCC AGAAAAGTTT
GATGAAATCC AATGGAAAAG AATTAGACAA AAAACATTAC AGAAACAAAT TGAAACAAGT
GAAGAGGCTT ATTCCGCAAT TGAAGACATG TTAAGACCTC TAGACGATCC CTACACGAGA
GTTTTACGCC CCAAAGATTA TGAGCTACTG AAATCAAGTA ATTTTGGGAG TGAAATTAAT
GGTGTTGGGC TTCAATTAGG TGAAGATGAC AACAATAAAG TTAAAGTTAT TTCTACTCTT
GGGGGGTCGC CAGCTGAAGA AGCTGGAATA GTAAGCGGGG ACATTATCGA GACAGTTGAT
GGAATCTCAT CAGAAAAATT AGGGCTTGCA AGTACTGCCT CTAAGTTAAG AGGTGAGTCA
GGGACAAAAG TTTTAGTTGA ATTATCTACG GAATCAGGAG AAATTAGGGA AGTCGATTTA
GAGAGGAGAT CAGTAGATCT CAGACCAGTT AGAACAAAAA GATTAAGAGA CGATTCTCAC
ACAATAGGAT ATTTAAGAAT AACTCAATTT AGCGAAAGCG TACCCAAAAA AGTTGAAGAG
GCTCTTCAAG AGTTAAAAGA GAAAGAGGTT GAGGGCTTAA TCTTGGATCT TAGAAATAAT
TCAGGGGGAC TAGTAAGCTC AGGTATTGCA GTTGCAGACA CATTATTGAG TGAGAAACCC
GTAGTCGAGA CAAAAGATAG AAATGGAATC AAAGATGCAA TTATTTCTCA AAAAGAGACA
TCTTTTGATG GACCAATGGT GACTTTAGTA AATAAAGGCA CTGCAAGTGC CAGTGAAATA
CTTGCTGGTT CTTTAAAAGA TAATGAGAGG TCAATTCTTA TGGGAGAACA AACTTATGGT
AAAGGTTTAA TTCAATCCCT AAAAAGTTTG GGAGAAGATA GTGGTATTGC TATAACAGTG
GCTAGTTACT TAACACCAGA TGGTAATAAT ATACAAGGCC AGGGTATAAC ACCTGACAAA
TTACTTGAAC TACCGGAAGC CAGTGATTTT GGAAGTACTG ACGATAAATG GGTAAGGAAT
GCGGAATTAT TATTAGGGTC GCTTCTAGAA AAAGAAGAAG TTCCAGTTCA AACAATTGAT
TTAAACAATG AAGAAATCAA ATCTTTAAAT GGCTAA
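
The gene sequence is displayed in space-separated blocks of 10 bases, 60 per row, so the whitespace has to be removed before any computation. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The helper names are hypothetical, and only the first row of the sequence is used here as sample input, so the printed value describes that row rather than the whole gene.

```python
def normalize(seq_text: str) -> str:
    """Remove all whitespace (block spaces, newlines) and uppercase."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence as displayed above.
first_row = "ATGAATTCAT CTTTTAACAA ACTTTTAACA TTCAAAAATT TGATCACTGC ATCAATGATC"
seq = normalize(first_row)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this row only
```

Passing the full gene sequence through `normalize` and `gc_fraction` gives a value that can be compared with the GC content field of the record.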
 
Protein sequence
MNSSFNKLLT FKNLITASMI IVFSINLLLM ERVDALSDSR QLVLDAWTLV NEGFYDPEKF 
DEIQWKRIRQ KTLQKQIETS EEAYSAIEDM LRPLDDPYTR VLRPKDYELL KSSNFGSEIN
GVGLQLGEDD NNKVKVISTL GGSPAEEAGI VSGDIIETVD GISSEKLGLA STASKLRGES
GTKVLVELST ESGEIREVDL ERRSVDLRPV RTKRLRDDSH TIGYLRITQF SESVPKKVEE
ALQELKEKEV EGLILDLRNN SGGLVSSGIA VADTLLSEKP VVETKDRNGI KDAIISQKET
SFDGPMVTLV NKGTASASEI LAGSLKDNER SILMGEQTYG KGLIQSLKSL GEDSGIAITV
ASYLTPDGNN IQGQGITPDK LLELPEASDF GSTDDKWVRN AELLLGSLLE KEEVPVQTID
LNNEEIKSLN G
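
The protein sequence can be checked against the gene sequence by translating the CDS. The sketch below builds a codon table from the standard compact amino-acid string; translation table 11 (bacterial, archaeal and plant plastid code) uses the same codon-to-residue assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene begins with ATG. The `translate` function is an illustrative helper, not a library call, and it assumes the input is the coding strand in frame.

```python
from itertools import product

BASES = "TCAG"
# Residues for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop block spacing and newlines
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the last row of the gene sequence above.
print(translate("ATGAATTCAT CTTTTAACAA ACTTTTAACA"))             # MNSSFNKLLT
print(translate("TTAAACAATG AAGAAATCAA ATCTTTAAAT GGCTAA"))      # LNNEEIKSLNG
```

The two outputs match the first block and the final row of the protein sequence shown above.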