Gene A9601_07321 details


Gene Information

Locus tag: A9601_07321
Symbol:
ID: 4717437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 651033
End bp: 652367
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 30%
IMG OID: 640078446
Product: carboxyl-terminal processing protease
Protein accession: YP_001009125
Protein GI: 123968267
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
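
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 651033
END_BP = 652367
GENE_LENGTH_BP = 1335
PROTEIN_LENGTH_AA = 444


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1335
print(expected_protein_length(GENE_LENGTH_BP))  # 444
```

Both results match the Gene Length and Protein Length fields reported above.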


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.587372
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAGATAA GAGAATTGCT TAAAAAAAAA TATATATTTC TGTTTGCGAC ATCCTTTTCA 
GGGTTATTTT TAAATAATTT TGCAGAGGCA ACAGTTTTAA ATAATAGTTA TAAAGAAGTA
ATTGATCATG TTTGGCAAAT TGTATATAGA GATTTTCTTG ATTCAAGTGG CAAATTTCAA
AAGTCCAATT GGATTAATCT AAGAAAAGAA GTTTTATCAA AAACATATTC AGACAGCAAT
GAAGCATATG ATGCGATTAG AGATATGCTT TCTAATTTAG ATGATTCTTA TACAAGATTT
TTAGAACCTA AGGAATTTAA TCAAATGAGA ATTGATACCT CTGGCGAATT AACTGGAGTT
GGTATCCAAA TAGTTAAGGA AAAAGAATCT GATGATTTAA TAATTATTTC TCCCATAGAG
GGCACCCCTG CATTTGATGC TGGAATTAAA GCTAGAGATA AAATATTATC CATAGATGAT
ATTTCTACTA AAGGTATGAA TATTGAGGAT GCCGTGAAAT TAATAAGAGG ACAAAGAGGT
ACTAAAGTAA AGCTTGAAAT TCTTAGAGGT TCTCAATCCT TTTTTAAGAC TTTATCAAGA
GAAAAAATTG AAATAAAAAC TGTATCAAGT AAAATCAATC AAACCAAAAA TGGCTTATCA
ATTGGCTATG TAAGAATTAA ACAATTTAAT GCAAATGCAT CCAAAGAAAC TAGAGATGCT
ATTAAGGATT TAGAAACAAA AAAAGTCGCA GGATATGTTC TTGACTTGAG AAGTAATCCG
GGAGGTTTAT TAGAATCAAG CATTGATATC TCAAGGCACT TCATTAACAA AGGAGTAATA
GTAAGTACAG TAAGTAAAGA TGGTTTAAAA GAAACGAAAA AAGGAAACGG TAAAGCTCTA
ACAAAAAAGC CCTTAGTTGT TTTGGTTAAT GAGGGTTCTG CTAGTGCTAG TGAAATAGTT
TCTGGTGCAA TAAAAGATAA CAAAAGAGGA AAATTAGTTG GGAAGAAAAC ATTTGGTAAA
GGTCTAGTTC AATCCATGAG AACTTTAGTT GATGGTTCAG GTCTAACTGT TACAGTCGCT
AAGTATTTAA CTCCGAACGG CACTGATATA AACAAATCTG GAATTATTCC CGACATAGAA
GTAAGAATGA ATATAAACCC TATACTTCAA AGAGAGATAG GAACTAGAAA AGATAAACAA
TATAGAGCTG GTGAAAAAGA GCTAATAAAT ATAATTAATA GAAAGAATCA GATAAGCGAA
TTTAAGCCCG ACACCACAAA CCTTAATGCA TTCCTAAAAA TTAATAAGGA AGATAAAGTA
TTTTCATTAA ATTAA
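
A GC content figure like the one in the record can be computed as the fraction of G and C bases in a nucleotide sequence. The following is a minimal sketch, not the database's own method. The helper name `gc_fraction` is hypothetical, and it assumes that whitespace and any non-ACGT characters should be ignored, which suits the space-separated 10-base blocks used here.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters of seq.

    Whitespace and any other characters are skipped, so the
    10-base blocks shown in the record can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
sample = "TTGAAGATAA GAGAATTGCT"
print(f"{gc_fraction(sample):.0%}")
```

Passing the full gene sequence as one string gives the gene-wide value to compare with the GC content field.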
 
Protein sequence
MKIRELLKKK YIFLFATSFS GLFLNNFAEA TVLNNSYKEV IDHVWQIVYR DFLDSSGKFQ 
KSNWINLRKE VLSKTYSDSN EAYDAIRDML SNLDDSYTRF LEPKEFNQMR IDTSGELTGV
GIQIVKEKES DDLIIISPIE GTPAFDAGIK ARDKILSIDD ISTKGMNIED AVKLIRGQRG
TKVKLEILRG SQSFFKTLSR EKIEIKTVSS KINQTKNGLS IGYVRIKQFN ANASKETRDA
IKDLETKKVA GYVLDLRSNP GGLLESSIDI SRHFINKGVI VSTVSKDGLK ETKKGNGKAL
TKKPLVVLVN EGSASASEIV SGAIKDNKRG KLVGKKTFGK GLVQSMRTLV DGSGLTVTVA
KYLTPNGTDI NKSGIIPDIE VRMNINPILQ REIGTRKDKQ YRAGEKELIN IINRKNQISE
FKPDTTNLNA FLKINKEDKV FSLN
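
The gene sequence begins with TTG, yet the protein begins with M. That is the expected behaviour when TTG acts as an initiator codon, which translation table 11 permits. The sketch below illustrates the idea on the first and last codons of this gene. It is a simplified illustration rather than a reference implementation: the function name `translate_cds` is hypothetical, and the set of start codons is listed from memory of NCBI table 11, so it should be checked against the official table before being relied on.

```python
from itertools import product

# Amino acid assignments in TCAG order; table 11 shares these
# assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumed initiator codons for table 11 (verify against NCBI).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str, is_cds_start: bool = True) -> str:
    """Translate a nucleotide string codon by codon, stopping at a stop codon.

    If is_cds_start is True and the first codon is an initiator codon,
    it is emitted as M regardless of its usual meaning.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene: TTG is read as M at the CDS start.
print(translate_cds("TTGAAGATAA GAGAATTGCT TAAAAAAAAA"))  # MKIRELLKKK

# Last 15 bases of the gene: translation stops at the TAA stop codon.
print(translate_cds("TTTTCATTAA ATTAA", is_cds_start=False))  # FSLN
```

Both outputs agree with the first and last residues of the protein sequence listed above.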