Gene P9211_10171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_10171 
Symbol 
ID5730946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp909588 
End bp910841 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content39% 
IMG OID641285384 
ProductZn-dependent peptidase 
Protein accessionYP_001550902 
Protein GI159903558 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.109713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAAGACC TGAAGATAAA TCGATTGGCA CTTAGAAGTG GTGCGGAATG TATTTCAACA 
TCAATGCCAG AATCTGCACT TACCTGCATT GACCTTTGGT GCAAGGCCGG AAGTTCCTTT
GAAGACAGTG ACGAGAAAGG AATGGCTCAT TTTCTTGAAC ATATGATTTT CAAAGGAAGC
AGCAAGTTAC GAGAAGGTGA ATTTGATTTA AAGATTGAAG CCCTTGGCGG CAGCAGTAAT
GCTGCGACAG GTTTTGATGA CGTACATTTT TATGTATTAG TACCTTCAGA AGGTGTCGAG
CAAGCAATTA AGTTATTAAT CGAACTTGTC CTATGTCCAA GCATTATGAA AAATGCATAT
TCATTGGAGC GTGAAGTTGT ACTCGAAGAA ATAGCTCAAC AAAGTGATCA ACCTGACGAG
AAAGTGTTTC AGATGGTTCT GGAAGGTTGT TGGAGCAACC ATCCATATGG GAAATCAATT
CTAGGCAATG CATCAAGCCT TAACGCATCG ACTCCGAATC GAATGAAGTT GTTCCATCAA
AGGCTATATA AACCAGAGAA TTGTGTCCTA TCAATCGCTG GGAAGTCCCC GAGAAATTTA
TTAAAAATAT TGAGCGAAGG TGAACTTGGC AAACAAGTTG ATAAATCAAA TCCCAACAAT
TCAAAACCAA ACTCAAAAAA ACTCAACTTT AATATTGGTC GAAAGATAGT AGAGGTTAAA
AGACTTGAGT CAGCGAGATT AGTTATGGCA TGGCCAGTAC CTCCTGCTTC TGAACAGTTC
ATAATAATGG GGTATGACAT AGCGACTACT CTTCTAGGAG AAGGTAGACG TAGTAGACTT
GTTAATAATT TGCGAGAGGA ACAACAAATA GTCGAATCAA TTGAAATGGA CCTAACTGCA
CTTGAACAGG GAGGCCTTGT ATTGCTAGAA GCGTGTTGCA TAGAAAAAAA TCTCAACAAA
GTAGAAGATT CGATTAATCA AATTCTCATA GAAAGTATTA ATAGTCCTCC TAGCGAACGA
GAAACAAAAC GTGCCAAAGA ATTAGTTAGA AATGGATTTT GCTTCAGCCT TGAACATCCA
GCACAGGTTG CAGCAATTAC AGGTACACAA ACACTTTGGA ACCGTCATCA GCCACTGCTT
GAACCTTTAA AGGATATAGA TGGCTGGTCA AGTTCTATGA TCCAAGAAGA AATATTTAGT
TGTTTGCAGC CAAGCCAATG CTTCACTTTA ATTGCAAAAC CTCTTAGTAG TTGA
 
Protein sequence
MQDLKINRLA LRSGAECIST SMPESALTCI DLWCKAGSSF EDSDEKGMAH FLEHMIFKGS 
SKLREGEFDL KIEALGGSSN AATGFDDVHF YVLVPSEGVE QAIKLLIELV LCPSIMKNAY
SLEREVVLEE IAQQSDQPDE KVFQMVLEGC WSNHPYGKSI LGNASSLNAS TPNRMKLFHQ
RLYKPENCVL SIAGKSPRNL LKILSEGELG KQVDKSNPNN SKPNSKKLNF NIGRKIVEVK
RLESARLVMA WPVPPASEQF IIMGYDIATT LLGEGRRSRL VNNLREEQQI VESIEMDLTA
LEQGGLVLLE ACCIEKNLNK VEDSINQILI ESINSPPSER ETKRAKELVR NGFCFSLEHP
AQVAAITGTQ TLWNRHQPLL EPLKDIDGWS SSMIQEEIFS CLQPSQCFTL IAKPLSS