Gene P9211_17381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_17381 
Symbol 
ID5730913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1565305 
End bp1566486 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content42% 
IMG OID641286123 
Productzinc metallopeptidase 
Protein accessionYP_001551623 
Protein GI159904279 
COG category[R] General function prediction only 
COG ID[COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase 
TIGRFAM ID[TIGR01891] amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.467436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTTT CAGATTGTCA CAAGGAATCT TTGAGGCTGT TTTTGCCTGA ATTGATAACT 
ATCAGAAGGC ATTTACATGC TCATCCAGAA TTAAGTGGTC AAGAGCATCA GACTGCAGCA
ACTGTTGCGG GGGAATTAAA GAAATATGGT TGGGATGTCA CAGAAGGAGT TGGTCGAACA
GGTGTTATTG CAGAGTTAGG AGATAAATCT GCTCCTTGTG TTGGATTAAG AGTGGATATG
GATGCACTCC CCGTAGAAGA GAAAACTGGG TTGCCTTTTT CTTCTTTCAA TCAAGGAGTT
ATGCATGCCT GTGGGCATGA TCTACATACT TGTATAGGGC TAGGTGTCGC AAAACTTTTG
GCAGAAGATA AGAGTAAATT GTCAGGTGTA AGACTTTTAT TCCAACCTGC TGAAGAAATT
GCCTCTGGAG CAAAATGGAT GAAGGAGGAT GGTGCCTTAA CTGGCTTAGA TGCTCTTTTT
GGTGTTCACG TATTTCCTGA AATACCTGTT GGCCAAATAG GTGTTCGCAG AGGTGTTTTG
ACCGCAGCCG CAGGTGAGTT ACAAGTTGAG ATCTTGGGGA ATGGAGGACA TGGTGCCAGA
CCTCATCAAG CAGTTGATGC AATTTGGATA GCAGCAAGAG TGATTAGTGG CATACAAGAA
GCCATAAGTA GATGCTTGGA TCCTCTCTCA CCAGTTGTGA TTAGTTTTGG CAAAATACAA
GGTGGACAAG CTTTTAATGT CATTGCAGAT AAAGTTAGGC TTTTGGGAAC AGTTCGCTGT
TTAGACCTTC AATTAAATGA AACGTTACCC AATTGGATTG AAGAGACAGT AAAAAAAATT
GCTTCTAATT TTGGAGCTGA GGCAAAAGTG CAATATCGCT CAATAGCACC ACCTGTTTAT
AACGATCCTA AGTTGACTCA ATTACTAGAA AATTCTGCAA TTGCATTACT TGGCGATGCA
AATGTTCTCC GCTTAGAACA GCCTTCTTTA GGAGCAGAAG ATTTTGCTGA GCTTTTGCAA
GATATTCCTG GGACTATGTT TCGTCTGGGT GTAAGTGGTT TGAATGGATG CGCTCCTCTC
CATAATGGGT ACTTTGCTCC TGATGAAAGA TGTTTAGAAA TAGGGATTAG TGTATTGACA
AGTACGCTTT TGGATTGGAT GCAAAAAAGG GGACATCATT GA
 
Protein sequence
MPFSDCHKES LRLFLPELIT IRRHLHAHPE LSGQEHQTAA TVAGELKKYG WDVTEGVGRT 
GVIAELGDKS APCVGLRVDM DALPVEEKTG LPFSSFNQGV MHACGHDLHT CIGLGVAKLL
AEDKSKLSGV RLLFQPAEEI ASGAKWMKED GALTGLDALF GVHVFPEIPV GQIGVRRGVL
TAAAGELQVE ILGNGGHGAR PHQAVDAIWI AARVISGIQE AISRCLDPLS PVVISFGKIQ
GGQAFNVIAD KVRLLGTVRC LDLQLNETLP NWIEETVKKI ASNFGAEAKV QYRSIAPPVY
NDPKLTQLLE NSAIALLGDA NVLRLEQPSL GAEDFAELLQ DIPGTMFRLG VSGLNGCAPL
HNGYFAPDER CLEIGISVLT STLLDWMQKR GHH