Gene P9303_14661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_14661 
Symbol 
ID4777512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1261676 
End bp1262677 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content45% 
IMG OID640086976 
ProductM23/M37 familypeptidase 
Protein accessionYP_001017477 
Protein GI124023170 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0739] Membrane proteins related to metalloendopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.122507 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCCCC TGTTGTTGCT TATCTCTTCC TTTGCAGCAC CAGTGCTTGC TTTAGGGAAT 
CTTGGGTCCT TGCCTGGCTA TGCAGATAGT GCTGTACAGG AGAAGGTCAA GATCTCAAGC
ACAACCTCAC AGCAACTGAT ATGGATAAAA GTTGCCCTAC CAGTAACCAT AGAACAATTA
GCAGGCAAAC TTGGGCTGAA AGCGACTGAA CTGTCAAAAC TCAATAAAAA GTCATCCAAT
ACAGAACTAA CCAAAGGTAG TTGGATTGTT TTGCCAAGAT CTGCTCACAA CCAATTGAGG
CGTATCAGTT ACCTAGATTC TGAGCAGGTG TTGCTGCATA ACCCTCGTAA CAGTAAAAAC
CAACTAAATA ACAATCGATT GAGTCGACTG CTTGATGAAA CTAAGAAAAA AAACACGTTG
TATTCGTTGA ATGATCACAA TAAGAATACA AATCGGGTAC AGAACCAAAC AAAAGTAAGT
AGCAACAACA TCCTTAAGAA GGAGTGCTCC CTTGAGTCTC CATGCAATTG CCCCACCTGC
TTAGATGTTG AATCACCAGA GAGCACAGTA GATCTGTTCA CCAGAAGTAA TGACATGCTT
CAGCTAGGAA GCATTGATTC TGATTCCTAC ATATGGCCTA CTAAAGGTGT TTTCACATCT
GGATTCGGAT GGAGGTGGGG GAGAATGCAT CAAGGTATCG ATATCGCCAA TAAAGTGGGC
ACTCCCGTTT TTGCAGCAAA AGACGGAATA GTCACCTATG CCGGATGGAG GGGGGCCTAC
GGCTACCTCG TAGAAATTGC ACATGGTGGC GGCTCCACAA CTCGCTATGC CCATAACAAT
CAGATTTTGG TGCGCAGTGG TCAGTTCATA CCGCAAGGAG CAACGATCTC GAAGATGGGC
AGCACAGGTC GGAGCACTGG TCCACATCTC CATTTTGAGA TCAGAAAGAA GGGTGGCTTA
GCAATGAATC CAGTCACGTT GCTTCCATCG AATAAGGTCT GA
 
Protein sequence
MKPLLLLISS FAAPVLALGN LGSLPGYADS AVQEKVKISS TTSQQLIWIK VALPVTIEQL 
AGKLGLKATE LSKLNKKSSN TELTKGSWIV LPRSAHNQLR RISYLDSEQV LLHNPRNSKN
QLNNNRLSRL LDETKKKNTL YSLNDHNKNT NRVQNQTKVS SNNILKKECS LESPCNCPTC
LDVESPESTV DLFTRSNDML QLGSIDSDSY IWPTKGVFTS GFGWRWGRMH QGIDIANKVG
TPVFAAKDGI VTYAGWRGAY GYLVEIAHGG GSTTRYAHNN QILVRSGQFI PQGATISKMG
STGRSTGPHL HFEIRKKGGL AMNPVTLLPS NKV