Gene P9303_04951 details

Gene Information

Locus tag: P9303_04951
Symbol: pepP
ID: 4778181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 489138
End bp: 490415
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 56%
IMG OID: 640085999
Product: putative aminopeptidase P
Protein accession: YP_001016512
Protein GI: 124022205
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
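
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below checks that relationship under two assumptions that the record does not state explicitly: the coordinates are 1-based and inclusive, and the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(489138, 490415)
print(length)                     # 1278
print(protein_length_aa(length))  # 425
```

Both values agree with the Gene Length and Protein Length fields reported for this gene.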


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAAT TGGGGGGAGT GGCAGCGGTA ATTCCAGCGG CTTCGTTCGT GACGCATCAC 
GCCGATTGCG AGTATCCGTT TCGTCAAAAC AGTGACTTCT GGTATCTCAC GGGTCTGGAT
GAGCCTGATG CGGTGGCTTT GCTTTTGCCC TACCGGCCCG AAGGAGAGCG TTTTGTGCTG
TTTGTGCAAC CCAAAGAGCC AACGACCGAG GTATGGAATG GCTTTCGTTG GGGTGTCGAG
GGAGCAGTGG CGCAGTTCGG TGCTGATCGT GCTCATCCAA TAGCTGAATT GCCTCAGCTG
TTGGGCGATT ACCTCAAAGA CGCCGAAGCG ATTGGCTTCA GGGTGGGGAA GCACCCAAAG
GTGGAGTCGC TGGTTCTTGA GGCCTGGGCT AAGCAACTCG ATCAGGCCCC ACGCAGCGGC
GTATCGGCGC TTGGTCTTGT CGCCCCTTGC CCTTTCATCC ATCAGTTGAG GTTGCGCAAG
GGAGCAGAGG AAATCGAGCG CATGCGTGAG ACGGCTCGGA TTTCTGCGCA AGCTCATCAG
CTGGTTCGGA ATACGGCGCG TCCGGGCATG AATGAGCGAC AGTTGCAGGC TGTAATCGAG
CAGCATTTTC TTGAGCAAGG CGCGCGCGGC CCTGCCTATG GATCGATCGT GGCCGGCGGC
GATAACGCCT GCGTGTTGCA TTACACCGCT AACAATGCTC CGCTTGTTGA TGGCGAGCTG
GTGCTGATCG ATGCAGGCTG TTCGCTCGTT GATTACTACA ACGGAGATAT CACTAGGACA
TTCCCTGTTA ACGGACGCTT TAGCGCAGAG CAACGCGCTC TGTATGAGCT GGTTTTAGCA
GCGCAGCAGG CAGCGATCGC TGAGGTAAGA CCGGGTGGCA CTGCTGATCG CGTGCATGAT
CTTGCGGTGC GGGTGCTCGT TGAAGGGCTT GTGGAGTTGG GTTTATTGCT TGGCAGTGTT
GATGGACTGA TTGAACAAGG GGCCTACCGA CACCTTTATA TGCATCGCAC TGGCCATTGG
CTGGGCCTTG ATGTTCATGA TGTAGGCGCC TATCGCCTTG GTGAGCATCC GGTGGATCTT
GAGCCCGGCA TGGTGTTAAC AGTGGAACCG GGTTTGTATG TCAGTGACCG TTTGCCAGTG
CCGGATGGGC AGCCTGCGAT TGCTGATCGT TGGAAGGGGA TCGGCATTCG TATTGAAGAT
GATGTGTTGG TAAGTGAGCA GGGAAATGAG GTGCTCACAT CGTTAGCGGA GAAAAGTGTT
GAAGCAATGC AGCGATAA
 
Protein sequence
MAQLGGVAAV IPAASFVTHH ADCEYPFRQN SDFWYLTGLD EPDAVALLLP YRPEGERFVL 
FVQPKEPTTE VWNGFRWGVE GAVAQFGADR AHPIAELPQL LGDYLKDAEA IGFRVGKHPK
VESLVLEAWA KQLDQAPRSG VSALGLVAPC PFIHQLRLRK GAEEIERMRE TARISAQAHQ
LVRNTARPGM NERQLQAVIE QHFLEQGARG PAYGSIVAGG DNACVLHYTA NNAPLVDGEL
VLIDAGCSLV DYYNGDITRT FPVNGRFSAE QRALYELVLA AQQAAIAEVR PGGTADRVHD
LAVRVLVEGL VELGLLLGSV DGLIEQGAYR HLYMHRTGHW LGLDVHDVGA YRLGEHPVDL
EPGMVLTVEP GLYVSDRLPV PDGQPAIADR WKGIGIRIED DVLVSEQGNE VLTSLAEKSV
EAMQR
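
The protein sequence can be derived from the gene sequence by reading it codon by codon. The following is a minimal sketch of that step, not the pipeline used to produce this record. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGCGCAAT TGGGGGGAGT GGCAGCGGTA"))  # MAQLGGVAAV
```

The printed residues match the first block of the protein sequence (MAQLGGVAAV). Passing the full gene sequence to the same function should allow the rest of the translation to be compared in the same way.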