Gene Information |
Locus tag | NATL1_18411 |
Symbol | pepP |
ID | 4780608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1502777 |
End bp | 1504096 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640085130 |
Product | putative aminopeptidase P |
Protein accession | YP_001015661 |
Protein GI | 124026546 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
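
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Both assumptions are consistent with the values listed for NATL1_18411.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length(1502777, 1504096)
print(gene_len, "bp")                 # Gene Length field
print(protein_length(gene_len), "aa")  # Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1320 bp) and Protein Length (439 aa) fields.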
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.502832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTGATT CAAGTGTTTT TCATCGCCGA AGAGACTCGT TTCTGTCAGG TTTAGGTTCG TATGCGGCTA TTGTTCCAGC TGGAGAGTTA GTTACTCATC ACGCAGATTG CGAATATCCT TTCAGACAAA ATAGCGACTT TTGGTATTTA ACTGGTTTTG ATGAGCCAAA TGCAGTGGCT TTATTTTTGC CTCATAAACC CAAAGGAGAA CAATACGTAT TATTTGTCTT ACCTAAAGAG TCTGCGGCAG AGGTTTGGAC AGGCTTTAGA TGGGGAACAA AAGGAGTTTT AGATAATTTT GATGTTGATA TAGCCCACTC TTTGAATGAA TTGCCAAGCC TTTTAGCTCA TTATTTAGAG GGGGCAGAGG GAATTGCTTT TCGAATTGGG AAGCATCCGA ACATAGAGCC TTTGGTTTTA AAAACTTGGT CGGAGCATTT ACAAAAACTT CCAAGAAGTG GTTTGGCCCC ATTATCCATG ATTGCGCCTT GTCCAATACT TCACGATATG AGACTTCGTA AGGACGATTT TGAAATAGAA AGGATGCGTA TTGCATCACA AATTTCTGCA GAGGCTCATG AATTAGTTAG AGAATTTGCT CGTCCAGGAA TGAATGAGAG GGATTTACAA GCGCAGATAG AAAAATACTT TCTTGAGAAG GGGACTAGAG GACCTGCTTA TGGCTCAATA GTTGCATCAG GTGATAATGC ATGCGTGCTT CATTACACGG AGAACAATTC ACTCATAAAG AATGGGGACC TTGTTTTGAT AGATGCAGGT TGCTCTCTAG ATGACTATTA CAATGGTGAC ATAACAAGAA CCTTTCCAGT TAATGGAAGG TTTTCTGGAG AGCAAAAAGC CTTATATGAA ATTGTTCTAA GTTCTCAAAA AGCTGCAATT AATTGTGTTC GACCAGGTGA TAATGCTGAG AACGTACATA TGACTGCCTT AAAACATCTC GTTGGGGGGT TAGTCGATAT TGGCTTACTT GTTGGCGATG TTGATTCTAT TATTGAACAA CAAGCTTACT CGCATTTGTA CATGCATCGA ACAGGGCATT GGTTAGGACT TGATGTCCAT GATGTAGGTG CATATAGGCT TGGTGACTAT CATTTGAATC TTGAACCTGG GATGGTTTTA ACGGTTGAAC CTGGCATTTA TATAAGTGAT CGGTTAGCGG TTCCCCAAGG GCAACCTGAG ATAGACAAAA GATGGAAAGG TATTGGAATT CGTATCGAGG ATGATGTTTT GGTTACACAA GATTCTGTAG AAGTATTGAG TTGTAAAGCA GCTAAGGATT TGATAGATAT GGAATGTTAG
Protein sequence | MADSSVFHRR RDSFLSGLGS YAAIVPAGEL VTHHADCEYP FRQNSDFWYL TGFDEPNAVA LFLPHKPKGE QYVLFVLPKE SAAEVWTGFR WGTKGVLDNF DVDIAHSLNE LPSLLAHYLE GAEGIAFRIG KHPNIEPLVL KTWSEHLQKL PRSGLAPLSM IAPCPILHDM RLRKDDFEIE RMRIASQISA EAHELVREFA RPGMNERDLQ AQIEKYFLEK GTRGPAYGSI VASGDNACVL HYTENNSLIK NGDLVLIDAG CSLDDYYNGD ITRTFPVNGR FSGEQKALYE IVLSSQKAAI NCVRPGDNAE NVHMTALKHL VGGLVDIGLL VGDVDSIIEQ QAYSHLYMHR TGHWLGLDVH DVGAYRLGDY HLNLEPGMVL TVEPGIYISD RLAVPQGQPE IDKRWKGIGI RIEDDVLVTQ DSVEVLSCKA AKDLIDMEC
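
The relationship between the gene sequence, the translation table, and the protein sequence can be sketched in a few lines. The code below is an illustrative, hedged example rather than the tool that produced this record. It assumes that the amino-acid assignments of translation table 11 match the standard code, and that an initiating GTG or TTG is read as methionine; `START_CODONS` is a deliberately small subset of the start codons table 11 allows. All names (`clean`, `gc_fraction`, `translate`) are hypothetical. The gene sequence is listed from start codon to stop codon, so no reverse complement is applied even though the strand is "-".

```python
# Codon table built in the conventional TCAG order (standard amino-acid assignments,
# assumed here to be the same as those of translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a small subset of the bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, reading an initiating start codon as M."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
prefix = "GTGGCTGATT CAAGTGTTTT TCATCGCCGA"
print(translate(prefix))  # MADSSVFHRR
```

The translated prefix matches the first block of the protein sequence, which begins with M even though the gene begins with GTG. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.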