Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_05571 |
Symbol | pepB |
ID | 4776373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 538563 |
End bp | 540083 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640086062 |
Product | leucyl aminopeptidase |
Protein accession | YP_001016574 |
Protein GI | 124022267 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.612563 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTAA ACGGCACGGC TAACATCGCG CCACCTCATC AACATCCCAT GGAGATTTGC CTTTTTCCAG CAACCTTGCA GAGCTGGACT GGCGATGTGC TCATGGTGGG CATGTTTGAG GGGAAGATGG AGGAACGTTT AAACGAGCTG GAAACACTCT GCAAAGGCTC TCTGATGCAG AGCCTTGAGA AGCAGATGTT CAAAGGCAAA TCCGGAGAAA TCGCCACCGT TCAACTGCTC CAAAACAAGC CAAGCCTGCT GGTGCTCGTA GGGCTCGGAG AACCCCAGAA GATGAGGCTT GACGACCTTC GCAAGGCGGC TGCCCTGGGA GCGAAAGCAA GCCTCGGATG CAGTGGAACC CTGGGGATAA TGCTCCCCTG GGAGCCCTTA GATGCGGCAT CTGCCGCAAG AGCCGTCGCA GAAGCCGTTC GCCTTTCCCT CTATAAAGAC CTCCGCTTTC GCAGTGCTCC TGAGCCCCGC TCTACGCCAA CAAAACTGGA GCTCATTGGT CTGCCCGACT CAGCCGGCAA AGACCTTCAA GCCGTTCACC CCACCTGTGC GGGCGTGGAG CTAGCAAGGC AGCTGGTTGC GGCTCCTGCC AACAGTCTTA CCCCTGCGGC CCTCGCCCAA ACAGCGATTC AACTTGCCCA CGAACACGGC CTCGAGTGCA CTGTGCTGGA GCGGTCTGAC TGCGCCGAAC GGGAAATGGG TGCGTACCTA GCTGTCTCTC AAGGTTCTGA TCTGGAGCCA AAATTCATTC ACCTCACCTA TCGCCCTCAA GGACCAGTTC AACGACGACT GGCCCTTGTA GGCAAGGGAC TGACCTTCGA TTCCGGTGGA TACAACCTCA AGGTTGGAGC TGCCCAGATC GATCTGATGA AGTTCGACAT GGGCGGCAGC GCAGCTGTAC TTGGGGCCGC CCGAGCGATT GCAGAACTGC GACCCAAGGG AGTGGAAGTG CATGTGATCG TGGCAGCCTG CGAAAACATG GTGAATGGAT CCGCTGTCCA TCCAGGAGAC ATCGTGCGGG CCTCAAACGG CACGACCATC GAGATCAACA ACACCGATGC CGAAGGCCGT CTGACCCTTG CCGATGCACT GGTATACACC TGCGGACTCG AACCAGACGC AATCGTGGAT TTAGCAACCC TTACAGGGGC TTGCGTGATC GCACTTGGCG AAGAGATCGC TGGCCTTTGG ACAGGTCATG ATCCCCTGGC TGAGGGACTA ACCGCTGCCG CCGAGGCGGC CGGCGAAGGC CTTTGGCGAA TGCCATTGCC AAGCTCCTAT CGAGAGGGTC TCAAATCCAA CCTCGCTGAC CTCAAGAACA CAGGACCTCG TCCTGGCGGA TCCATTACCG CGGCCCTTTT CCTCAAAGAG TTTGTTGAAG CCTCAATCCC ATGGGCCCAC ATCGACATCG CCGGAACGGT CTGGTCTGAA AAGGGCCGTG GCCTCAATCC ATCTGGTGCC ACCGGCTACG GGGTTCGCAC TTTGGTGAAT TGGATTTGTA GTCAGTCTTG A
|
Protein sequence | MQLNGTANIA PPHQHPMEIC LFPATLQSWT GDVLMVGMFE GKMEERLNEL ETLCKGSLMQ SLEKQMFKGK SGEIATVQLL QNKPSLLVLV GLGEPQKMRL DDLRKAAALG AKASLGCSGT LGIMLPWEPL DAASAARAVA EAVRLSLYKD LRFRSAPEPR STPTKLELIG LPDSAGKDLQ AVHPTCAGVE LARQLVAAPA NSLTPAALAQ TAIQLAHEHG LECTVLERSD CAEREMGAYL AVSQGSDLEP KFIHLTYRPQ GPVQRRLALV GKGLTFDSGG YNLKVGAAQI DLMKFDMGGS AAVLGAARAI AELRPKGVEV HVIVAACENM VNGSAVHPGD IVRASNGTTI EINNTDAEGR LTLADALVYT CGLEPDAIVD LATLTGACVI ALGEEIAGLW TGHDPLAEGL TAAAEAAGEG LWRMPLPSSY REGLKSNLAD LKNTGPRPGG SITAALFLKE FVEASIPWAH IDIAGTVWSE KGRGLNPSGA TGYGVRTLVN WICSQS
|
| |