Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_15321 |
Symbol | pepB |
ID | 4718258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1311615 |
End bp | 1313087 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640079257 |
Product | leucyl aminopeptidase |
Protein accession | YP_001009922 |
Protein GI | 123969064 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.113645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATTTT CCACATTCCA AACAAATCTA GATAACTGGC AAGGTGCTTC ATTAATTTTT GGAGTTCTAG AGGAAGAAAT TGCAAGCCAA CTTGAAAACA TAAAATTTGT TGTTGACCCA AAATTATTAC TAAAAAAAGT TACTCAAAAA AAATTCAAAG GAGAAAAAGG AAAAACTTTA AGCTTTGAAT TTTTAGATCA AAAATTAGAA ACTTTAATCA TAGTTGGTCT TGGCAAATCA AAAGACCTAA ACAAAAGTGA TATAGAAAAC TCTATAGGAA ATCTAGTTAG GAAAACTGTT GATAAAAATG CAAAAATCAG CATCTTGCTA CCTTGGGAAT TAATAAATTC ACAACTAGAG ATAACTAAAC TAGCAGAATC AGCCAGATTA TCTGCCTATA AGGACAATAG ATTTAATAAG AAAAAAGATG AAAAGAAAGT TCTTAAAGAA ATAGAGTTTT TGAATTTAAA ACAATTTGAG AATATTAGCT TTGAAGAGAC AGCACAAATA TGTGAAGGTG TAGAACTAGC TAGAAAACTT GTAGCCGCCC CTCCAAATAG TCTTACACCT CAGGAAATGT CTTTACAAGC TTCTAAAATA GCTAAAGATC ATGGTTTGGA AGTAAAAATT TTAGAGGCAA AAGATTGTGA AGATTTAGAA ATGGGTGCAT ATTTAGCTGT AGCAAAAGGT TCTGATCTAG ATCCTAAATT TATACATCTT ACTTTGAAGT CAGAGGGGCC TATAAAAGAA AAGATTGCAC TTGTTGGGAA GGGTTTAACC TTTGATTCTG GAGGGTACAA TCTGAAAGTA GGAGCCTCTC AAATTGAAAT GATGAAATAT GATATGGGAG GAAGCGCTGC AGTTTTAGGA GCAGCAAAAG CACTTGGAGC AATAAAACCA AAGGGATTAG AAATTCATTT TATTGTGGCA GCTTGCGAAA ACATGATAAA TGGATCTGCT GTTCATCCTG GAGATGTAGT TAAAGCATCA AATGGTAAGA CAATTGAAAT AAATAATACT GATGCAGAGG GTAGGCTCAC ATTAGCTGAT GCTTTAACTT ATGCATCCAA TTTAAACCCC GATTCAATAA TAGATCTTGC CACTTTAACA GGAGCTATCG TTGTTGCATT AGGGAATGAT GTAGCTGGAT TCTGGAGCAA TAATGATGAT CTAGCAAATG ACCTAAAAGC TGCATCAGCC CAGTCTGGAG AAAAATTATG GCAAATGCCT TTACAAAAAT CTTATAAAGA AGGGTTAAAG TCTCATATAG CTGACATGAA AAATACAGGC CCTAGAGCAG GTGGGTCAAT AACTGCGGCT TTGTTTCTAG AGGAATTCTT TGATTCAGAG ATTAAATGGG CTCATATTGA TATTGCTGGG ACTTGTTGGA CTGATAAGAA TAAAGGAATT AATCCATCAG GTGCAACCGG TTTTGGAGTT AAAACTCTTG TTCAATGGAT TAAAAATAAA TAA
|
Protein sequence | MQFSTFQTNL DNWQGASLIF GVLEEEIASQ LENIKFVVDP KLLLKKVTQK KFKGEKGKTL SFEFLDQKLE TLIIVGLGKS KDLNKSDIEN SIGNLVRKTV DKNAKISILL PWELINSQLE ITKLAESARL SAYKDNRFNK KKDEKKVLKE IEFLNLKQFE NISFEETAQI CEGVELARKL VAAPPNSLTP QEMSLQASKI AKDHGLEVKI LEAKDCEDLE MGAYLAVAKG SDLDPKFIHL TLKSEGPIKE KIALVGKGLT FDSGGYNLKV GASQIEMMKY DMGGSAAVLG AAKALGAIKP KGLEIHFIVA ACENMINGSA VHPGDVVKAS NGKTIEINNT DAEGRLTLAD ALTYASNLNP DSIIDLATLT GAIVVALGND VAGFWSNNDD LANDLKAASA QSGEKLWQMP LQKSYKEGLK SHIADMKNTG PRAGGSITAA LFLEEFFDSE IKWAHIDIAG TCWTDKNKGI NPSGATGFGV KTLVQWIKNK
|
| |