Gene Information |
Locus tag | NATL1_17591 |
Symbol | pepB |
ID | 4780101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1439576 |
End bp | 1441063 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640085047 |
Product | leucyl aminopeptidase |
Protein accession | YP_001015579 |
Protein GI | 124026464 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
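The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1439576
end_bp = 1441063

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1488, matching "Gene Length"
print(protein_length)  # 495, matching "Protein Length"
```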
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.697239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATTT CAGCAGTCCC CAAAGAAATT AACGAATGGT CAGGATCAGT GCTTATAGCT GGGATTTTGG AAGGAACAAT CGAAAGCCAA ATTAATTTAT TTAAGGCAAT AATAAAAGAT ACTTTTTTGA GCCAAAGGTT TATTGATTCA AAATTCGAAG GCAAAAAAAA TCAGAAATTA TCAATTGAAC TAATAGAAGG CAAAGTTAAA AAAGTAATTT TTGTAGGCTT AGGCAAGGCC GAAACTCTTG GAATTGATGA TCTGCGGAAA GCAGCTTCAA TTGGTACTCG TCAAGTTTCA GGCTATGAAA GAAAGTTAGG TATATTTTTC CCTTGGGATG CATTTGACCC TTCCTCCGCT GCATGCGCAG TTGGCGAAGC AGTTCGATTG TCATCTATTA AAGATTTTAG ATTCAAATCA GAACCAAAAG AACCTACTCC AATAGATCAA GTTGAATTAA TAGGTTTGGA CACCAAAACC ACTAAATCAG CGATTGATGA AATAAATCCA ATATGCGAAG GAGTTAAATT TGCAAGAGAA CTTGTTTCAG CCCCTCCCAA TTTTCTTACC CCATATCAAA TGTCTAAGGA GGCTGAAAAG TTAGCCACTG ACTATGATCT TGATTTGAAA GTTCTAGATA GAAAAGAGTG CGAAAATCAA GGGATGGGAG CTTACTTAGC AGTTGCTAAA GGATCAGATC TAGATCCTAA TTTTATACAT TTAAAATATT CTCCAAAAAA TGCAAAAACC AAAGTCGTCT TAATTGGCAA AGGCTTAACT TTTGACTCTG GTGGATACAA CTTAAAAGTA GGTGCATCTC AAATTGAAAA AATGAAGTAC GACATGGGAG GTAGTGCTTC TGTTCTTGGA GCAGCCAGAG CCATCGCAGA ATTAAAACCG AATAACATCG AGGTTCATTT TATTATTGCT GCTTGCGAAA ATATGATCAA CGGCTCTGCA TTGCATCCTG GAGATATCAT CAAAGCTTCG AATGGAAAAA CCATTGAAGT AAACAATACC GATGCAGAAG GAAGGTTAAC TTTAGCTGAT GCTTTGGTTT ATGCATGCAA GCTGAAGCCT GACGCCATAG TAGATCTAGC CACTCTTACT GGGGCTTGTG TCATTGCATT AGGAGATGAA ATAGCAGGTT TATGGACTGA CAATGATCAG CTCTCTGAGC AATTAACGAA AGCTGCGTGT AAAGCTGGAG AGGGTATTTG GAGAATGCCA ATGCAAGATT CATATAAATC TGGAATTAAA TCAACTATTG CTGATTTGCA AAACACAGGG CCTAGGCCAG GGGGGTCAAT TACTGCAGCC TTGTTTCTCA AAGAATTTGT GAACTCAAGC ATTCCATGGG CGCACATTGA CATAGCAGGT ACATGCTGGA CAGAAAAAGA TAGAGATATA ACTCCAAAGG GTGCTACTGG TTATGGAGTT AGAACGTTAA TTAATTGGAT CAAGGAGTTG AGTCTAAACA CCAATTAA
|
Protein sequence | MQISAVPKEI NEWSGSVLIA GILEGTIESQ INLFKAIIKD TFLSQRFIDS KFEGKKNQKL SIELIEGKVK KVIFVGLGKA ETLGIDDLRK AASIGTRQVS GYERKLGIFF PWDAFDPSSA ACAVGEAVRL SSIKDFRFKS EPKEPTPIDQ VELIGLDTKT TKSAIDEINP ICEGVKFARE LVSAPPNFLT PYQMSKEAEK LATDYDLDLK VLDRKECENQ GMGAYLAVAK GSDLDPNFIH LKYSPKNAKT KVVLIGKGLT FDSGGYNLKV GASQIEKMKY DMGGSASVLG AARAIAELKP NNIEVHFIIA ACENMINGSA LHPGDIIKAS NGKTIEVNNT DAEGRLTLAD ALVYACKLKP DAIVDLATLT GACVIALGDE IAGLWTDNDQ LSEQLTKAAC KAGEGIWRMP MQDSYKSGIK STIADLQNTG PRPGGSITAA LFLKEFVNSS IPWAHIDIAG TCWTEKDRDI TPKGATGYGV RTLINWIKEL SLNTN
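The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the gene sequence is read in frame from its first base, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool, and only the first three 10-base blocks and the last nine bases of the gene are used to keep the example short.

```python
from itertools import product

# Codon order follows the NCBI genetic-code listing: bases cycle as T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    # Raises KeyError for ambiguous bases such as N; this sketch does not handle them.
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence as shown in the record.
first_block = translate("ATGCAAATTT CAGCAGTCCC CAAAGAAATT")
# Last nine bases of the gene sequence; '*' marks the stop codon.
last_codons = translate("ACCAATTAA")

print(first_block)  # MQISAVPKEI
print(last_codons)  # TN*
```

The first ten residues and the final two residues agree with the protein sequence listed above, and the terminal TAA stop codon accounts for the one-codon difference between the gene length and the protein length.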