Gene P9301_15181 details


Gene Information

Locus tag: P9301_15181
Symbol: pepB
ID: 4912576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 1284709
End bp: 1286181
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 34%
IMG OID: 640161114
Product: leucyl aminopeptidase
Protein accession: YP_001091742
Protein GI: 126696856
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
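
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. Both are assumptions about how the record was generated, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1284709, 1286181)
print(length)                  # 1473
print(protein_length(length))  # 490
```

Under these assumptions the coordinates reproduce the 1473 bp gene length and the 490 aa protein length listed above.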


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTTT CCACATTCCA AAAAAATCTA GATAACTGGC AAGGTGCTTC ATTAATTTTT 
GGAGTTTTAG AAGAAGAAAT TGCGAGCCAA CTTGAAAAAA TAAAATTTGT TATTGACCCA
AAATTATTAC TAAAAAAAGT TTCTCAAAAA AATTTCAAAG GAGAGAAAGG AAAAACTTTA
AGCTTTGAAT TTCTAGATCA AAAATTAGAA ACTTTAATCA TAGTTGGTCT TGGCAAATCA
AAAGACCTTA ATAAAAGTGA TATAGAAAAC TCTATAGGAA ATCTAGTTAG GAAAACCATT
GATAAAAATG AAAAAATCAG CATCTTGCTA CCTTGGGAAT TTATAAATTC ACAACTAGAA
ATAAATCAAT TAGCAGAGTC AGCCAGATTA TCTGCCTATA AGGACAACAG ATTCAACAAG
AAAAAAGATG AAAAGAAAGT TCTTAAAGAA ATTGAGTTTT TGAATTTAAA AAAATTTGAG
AATATTAGCT TTAAAGAGAC AGCACAAATA TGTGAAGGTG TAGAACTAGC TAGAAGACTT
GTAGCAGCCC CTCCAAATAG TCTTACGCCT CAGGAAATGT CTATACAAGC TTCTCAAATA
GCTAAAGATC ATGGTTTGGA AGTAAAAATT TTAGAGGCAA AAGATTGTGA AGATTTAGGA
ATGGGTGCAT ATTTAGCTGT AGCAAAAGGT TCTGATCTAA ATCCTAAATT TATACATCTT
ACTTTAAAGT CAGATGGGCC TATTAAAGAA AAGATTGCGC TTGTTGGGAA GGGTTTAACC
TTTGACTCTG GAGGATACAA CCTGAAAGTA GGAGCCTCTC AAATTGAGAT GATGAAATAT
GATATGGGTG GAAGCGCTGC TGTTTTAGGA GCAGCAAAAG CACTTGGAGC AATAAAACCA
AAGGGACTAG AAATACATTT TATTGTGGCA TCCTGCGAAA ATATGATAAA TGGATCTGCA
GTACATCCTG GAGATGTAGT CAAGGCATCT AATGGTAAGA CAATTGAAAT AAATAACACT
GATGCAGAGG GCAGACTCAC ATTAGCTGAT GCTTTAACTT ACGCATCAAA TTTAAAACCG
GATTCAATAA TAGATCTTGC CACTTTAACA GGAGCTATTG TTGTGGCATT AGGAAATGAC
GTAGCTGGAT TCTGGAGCAA TAATGATGAT CTAGCAAATG ATCTAAAAGC TGCGTCAGCC
CAGGCTGGTG AAGAATTATG GCAAATGCCT TTACAAAAAT CTTATAAAGA AGGGCTAAAG
TCTCATATAG CTGATATGAA AAATACGGGG CCTAGAGCAG GTGGGTCAAT AACTGCTGCT
TTGTTTTTAG AGGAATTCTT TGATCCAGAG ATTAAATGGG CTCATGTTGA TATTGCTGGG
ACTTGTTGGA CTGATAAGAA TAAGGGGATT AATCCATCAG GTGCAACCGG TTTTGGAGTT
AAAACTCTTG TTCAATGGAT TAAAAATAAA TAA
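
The gene sequence is printed in space-separated groups of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the listing. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its GC figure was rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the display layout and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly whitespace-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full gene sequence block into the string below.
example = "GGCC AATT\nGCAT"
print(clean_sequence(example))  # GGCCAATTGCAT
print(gc_percent(example))      # 50.0
```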
 
Protein sequence
MQFSTFQKNL DNWQGASLIF GVLEEEIASQ LEKIKFVIDP KLLLKKVSQK NFKGEKGKTL 
SFEFLDQKLE TLIIVGLGKS KDLNKSDIEN SIGNLVRKTI DKNEKISILL PWEFINSQLE
INQLAESARL SAYKDNRFNK KKDEKKVLKE IEFLNLKKFE NISFKETAQI CEGVELARRL
VAAPPNSLTP QEMSIQASQI AKDHGLEVKI LEAKDCEDLG MGAYLAVAKG SDLNPKFIHL
TLKSDGPIKE KIALVGKGLT FDSGGYNLKV GASQIEMMKY DMGGSAAVLG AAKALGAIKP
KGLEIHFIVA SCENMINGSA VHPGDVVKAS NGKTIEINNT DAEGRLTLAD ALTYASNLKP
DSIIDLATLT GAIVVALGND VAGFWSNNDD LANDLKAASA QAGEELWQMP LQKSYKEGLK
SHIADMKNTG PRAGGSITAA LFLEEFFDPE IKWAHVDIAG TCWTDKNKGI NPSGATGFGV
KTLVQWIKNK