Gene P9211_13861 details


Gene Information

Locus tag: P9211_13861
Symbol: pepB
ID: 5731800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1251901
End bp: 1253382
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 41%
IMG OID: 641285762
Product: leucyl aminopeptidase
Protein accession: YP_001551271
Protein GI: 159903927
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
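
The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, not the database's own code: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1251901, 1253382)
print(length, protein_length_aa(length))
```

Under these assumptions the coordinates reproduce the listed Gene Length of 1482 bp and Protein Length of 493 aa.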


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0993846
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAATCT CGATTATTCA AAAAGGTCTA GAGGGATGGA GGGGTTCGAT ACTGGTATTT 
GGTCTCCTAG AAGGAGCACT GGAAAGTCAA CTAAATGCTT TAAAAGAGAT TTGCACTCCA
GCATCGCTCG CGAAGGCTCT CCAAGATAAG GAATTTGTTG GGAAACAAGG AGATCTTCAA
AGCTTCCAAT TAATAGGTAA AGAACCTCGT GAAATTGTTC TCATAGGTCT TGGTTCAGCA
GAAAAACTTG TACTAGATGA TTTGAGAAAA GCCACAGCCA TAAGTTGTCG AAAAGTTATT
GGCCAAGAGG GTACTTTAGG AATATTGCTT CCTTGGGATA TTTTTGATTC TGATATAGCA
GCTAAAGCTG TAGGTGAAGC TGTTATCCTT TCATTCTTCA AGGACAATCG ATTCCAAAAA
GATCCCAAAC AAAAAAAATT ACCTAACAAA CTGGAACTTT TAGGACTGCC TGAATCTTCG
CAAAAATACT TGAGCGAAAT AGTCCCTATT TGCTCAGGCG TCAAGCTTGC TAGAGAACTT
GTAGGAGCTC CCCCGAATAG TCTGACCCCT TCAGCATTAG CTAATCAAGC AAAAGAAATA
GCTAATCAAT TTGGTCTTGA AGCAAAAATA CTGGGACAGG AAGAATGCCA AGCTAAAAAT
ATGGGAGCCT TCTTAGCCGT ATCGCAAGGA TCAGATCTAA GTCCTAAATT TATTCACCTC
ACTTATAGAG CCAAAGGGGA AATAAAGCGT CGTATAGCAA TGGTAGGGAA AGGCCTGACT
TTTGACTCTG GCGGCTACAA CCTAAAAGTT GGTGCTTCTC AAATAGAAAT GATGAAATAT
GACATGGGTG GTAGCGCAGC AGTAATTGGC GCAGCTAGAG CAATAGGTGA ACTAGCTCCT
TCTGGTGTAG AGATTCATTT TTTAGTCGCG ACATGTGAAA ACATGATCAA TGGTTCTGCA
GTTCATCCAG GAGATATTGT TAAAGCCTCA AATGGAACAA CAATTGAAAT CAATAATACC
GATGCCGAAG GACGTCTTAC TCTTGCTGAT GCACTTACTT ATGCATGCGA GTTGAAGCCA
GATGCAATCG TTGACTTAGC AACGCTTACT GGAGCATGTG TCATTGCCCT CGGAGAGGAA
TTGGCTGGTT TGTGGACTAA TAGCAAACAT CTTTCTAAAG AGCTCAAAGA ATCAGCAGAA
GCATGTGGAG AGGGGCTTTG GGAAATGCCT TTGCAAGATT CATACAAAGA AGGTCTTAAA
TCTATGCTCG CAGATATAAA AAATACTGGG CCAAGAGCAG GTGGATCTAT AACAGCAGCT
CTTTTCCTAA AAGAGTTTAT TAAAGAAGAT ATTGCTTGGG CACATATCGA TATTGCGGGT
ACTTGCTGGA CCGATAAAGA CAGAGGCATA AACCCTGCAG GAGCGACTGG GTTTGGAGTA
AGAACTTTAG TTAATTGGGC TAGCAGATCA ATCAATCCCT AA
 
Protein sequence
MQISIIQKGL EGWRGSILVF GLLEGALESQ LNALKEICTP ASLAKALQDK EFVGKQGDLQ 
SFQLIGKEPR EIVLIGLGSA EKLVLDDLRK ATAISCRKVI GQEGTLGILL PWDIFDSDIA
AKAVGEAVIL SFFKDNRFQK DPKQKKLPNK LELLGLPESS QKYLSEIVPI CSGVKLAREL
VGAPPNSLTP SALANQAKEI ANQFGLEAKI LGQEECQAKN MGAFLAVSQG SDLSPKFIHL
TYRAKGEIKR RIAMVGKGLT FDSGGYNLKV GASQIEMMKY DMGGSAAVIG AARAIGELAP
SGVEIHFLVA TCENMINGSA VHPGDIVKAS NGTTIEINNT DAEGRLTLAD ALTYACELKP
DAIVDLATLT GACVIALGEE LAGLWTNSKH LSKELKESAE ACGEGLWEMP LQDSYKEGLK
SMLADIKNTG PRAGGSITAA LFLKEFIKED IAWAHIDIAG TCWTDKDRGI NPAGATGFGV
RTLVNWASRS INP
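
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and check for a terminal stop codon. The helper names are hypothetical, only the first and last rows of the gene sequence are used as sample input, and the stop codon set is the standard one for translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First and last rows of the gene sequence listed above.
first_row = "ATGCAAATCT CGATTATTCA AAAAGGTCTA GAGGGATGGA GGGGTTCGAT ACTGGTATTT"
last_row = "AGAACTTTAG TTAATTGGGC TAGCAGATCA ATCAATCCCT AA"

seq = normalize(first_row)
print(len(seq), gc_fraction(seq))
print(ends_with_stop(normalize(last_row)))
```

A single row is only a sample; the GC content of 41% given in the record refers to the whole gene, so the full sequence would have to be passed to `gc_fraction` to compare against that figure.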