Gene P9303_05571 details


Gene Information

Locus tag: P9303_05571
Symbol: pepB
ID: 4776373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 538563
End bp: 540083
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 58%
IMG OID: 640086062
Product: leucyl aminopeptidase
Protein accession: YP_001016574
Protein GI: 124022267
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
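
The start, end, gene length and protein length listed above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the reported length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 538563
END_BP = 540083
GENE_LENGTH_BP = 1521
PROTEIN_LENGTH_AA = 506


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both derived values agree with the record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```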


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.612563
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTAA ACGGCACGGC TAACATCGCG CCACCTCATC AACATCCCAT GGAGATTTGC 
CTTTTTCCAG CAACCTTGCA GAGCTGGACT GGCGATGTGC TCATGGTGGG CATGTTTGAG
GGGAAGATGG AGGAACGTTT AAACGAGCTG GAAACACTCT GCAAAGGCTC TCTGATGCAG
AGCCTTGAGA AGCAGATGTT CAAAGGCAAA TCCGGAGAAA TCGCCACCGT TCAACTGCTC
CAAAACAAGC CAAGCCTGCT GGTGCTCGTA GGGCTCGGAG AACCCCAGAA GATGAGGCTT
GACGACCTTC GCAAGGCGGC TGCCCTGGGA GCGAAAGCAA GCCTCGGATG CAGTGGAACC
CTGGGGATAA TGCTCCCCTG GGAGCCCTTA GATGCGGCAT CTGCCGCAAG AGCCGTCGCA
GAAGCCGTTC GCCTTTCCCT CTATAAAGAC CTCCGCTTTC GCAGTGCTCC TGAGCCCCGC
TCTACGCCAA CAAAACTGGA GCTCATTGGT CTGCCCGACT CAGCCGGCAA AGACCTTCAA
GCCGTTCACC CCACCTGTGC GGGCGTGGAG CTAGCAAGGC AGCTGGTTGC GGCTCCTGCC
AACAGTCTTA CCCCTGCGGC CCTCGCCCAA ACAGCGATTC AACTTGCCCA CGAACACGGC
CTCGAGTGCA CTGTGCTGGA GCGGTCTGAC TGCGCCGAAC GGGAAATGGG TGCGTACCTA
GCTGTCTCTC AAGGTTCTGA TCTGGAGCCA AAATTCATTC ACCTCACCTA TCGCCCTCAA
GGACCAGTTC AACGACGACT GGCCCTTGTA GGCAAGGGAC TGACCTTCGA TTCCGGTGGA
TACAACCTCA AGGTTGGAGC TGCCCAGATC GATCTGATGA AGTTCGACAT GGGCGGCAGC
GCAGCTGTAC TTGGGGCCGC CCGAGCGATT GCAGAACTGC GACCCAAGGG AGTGGAAGTG
CATGTGATCG TGGCAGCCTG CGAAAACATG GTGAATGGAT CCGCTGTCCA TCCAGGAGAC
ATCGTGCGGG CCTCAAACGG CACGACCATC GAGATCAACA ACACCGATGC CGAAGGCCGT
CTGACCCTTG CCGATGCACT GGTATACACC TGCGGACTCG AACCAGACGC AATCGTGGAT
TTAGCAACCC TTACAGGGGC TTGCGTGATC GCACTTGGCG AAGAGATCGC TGGCCTTTGG
ACAGGTCATG ATCCCCTGGC TGAGGGACTA ACCGCTGCCG CCGAGGCGGC CGGCGAAGGC
CTTTGGCGAA TGCCATTGCC AAGCTCCTAT CGAGAGGGTC TCAAATCCAA CCTCGCTGAC
CTCAAGAACA CAGGACCTCG TCCTGGCGGA TCCATTACCG CGGCCCTTTT CCTCAAAGAG
TTTGTTGAAG CCTCAATCCC ATGGGCCCAC ATCGACATCG CCGGAACGGT CTGGTCTGAA
AAGGGCCGTG GCCTCAATCC ATCTGGTGCC ACCGGCTACG GGGTTCGCAC TTTGGTGAAT
TGGATTTGTA GTCAGTCTTG A
 
Protein sequence
MQLNGTANIA PPHQHPMEIC LFPATLQSWT GDVLMVGMFE GKMEERLNEL ETLCKGSLMQ 
SLEKQMFKGK SGEIATVQLL QNKPSLLVLV GLGEPQKMRL DDLRKAAALG AKASLGCSGT
LGIMLPWEPL DAASAARAVA EAVRLSLYKD LRFRSAPEPR STPTKLELIG LPDSAGKDLQ
AVHPTCAGVE LARQLVAAPA NSLTPAALAQ TAIQLAHEHG LECTVLERSD CAEREMGAYL
AVSQGSDLEP KFIHLTYRPQ GPVQRRLALV GKGLTFDSGG YNLKVGAAQI DLMKFDMGGS
AAVLGAARAI AELRPKGVEV HVIVAACENM VNGSAVHPGD IVRASNGTTI EINNTDAEGR
LTLADALVYT CGLEPDAIVD LATLTGACVI ALGEEIAGLW TGHDPLAEGL TAAAEAAGEG
LWRMPLPSSY REGLKSNLAD LKNTGPRPGG SITAALFLKE FVEASIPWAH IDIAGTVWSE
KGRGLNPSGA TGYGVRTLVN WICSQS
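
The protein sequence can be reproduced from the gene sequence with translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code. The following is a minimal sketch, not the database's own pipeline: `translate` and the constant names are hypothetical, and only the first and last blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases and final 21 bases of the gene sequence above.
first_block = "ATGCAGCTAA ACGGCACGGC TAACATCGCG CCACCTCATC AACATCCCAT GGAGATTTGC"
last_block = "TGGATTTGTA GTCAGTCTTG A"

assert translate(first_block) == "MQLNGTANIAPPHQHPMEIC"
assert translate(last_block) == "WICSQS"
```

The last block can be translated on its own because the 1500 bases before it are a whole number of codons, so it starts in frame.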