Gene NATL1_18411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18411 
SymbolpepP 
ID4780608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1502777 
End bp1504096 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content40% 
IMG OID640085130 
Productputative aminopeptidase P 
Protein accessionYP_001015661 
Protein GI124026546 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.502832 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGATT CAAGTGTTTT TCATCGCCGA AGAGACTCGT TTCTGTCAGG TTTAGGTTCG 
TATGCGGCTA TTGTTCCAGC TGGAGAGTTA GTTACTCATC ACGCAGATTG CGAATATCCT
TTCAGACAAA ATAGCGACTT TTGGTATTTA ACTGGTTTTG ATGAGCCAAA TGCAGTGGCT
TTATTTTTGC CTCATAAACC CAAAGGAGAA CAATACGTAT TATTTGTCTT ACCTAAAGAG
TCTGCGGCAG AGGTTTGGAC AGGCTTTAGA TGGGGAACAA AAGGAGTTTT AGATAATTTT
GATGTTGATA TAGCCCACTC TTTGAATGAA TTGCCAAGCC TTTTAGCTCA TTATTTAGAG
GGGGCAGAGG GAATTGCTTT TCGAATTGGG AAGCATCCGA ACATAGAGCC TTTGGTTTTA
AAAACTTGGT CGGAGCATTT ACAAAAACTT CCAAGAAGTG GTTTGGCCCC ATTATCCATG
ATTGCGCCTT GTCCAATACT TCACGATATG AGACTTCGTA AGGACGATTT TGAAATAGAA
AGGATGCGTA TTGCATCACA AATTTCTGCA GAGGCTCATG AATTAGTTAG AGAATTTGCT
CGTCCAGGAA TGAATGAGAG GGATTTACAA GCGCAGATAG AAAAATACTT TCTTGAGAAG
GGGACTAGAG GACCTGCTTA TGGCTCAATA GTTGCATCAG GTGATAATGC ATGCGTGCTT
CATTACACGG AGAACAATTC ACTCATAAAG AATGGGGACC TTGTTTTGAT AGATGCAGGT
TGCTCTCTAG ATGACTATTA CAATGGTGAC ATAACAAGAA CCTTTCCAGT TAATGGAAGG
TTTTCTGGAG AGCAAAAAGC CTTATATGAA ATTGTTCTAA GTTCTCAAAA AGCTGCAATT
AATTGTGTTC GACCAGGTGA TAATGCTGAG AACGTACATA TGACTGCCTT AAAACATCTC
GTTGGGGGGT TAGTCGATAT TGGCTTACTT GTTGGCGATG TTGATTCTAT TATTGAACAA
CAAGCTTACT CGCATTTGTA CATGCATCGA ACAGGGCATT GGTTAGGACT TGATGTCCAT
GATGTAGGTG CATATAGGCT TGGTGACTAT CATTTGAATC TTGAACCTGG GATGGTTTTA
ACGGTTGAAC CTGGCATTTA TATAAGTGAT CGGTTAGCGG TTCCCCAAGG GCAACCTGAG
ATAGACAAAA GATGGAAAGG TATTGGAATT CGTATCGAGG ATGATGTTTT GGTTACACAA
GATTCTGTAG AAGTATTGAG TTGTAAAGCA GCTAAGGATT TGATAGATAT GGAATGTTAG
 
Protein sequence
MADSSVFHRR RDSFLSGLGS YAAIVPAGEL VTHHADCEYP FRQNSDFWYL TGFDEPNAVA 
LFLPHKPKGE QYVLFVLPKE SAAEVWTGFR WGTKGVLDNF DVDIAHSLNE LPSLLAHYLE
GAEGIAFRIG KHPNIEPLVL KTWSEHLQKL PRSGLAPLSM IAPCPILHDM RLRKDDFEIE
RMRIASQISA EAHELVREFA RPGMNERDLQ AQIEKYFLEK GTRGPAYGSI VASGDNACVL
HYTENNSLIK NGDLVLIDAG CSLDDYYNGD ITRTFPVNGR FSGEQKALYE IVLSSQKAAI
NCVRPGDNAE NVHMTALKHL VGGLVDIGLL VGDVDSIIEQ QAYSHLYMHR TGHWLGLDVH
DVGAYRLGDY HLNLEPGMVL TVEPGIYISD RLAVPQGQPE IDKRWKGIGI RIEDDVLVTQ
DSVEVLSCKA AKDLIDMEC