Gene NATL1_17591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17591 
SymbolpepB 
ID4780101 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1439576 
End bp1441063 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content38% 
IMG OID640085047 
Productleucyl aminopeptidase 
Protein accessionYP_001015579 
Protein GI124026464 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.697239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAATTT CAGCAGTCCC CAAAGAAATT AACGAATGGT CAGGATCAGT GCTTATAGCT 
GGGATTTTGG AAGGAACAAT CGAAAGCCAA ATTAATTTAT TTAAGGCAAT AATAAAAGAT
ACTTTTTTGA GCCAAAGGTT TATTGATTCA AAATTCGAAG GCAAAAAAAA TCAGAAATTA
TCAATTGAAC TAATAGAAGG CAAAGTTAAA AAAGTAATTT TTGTAGGCTT AGGCAAGGCC
GAAACTCTTG GAATTGATGA TCTGCGGAAA GCAGCTTCAA TTGGTACTCG TCAAGTTTCA
GGCTATGAAA GAAAGTTAGG TATATTTTTC CCTTGGGATG CATTTGACCC TTCCTCCGCT
GCATGCGCAG TTGGCGAAGC AGTTCGATTG TCATCTATTA AAGATTTTAG ATTCAAATCA
GAACCAAAAG AACCTACTCC AATAGATCAA GTTGAATTAA TAGGTTTGGA CACCAAAACC
ACTAAATCAG CGATTGATGA AATAAATCCA ATATGCGAAG GAGTTAAATT TGCAAGAGAA
CTTGTTTCAG CCCCTCCCAA TTTTCTTACC CCATATCAAA TGTCTAAGGA GGCTGAAAAG
TTAGCCACTG ACTATGATCT TGATTTGAAA GTTCTAGATA GAAAAGAGTG CGAAAATCAA
GGGATGGGAG CTTACTTAGC AGTTGCTAAA GGATCAGATC TAGATCCTAA TTTTATACAT
TTAAAATATT CTCCAAAAAA TGCAAAAACC AAAGTCGTCT TAATTGGCAA AGGCTTAACT
TTTGACTCTG GTGGATACAA CTTAAAAGTA GGTGCATCTC AAATTGAAAA AATGAAGTAC
GACATGGGAG GTAGTGCTTC TGTTCTTGGA GCAGCCAGAG CCATCGCAGA ATTAAAACCG
AATAACATCG AGGTTCATTT TATTATTGCT GCTTGCGAAA ATATGATCAA CGGCTCTGCA
TTGCATCCTG GAGATATCAT CAAAGCTTCG AATGGAAAAA CCATTGAAGT AAACAATACC
GATGCAGAAG GAAGGTTAAC TTTAGCTGAT GCTTTGGTTT ATGCATGCAA GCTGAAGCCT
GACGCCATAG TAGATCTAGC CACTCTTACT GGGGCTTGTG TCATTGCATT AGGAGATGAA
ATAGCAGGTT TATGGACTGA CAATGATCAG CTCTCTGAGC AATTAACGAA AGCTGCGTGT
AAAGCTGGAG AGGGTATTTG GAGAATGCCA ATGCAAGATT CATATAAATC TGGAATTAAA
TCAACTATTG CTGATTTGCA AAACACAGGG CCTAGGCCAG GGGGGTCAAT TACTGCAGCC
TTGTTTCTCA AAGAATTTGT GAACTCAAGC ATTCCATGGG CGCACATTGA CATAGCAGGT
ACATGCTGGA CAGAAAAAGA TAGAGATATA ACTCCAAAGG GTGCTACTGG TTATGGAGTT
AGAACGTTAA TTAATTGGAT CAAGGAGTTG AGTCTAAACA CCAATTAA
 
Protein sequence
MQISAVPKEI NEWSGSVLIA GILEGTIESQ INLFKAIIKD TFLSQRFIDS KFEGKKNQKL 
SIELIEGKVK KVIFVGLGKA ETLGIDDLRK AASIGTRQVS GYERKLGIFF PWDAFDPSSA
ACAVGEAVRL SSIKDFRFKS EPKEPTPIDQ VELIGLDTKT TKSAIDEINP ICEGVKFARE
LVSAPPNFLT PYQMSKEAEK LATDYDLDLK VLDRKECENQ GMGAYLAVAK GSDLDPNFIH
LKYSPKNAKT KVVLIGKGLT FDSGGYNLKV GASQIEKMKY DMGGSASVLG AARAIAELKP
NNIEVHFIIA ACENMINGSA LHPGDIIKAS NGKTIEVNNT DAEGRLTLAD ALVYACKLKP
DAIVDLATLT GACVIALGDE IAGLWTDNDQ LSEQLTKAAC KAGEGIWRMP MQDSYKSGIK
STIADLQNTG PRPGGSITAA LFLKEFVNSS IPWAHIDIAG TCWTEKDRDI TPKGATGYGV
RTLINWIKEL SLNTN