Gene NATL1_15041 details


Gene Information

Locus tag: NATL1_15041
Symbol: pepN
ID: 4780664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1215893
End bp: 1218514
Gene Length: 2622 bp
Protein Length: 873 aa
Translation table: 11
GC content: 31%
IMG OID: 640084785
Product: aminopeptidase N
Protein accession: YP_001015326
Protein GI: 124026210
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID: [TIGR02414] aminopeptidase N, Escherichia coli type
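
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 1215893
END_BP = 1218514
GENE_LENGTH_BP = 2622
PROTEIN_LENGTH_AA = 873


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2622
print(expected_protein_length(GENE_LENGTH_BP))  # 873
```

Under those assumptions both results agree with the listed gene length (2622 bp) and protein length (873 aa).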


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.168245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAACTC AGAAATCTAT TAAATTATCA GACTATGTTG AATATCCTTT CTTAATACCC 
AGTATATATT TAGATTTTGA TATCGGTACG GATTGTGTTG TTGTTCAATC TTCAATGATA
ATTAAGCCAA AGAAGAAAGA ATCTTCAAAG CTTGTTCTTA AGGGTAATCA AATTAAATTA
TTATCAATAT CAATAAATGG AAAAGAATTG AAGTTGCCTG AATATTCTTT TTCCGATAAA
AGCTTGATTA TAAATAGTCC TCCAAAATCA GAATTTGAAT TAAAAATAAG ATCTCAAATA
GATCCTTTTA GAAATACATC ATTAGAAGGA TTATATTTAA GTTCAGGAAT GTTAACTACA
CAATGTGAGG CTGAAGGATT TAGAAGAATT TGTTTTCATC CTGATAGACC AGATGTTTTA
AGTAGATATA CAGTAAGAAT AGAGGCAGAA AGAACTTTGT ATCCTATATT ATTATCTAAT
GGGAATGAGA AGTATTCAGG TAATTTAAAT AGTAATAATC TTAGACATGA ATTTATATGG
GAAGATCCCT TCCCAAAACC TTGCTATTTA TTCGCTTTAG TCGCTGGTAA ATTAAATTCA
GTCTCGGATA CATATATTAC AAATAAGGGA AGATTAATTG ATATTAGAAT TTATGTGGAA
AAAGGAGATG AGAAATATAC AAAACATGCC GTTAACTCAC TAAAAAAGGC AATGAAATGG
GATGAGGATA ATTACGGTCT TGAATATGAT TTAGATGAAT ATAAAATTGT TGCTGTAAGG
CATTTTAATA TGGGAGCGAT GGAAAATAAG GGACTAAATA TTTTTAATTC CAAGTTGGTA
TTAGCTGACT CAAAGACTGC GACTGATGAT GAATTAGAGA GAATAGAAAG TGTTATAGCA
CACGAATATT TTCACAATTG GACTGGCAAT AGAATTACAT GCCGTGATTG GTTTCAGCTT
TCATTAAAAG AAGGTCTGAC AGTTTTTAGA GATCAATCTT TTACCTCAGA CCTACATAGT
AAAGGACTAA AAAGAATAGA AGACGTCTCA TTTCTTAGAA ATTTTCAATT CGCTGAAGAT
AAGGGCCCTA CCTCTCATGC GGTTAAGCCT AAAGAGTATG TAGCGATCGA TAATTTTTAT
ACAACAACAA TATATGAGAA GGGTGCTGAG TTGATAAGAA TGCTTGAGCT ATTGCTAGGA
AAAGAAAAGT TTTTTAGAGG TATTAATCTT TATATTAAAA CTTTTGATGG TAGTGCCGCT
ACAACTGAGG ATTTTATTAA TTCATTAATA AAAGGTGCTT ATCTGGAAGA AAAAAACTGT
CCTTTTGATT TAGATAAATT TCTTAATTGG TATTATAAAT CAGGTACTCC GAAAGTTTAT
ATAAATCAAT CTTGGGATTC AAAGAATTCA ATTTTGAATG TCTCTTTTGA GCAGAAAATA
GATACCGATA AAACTAATGA TAATACCGAA ATGGTTATTC CAATTCTTTA TTCTTGTTAT
AGCAGAGAAA AAGGAGCTAA TCCCTTGGCT GAGAATAATT TATTTGTTTT AGATAAAAAT
AAAAAGTATC TAAAAATAAA TACTCTCCCA GGTGAGCAAC AAGCTCCAGT TCTATCACTT
TTTAGATGTT TTTCTTCACC TGTTGTTTGG GAATCTGATT TAGTTATAGA TGACTATCTT
TTCCTTTTTT TAAATGATAA TGATTATTTT TCGAGGTGGG ACTCTGGTCA GTATTTGATG
CGTGAAATTT TGAAAACTAG GCTTTGCAAT AAAAACAATT TCTCATTGGA GCATAAGTTT
ATTAATGCTA TTAAACAAAC TATAAAATCT TTAGAAATTA ATGATCCATT TTTTTTAGCA
ACTCTTATAA CAATACCTGG TTTTGCGGAG TTGGAATCCT TATTCGAAAA AGTTGATCCA
ATAAGAATTT ATAGTGAGTC CATAGATTTC CAAGTATTAA TTGGTAATGA AATTCTTCAA
GAGCTGAGAG TAATAGCTAA AAATTTATTT GGTAAAATTG ATCATGAATG GCCAATGGGT
AAAGGAGAGA GAAAACTTTT AGGAACTATA TGGTTTTATT TATCTCTTGC GGGCGAAAGA
GATGTGCAAA AAAATTGTGT TGAATCAATT AGTCATTCTT CAATGACAAT ATCAAGGGCG
GCTTTAGGAG CATTAAAGCC ACTCGATAAC AATTTGACCG AAGAAGCTTC TAATTTATTT
TATAACCTTT GGAAAGAAAA TCCAGTGGTC TTAGACTCAT GGTTCGCTTA TGAGGCTTCA
AGACCTCATA AGCGAGGAAT TAATGTGATT GAAAAATTAC TATCACATCC TAAATTTGAT
TGGAAGGCTC CAAATGCCAT ACGAGCTGTT CTGGGAGGAT TTAGTAAAAA CATTGATTTA
TTTCATTCTC TAGATGGACA AGGTTATTTA TTTATGGCTG ATAAATTAAT AGAGGTAGAT
AAAATTAACC CAATAACGGC TTCAAGAATG GTAAAAGTTT TTAGTAAATG GAAAACTTAT
ATAGATAAAA ATAAGGAAGG GATTTATGAA TCACTATTAA AATTAAACAA AGCAAATATA
TCTTCTAATA CAAGAGAGGT AGTGGAACTG ATTTTGAATT AA
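
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed for comparison with the listed GC content. The helper names are hypothetical, and the record does not say how its own percentage was rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Usage: paste the full gene sequence text into `raw`, then
# gc_fraction(clean_sequence(raw)) * 100 gives the GC percentage.
```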
 
Protein sequence
MSTQKSIKLS DYVEYPFLIP SIYLDFDIGT DCVVVQSSMI IKPKKKESSK LVLKGNQIKL 
LSISINGKEL KLPEYSFSDK SLIINSPPKS EFELKIRSQI DPFRNTSLEG LYLSSGMLTT
QCEAEGFRRI CFHPDRPDVL SRYTVRIEAE RTLYPILLSN GNEKYSGNLN SNNLRHEFIW
EDPFPKPCYL FALVAGKLNS VSDTYITNKG RLIDIRIYVE KGDEKYTKHA VNSLKKAMKW
DEDNYGLEYD LDEYKIVAVR HFNMGAMENK GLNIFNSKLV LADSKTATDD ELERIESVIA
HEYFHNWTGN RITCRDWFQL SLKEGLTVFR DQSFTSDLHS KGLKRIEDVS FLRNFQFAED
KGPTSHAVKP KEYVAIDNFY TTTIYEKGAE LIRMLELLLG KEKFFRGINL YIKTFDGSAA
TTEDFINSLI KGAYLEEKNC PFDLDKFLNW YYKSGTPKVY INQSWDSKNS ILNVSFEQKI
DTDKTNDNTE MVIPILYSCY SREKGANPLA ENNLFVLDKN KKYLKINTLP GEQQAPVLSL
FRCFSSPVVW ESDLVIDDYL FLFLNDNDYF SRWDSGQYLM REILKTRLCN KNNFSLEHKF
INAIKQTIKS LEINDPFFLA TLITIPGFAE LESLFEKVDP IRIYSESIDF QVLIGNEILQ
ELRVIAKNLF GKIDHEWPMG KGERKLLGTI WFYLSLAGER DVQKNCVESI SHSSMTISRA
ALGALKPLDN NLTEEASNLF YNLWKENPVV LDSWFAYEAS RPHKRGINVI EKLLSHPKFD
WKAPNAIRAV LGGFSKNIDL FHSLDGQGYL FMADKLIEVD KINPITASRM VKVFSKWKTY
IDKNKEGIYE SLLKLNKANI SSNTREVVEL ILN
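
The protein sequence can be checked against the gene sequence by translating codons in frame. The sketch below builds a codon table from the conventional NCBI ordering of bases (TCAG). Translation table 11 uses the same amino acid assignments as the standard code and differs in its set of permitted start codons, which this sketch does not model. Names such as `translate` are hypothetical helpers, not a library API.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
first_block = "ATGTCAACTC AGAAATCTAT TAAATTATCA"
print(translate(first_block))  # MSTQKSIKLS
```

The result matches the first ten residues of the protein sequence. Passing the whole gene sequence to the same function gives a translation that can be compared with the full protein sequence.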