Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_15041 |
Symbol | pepN |
ID | 4780664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1215893 |
End bp | 1218514 |
Gene Length | 2622 bp |
Protein Length | 873 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640084785 |
Product | aminopeptidase N |
Protein accession | YP_001015326 |
Protein GI | 124026210 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.168245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAACTC AGAAATCTAT TAAATTATCA GACTATGTTG AATATCCTTT CTTAATACCC AGTATATATT TAGATTTTGA TATCGGTACG GATTGTGTTG TTGTTCAATC TTCAATGATA ATTAAGCCAA AGAAGAAAGA ATCTTCAAAG CTTGTTCTTA AGGGTAATCA AATTAAATTA TTATCAATAT CAATAAATGG AAAAGAATTG AAGTTGCCTG AATATTCTTT TTCCGATAAA AGCTTGATTA TAAATAGTCC TCCAAAATCA GAATTTGAAT TAAAAATAAG ATCTCAAATA GATCCTTTTA GAAATACATC ATTAGAAGGA TTATATTTAA GTTCAGGAAT GTTAACTACA CAATGTGAGG CTGAAGGATT TAGAAGAATT TGTTTTCATC CTGATAGACC AGATGTTTTA AGTAGATATA CAGTAAGAAT AGAGGCAGAA AGAACTTTGT ATCCTATATT ATTATCTAAT GGGAATGAGA AGTATTCAGG TAATTTAAAT AGTAATAATC TTAGACATGA ATTTATATGG GAAGATCCCT TCCCAAAACC TTGCTATTTA TTCGCTTTAG TCGCTGGTAA ATTAAATTCA GTCTCGGATA CATATATTAC AAATAAGGGA AGATTAATTG ATATTAGAAT TTATGTGGAA AAAGGAGATG AGAAATATAC AAAACATGCC GTTAACTCAC TAAAAAAGGC AATGAAATGG GATGAGGATA ATTACGGTCT TGAATATGAT TTAGATGAAT ATAAAATTGT TGCTGTAAGG CATTTTAATA TGGGAGCGAT GGAAAATAAG GGACTAAATA TTTTTAATTC CAAGTTGGTA TTAGCTGACT CAAAGACTGC GACTGATGAT GAATTAGAGA GAATAGAAAG TGTTATAGCA CACGAATATT TTCACAATTG GACTGGCAAT AGAATTACAT GCCGTGATTG GTTTCAGCTT TCATTAAAAG AAGGTCTGAC AGTTTTTAGA GATCAATCTT TTACCTCAGA CCTACATAGT AAAGGACTAA AAAGAATAGA AGACGTCTCA TTTCTTAGAA ATTTTCAATT CGCTGAAGAT AAGGGCCCTA CCTCTCATGC GGTTAAGCCT AAAGAGTATG TAGCGATCGA TAATTTTTAT ACAACAACAA TATATGAGAA GGGTGCTGAG TTGATAAGAA TGCTTGAGCT ATTGCTAGGA AAAGAAAAGT TTTTTAGAGG TATTAATCTT TATATTAAAA CTTTTGATGG TAGTGCCGCT ACAACTGAGG ATTTTATTAA TTCATTAATA AAAGGTGCTT ATCTGGAAGA AAAAAACTGT CCTTTTGATT TAGATAAATT TCTTAATTGG TATTATAAAT CAGGTACTCC GAAAGTTTAT ATAAATCAAT CTTGGGATTC AAAGAATTCA ATTTTGAATG TCTCTTTTGA GCAGAAAATA GATACCGATA AAACTAATGA TAATACCGAA ATGGTTATTC CAATTCTTTA TTCTTGTTAT AGCAGAGAAA AAGGAGCTAA TCCCTTGGCT GAGAATAATT TATTTGTTTT AGATAAAAAT AAAAAGTATC TAAAAATAAA TACTCTCCCA GGTGAGCAAC AAGCTCCAGT TCTATCACTT TTTAGATGTT TTTCTTCACC TGTTGTTTGG GAATCTGATT TAGTTATAGA TGACTATCTT TTCCTTTTTT TAAATGATAA TGATTATTTT TCGAGGTGGG ACTCTGGTCA GTATTTGATG CGTGAAATTT TGAAAACTAG GCTTTGCAAT AAAAACAATT TCTCATTGGA GCATAAGTTT ATTAATGCTA TTAAACAAAC TATAAAATCT TTAGAAATTA ATGATCCATT TTTTTTAGCA ACTCTTATAA CAATACCTGG TTTTGCGGAG TTGGAATCCT TATTCGAAAA AGTTGATCCA ATAAGAATTT ATAGTGAGTC CATAGATTTC CAAGTATTAA TTGGTAATGA AATTCTTCAA GAGCTGAGAG TAATAGCTAA AAATTTATTT GGTAAAATTG ATCATGAATG GCCAATGGGT AAAGGAGAGA GAAAACTTTT AGGAACTATA TGGTTTTATT TATCTCTTGC GGGCGAAAGA GATGTGCAAA AAAATTGTGT TGAATCAATT AGTCATTCTT CAATGACAAT ATCAAGGGCG GCTTTAGGAG CATTAAAGCC ACTCGATAAC AATTTGACCG AAGAAGCTTC TAATTTATTT TATAACCTTT GGAAAGAAAA TCCAGTGGTC TTAGACTCAT GGTTCGCTTA TGAGGCTTCA AGACCTCATA AGCGAGGAAT TAATGTGATT GAAAAATTAC TATCACATCC TAAATTTGAT TGGAAGGCTC CAAATGCCAT ACGAGCTGTT CTGGGAGGAT TTAGTAAAAA CATTGATTTA TTTCATTCTC TAGATGGACA AGGTTATTTA TTTATGGCTG ATAAATTAAT AGAGGTAGAT AAAATTAACC CAATAACGGC TTCAAGAATG GTAAAAGTTT TTAGTAAATG GAAAACTTAT ATAGATAAAA ATAAGGAAGG GATTTATGAA TCACTATTAA AATTAAACAA AGCAAATATA TCTTCTAATA CAAGAGAGGT AGTGGAACTG ATTTTGAATT AA
|
Protein sequence | MSTQKSIKLS DYVEYPFLIP SIYLDFDIGT DCVVVQSSMI IKPKKKESSK LVLKGNQIKL LSISINGKEL KLPEYSFSDK SLIINSPPKS EFELKIRSQI DPFRNTSLEG LYLSSGMLTT QCEAEGFRRI CFHPDRPDVL SRYTVRIEAE RTLYPILLSN GNEKYSGNLN SNNLRHEFIW EDPFPKPCYL FALVAGKLNS VSDTYITNKG RLIDIRIYVE KGDEKYTKHA VNSLKKAMKW DEDNYGLEYD LDEYKIVAVR HFNMGAMENK GLNIFNSKLV LADSKTATDD ELERIESVIA HEYFHNWTGN RITCRDWFQL SLKEGLTVFR DQSFTSDLHS KGLKRIEDVS FLRNFQFAED KGPTSHAVKP KEYVAIDNFY TTTIYEKGAE LIRMLELLLG KEKFFRGINL YIKTFDGSAA TTEDFINSLI KGAYLEEKNC PFDLDKFLNW YYKSGTPKVY INQSWDSKNS ILNVSFEQKI DTDKTNDNTE MVIPILYSCY SREKGANPLA ENNLFVLDKN KKYLKINTLP GEQQAPVLSL FRCFSSPVVW ESDLVIDDYL FLFLNDNDYF SRWDSGQYLM REILKTRLCN KNNFSLEHKF INAIKQTIKS LEINDPFFLA TLITIPGFAE LESLFEKVDP IRIYSESIDF QVLIGNEILQ ELRVIAKNLF GKIDHEWPMG KGERKLLGTI WFYLSLAGER DVQKNCVESI SHSSMTISRA ALGALKPLDN NLTEEASNLF YNLWKENPVV LDSWFAYEAS RPHKRGINVI EKLLSHPKFD WKAPNAIRAV LGGFSKNIDL FHSLDGQGYL FMADKLIEVD KINPITASRM VKVFSKWKTY IDKNKEGIYE SLLKLNKANI SSNTREVVEL ILN
|
| |