Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_03171 |
Symbol | ileS |
ID | 4780471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 293790 |
End bp | 296693 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640083583 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001014146 |
Protein GI | 124025030 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATAACA TCAATAAAAA TTCTCAAAAG GATCGTCCCA CTTACAAAGA CACTCTCAAC CTTTTGCAGA CTAATTTTGG AATGAGGGCA AATGCAACTC TAAGAGAACC TGAGTTACAA GCTTTTTGGA GAGAAAAAAA TATAGATTTC GAATTAGGCT TAAATAATAC TGGAGAGACT TTTACTTTGC ATGATGGCCC GCCATATGCA AATGGAACGC TTCATATGGG GCATGCTCTC AACAAAGTAT TGAAGGACAT AATCAATAAA TTTCAAACAA TGAAAGGGAA AAAAGTTTGT TATGTCCCTG GATGGGATTG CCATGGATTG CCTATTGAAT TGAAAGTTCT TCAAGCTATG GATAAAAGTC AACGAGCTGA ATTAACACCT ATTAAGTTGA GAAAAAAAGC TGCTGCTTAT GCAAAAAAGC AAGTTTCCCA ACAAATGGAT GGTTTTAAAA GATGGGGCGT ATGGGGTGAC TGGGATCAAC CATATTTAAG TTTAGACAAA AAGTTTGAGG CCTCTCAAAT CAAGTTGTTT GGTGAAATGG TCTTCAAGGG ATACATATAT CGAGGCCTAA AACCAGTTCA TTGGAGTCCA AGTTCTCAAA CAGCTCTGGC CGAGGCGGAA TTAGAATATC CAACCGGTCA TACTAGCAAA AGTATTTATG TGGGATTTAA AGTTAATCAA ATACCAAAAA GATTAACTCA AGAAATTTCT AAGCAAGCTC CAGATCTTAT TAATTCTGAA GGGAAATTAA AAGAAGTAAA ACTTGTCATT TGGACTACTA CTCCTTGGAC AATTCCTGCA AATGAGGCCA TTTCTGTTAA CCAAAAATTA GAATATGTAA TTGCACAAAG TTCTGATCGT TCATTGATAA TTATTGCTAA CGATCTTTTG GATGAAGTAT CTAAGAGTGT AGGAATTAAT TATGAAAAAA GAGTATTAAT CAAAGGATCA ATCTTAGATG GAATTATATA TAAACATCCT TTATTTGATA AAATAAGCCC TGTTGTTTTA GGAGGAGATT ATATTACAAC TGAATCCGGA ACTGGATTAG TACATACTGC TCCAGGTCAT GGTGTTGATG ATTTTAATAC TGGTAAAAAA TATAATTTAT CAATTTCTTG CACAGTTGAT GCAAAGGGTT TTCTAACGAA AGAAGCCGGT AAATATGAAG GTCTAAATGT ATTAAAAGAT GCTAATAGTG TCATAATAAG TGATCTAATT AATTCTGGAT CTTTGCTTAA AGAAATTCCA TATGAGCATA GGTATCCTTA TGATTGGAGA ACTAAAAAAC CAACTATTTT TAGAGCTACA GAACAATGGT TTGCTTCCGT TGAAGGATTT AGAGATAAAG CCCTTTCTGC CATAGAAGAT GTTATTTGGC TTCCTGAATC GGGAAAAAAT AGAATTAATT CTATGGTTAG AGAAAGAGGA GATTGGTGTA TCTCCCGACA AAGGACCTGG GGAGTTCCAA TACCAGTATT TTATGAAAAG AATGGACAAG AAATCTTGCT CAATAAAGAA ACTATTTCTC ATATAGCTGA TTTATTTTCT GTTCATGGAG CAGATATTTG GTGGGAATAT GAAGTATCTG AGCTATTACC TCCTTCTTAT TTAAATCAGG CAGATCGATG GCAAAAAGGT ACTGATACTA TGGATGTTTG GTTTGACTCT GGCTCTAGTT GGTCTTCAGT TATTTCTAAG AAAGAAAATT TAAACTATCC AGCAGATTTA TATTTGGAGG GATCTGATCA ACATCGGGGT TGGTTCCAGT CCTCTTTATT AACTTCGGTA GCAGTGAATG AACATGCACC TTTTAAAAAG GTCCTTACTC ATGGTTTTGC ATTAGATGAG AATGGTAGGA AGATGAGTAA ATCCTTAGGA AACATTATTG ATCCTTTAGT TATAATTAAT GGTGGTTCAA ATAAGAAATT AGATCCTGCG TATGGAGCTG ATGTTTTGAG GTTATGGGTT AGTTCTGTTG ATTATTCTGC AGATGTTCCT ATTGGATCAA ACATACTAAA GCAAATTTCT GATGTTTATC GTAAGGTTCG AAATACGTCT AGGTATCTAT TAGGTAACCT CTATGATTTT GATTATAAAA TTGATTCCAT TGATATTGCT AACTTACCAT TGTTAGATAA GTGGATGTTG AATAGAACAG CTGAAGTAAT TGATGAAATA TCAGATGCAT ATAATAATTT TGAATTTTCT AAATTTTTCC AAACAATTCA AAATTTTTGT GTTGTTGATC TATCTAATTT TTACTTAGAT ATTGCAAAAG ATAGGTTGTA TGTGAGTTCT AAATCTGACT TTAGAAGAAG AAGTTGTCAG ACAGTTTTAT CCTTGGTAAT TGAAAAAATA TCTGGATTAA TTGCACCTGT TTTATGTCAT ATGGCAGAAG ATATTTGGCA GAATATTCCA TATGACTTAG AGGAAGCCTC AGTATTTCAA AGAGGATGGC CTAATGTACC TAAATCATGG CGAAATAGTA GTTTTAATTG TCATGTGACT GAACTCCGTA AACTCAGAGC AGTTATTAAT CGTATGTTGG AGAGTTGTAG AAATAATCAA GCGTTAGGTT CTTCTTTGGA AGCATCAGTA AGGGTTGATA TATCTGATGA AAAAGTTCAA GCTGCTATTG AATGGTTAGC TGAAAGCGAA TCTAATAATG TTGATGTATT AAGAGATTGG TTCCTAGTTT CATCTTTACA AATTGGCGGT GAGCCATGGG CTGAGGTTTT AGTTAGTGAG GACAATGATT ATGCTTCAGT CGAGATTGCA AAAGCAAGGG GATTTAAGTG TGAAAGATGT TGGCATTATG AAATAGAAAT GAGCAAGAAT CCTCAACATA CAAATATTTG CAAAAGGTGC GAAAAAGTAG TCTTAGCTAT TTAA
|
Protein sequence | MNNINKNSQK DRPTYKDTLN LLQTNFGMRA NATLREPELQ AFWREKNIDF ELGLNNTGET FTLHDGPPYA NGTLHMGHAL NKVLKDIINK FQTMKGKKVC YVPGWDCHGL PIELKVLQAM DKSQRAELTP IKLRKKAAAY AKKQVSQQMD GFKRWGVWGD WDQPYLSLDK KFEASQIKLF GEMVFKGYIY RGLKPVHWSP SSQTALAEAE LEYPTGHTSK SIYVGFKVNQ IPKRLTQEIS KQAPDLINSE GKLKEVKLVI WTTTPWTIPA NEAISVNQKL EYVIAQSSDR SLIIIANDLL DEVSKSVGIN YEKRVLIKGS ILDGIIYKHP LFDKISPVVL GGDYITTESG TGLVHTAPGH GVDDFNTGKK YNLSISCTVD AKGFLTKEAG KYEGLNVLKD ANSVIISDLI NSGSLLKEIP YEHRYPYDWR TKKPTIFRAT EQWFASVEGF RDKALSAIED VIWLPESGKN RINSMVRERG DWCISRQRTW GVPIPVFYEK NGQEILLNKE TISHIADLFS VHGADIWWEY EVSELLPPSY LNQADRWQKG TDTMDVWFDS GSSWSSVISK KENLNYPADL YLEGSDQHRG WFQSSLLTSV AVNEHAPFKK VLTHGFALDE NGRKMSKSLG NIIDPLVIIN GGSNKKLDPA YGADVLRLWV SSVDYSADVP IGSNILKQIS DVYRKVRNTS RYLLGNLYDF DYKIDSIDIA NLPLLDKWML NRTAEVIDEI SDAYNNFEFS KFFQTIQNFC VVDLSNFYLD IAKDRLYVSS KSDFRRRSCQ TVLSLVIEKI SGLIAPVLCH MAEDIWQNIP YDLEEASVFQ RGWPNVPKSW RNSSFNCHVT ELRKLRAVIN RMLESCRNNQ ALGSSLEASV RVDISDEKVQ AAIEWLAESE SNNVDVLRDW FLVSSLQIGG EPWAEVLVSE DNDYASVEIA KARGFKCERC WHYEIEMSKN PQHTNICKRC EKVVLAI
|
| |