Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1120 |
Symbol | |
ID | 5773659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 1022506 |
End bp | 1025703 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641316762 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001582454 |
Protein GI | 161528628 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.439306 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACTAT CTACAAAGTT TGATGCAAAA GCAATCGAAT CAGAGATTAA AGAATATGTA AAATCAATTG ATTTAGAAAA GCAGATTTTT GCATCAGACA AGCCTGAAAA GATCAGATTC ATTGAAGGTC CACCTACAAT GAACGGAATT CCTCACGCAG GACATCTTAG AGGCAGAGTC ATCAAAGATT TGTGGTATAG GTACAATACA CTCCAAGGGA AGAAGATTGA ATTTAATGGA GGATGGGACA CACAAGGACT TCCTGTAGAA CTACAAGTTG AAAAAGAGCT GGGAGTTACA GGTGGAAAGA CTGAAGCAAT CAAACAATTT GGGGTAGAGA GAATTGTATC TGAATGCAAA AAAGTTGTTG AAAAATACAA CAAGACATGG GTAGAAGTTG ATGAATTACT TGGGATGTCA TTTAATCATG ATAAAGCATA TTGGACTTTT AGAGATGAAT TTATTGAAAG AGAATGGCAA GTTCTAAAAA AAGCACACGA GAATGGAATT TTAGAAGAAG ATTTTACAGT TATTGCATAT TGTCCTAGTT GTCAGACATC ACTTAGTCAT GCAGAAGTTA ATCAAGGGTA TGAAGAAGTA AAGGACCCAT CACTATACTA TAAAGTAAAA CTGGTAGATG AAGATGTATT TTTGATTGTA TGGACTACAA TGCCATTTAC ACTAGTTACT GATGCAATGG TAGGATTACA GCCAGAAGAA GATTATGCTT ATGTCAAAGT AGAAAATGAA ACTTGGGTTG TTGGAAAAAC AAGATTAGAA GAATTCATGA CTGAAGTAAA AATTGAAGAT TACAAAATCG AAAAGACTGT CAAAGGTTCA GAATTTGAAG GGAAAAAATA CATCCATCCA TTATTGGATT TGATTCCAGA GTTAAACGAA TGCTCAAAGG CAGACAATTT CCATGTAGCA GTATCAGAAT CGTTCGTTGA TGCTAGTACT GGTAGTGGTC TTGTACATCT GTCTCCAGCA AACGGTGAGG AAGATATCAA GATTGCAAAC AAGAGAAAAG TCAAGATTTT CAGCCCCATA GATGACGAGG TAAAGTTCAC TTCTCAGGCA GGAAAATACC AAGGAATGTT TGTCAGAGAT GCGGACAGAC CAATTGTAGA AGATTTGAAA GAGTGTAATG CTCTAGTAAA GATTGGCAAA ATCAAACACA AGTATCCACT TTGTTGGAGA TCACACCATC CAATTGTATG GCTTGCAAGA AAGGGTTGGT TCTACAAACT AGACAGACTA GATAACAAGG CAATTGATGC AGCAGAAAGT GTAGAGTATT ATTTTGAACA ACCAAAAAAC AGATTCTTAG GAATTATCAA AGAGAGACAT CCTTGGTGTA TCTCAAGGGA GAGAATTTGG GGATGTCCAT TGCCAGTATG GGCATGTGAA GAATGCAATG AAAGAAACTG GTTTTTTACA AGAAAAGAGA TTGTAGAATC AGCTGACAAT CTTCCTGATG GCCCAGACTT TGAATTACAC AGACCATGGA TTGATAACAT TACAATAAAG TGTAAAAAAT GTGGCAGTAC AAAAACAAAG AGAGAAGAAT ATGTTTTAGA TACTTGGCAC AATAGTGGTT CTGCACCATA TTCATCATTA ACTGATGAAG AGTACACAAA CGAAATTCCA GCACCATTCT TCACTGAAGG AATTGATCAA ACTAGAGGAT GGGCATATAC ACTACTCATT GAAAATGTAA TTCTAAACAA CGGACCAACC CCACCATACA AGTCATTCTT GTTCCAAGGA CATGTACTTG ATGAGAAAGG AGGCAAGATG AGCAAGAGTA AAGGCAATGT TTTGGAAGGA ATTGAATTAC TAGAAAAATA TCCTGCAGAT TTGATTAGAT TTTATTTCAT GTGGAAGGCT AGTCCAATTG AACCACTTAG TTTCAGTACA GAAGAGTTAA TGTCAAGACC ATATCAAGTA ATCAACACAC TATTCAATTT ACACTTGTAC TTTAAGCAAA ACAGCCAGTA TGATAACTTT GAAAAAGAAA ACACGATAGA GTGGGCAAAA CAAAAGAATT TGTTAACATC ACCAGACATT TGGCTCTTAT CAAAACTTCA AAAATTAATC TCAAAAATTA CAGACCGCAA TGATTCTTGC AAATTTCATG AAGGTGCAAA AGCAATTGAC GACTTTATCA TTAACAATCT AAGTCAAATC TACATTCCAA TTACAAGAGG AGAATTGTGG GATGAAGACG AGGATAAAAA GGAGAGAAGG CTTGCAATTT ATGCAGTACT AGAGGAAGTT CTAAGAACAT TAGATATCTT GATTCATCCA TTTTGTCCAT TTACCAGTGA GCATCTGTAC CAAACAGTCT TTGATGGAAA ACAAAGCATA CTACTAGACA AATGGCCAAA ATCACAAGAG TCACTAGTCA ATGAAGAGAT TGAAGAATCA TTTGACATTA TGAAAGATGT AGTATCAGTT TCATCAGCTG CAAGAATGAA AGGCAAACTC AAAAGAAGAT GGCCATTAAA TGAAGCAAAA ATTTGTGTTA AGAAAGGACT AAAGTCAAAG TTAGAATCAC TATCAGAACT ACTCCAGTCT CAGCTAAATG TAGAGAAATT CAGCATTGCT GAAACTGAAA AAGAATCAGG ATTAGAACAA ATTTTAGAAT TAAAACAACT AGGACTGCCT GCAAAGCCAA TTGTTGAATT GGAAAGAAAG AGAATCGGGC CAAAAGCAAA ACAACACATG GGAAAACTTG TTGCAAAGTT CTCTGAAACA AATCCTGATG AAATAATTTC ATCTTTACAA AATAATTCAA AGTTTGATTT TGATATTGAT GATGAAACGA TTTCACTTGA CAATGAAGAT TTTGTTGTAG ACTTTGATGC TGATGAAAAT TATGCAATGT CAAAAAGAGA TGATTATGTA GTGTTTATCT CAACATCGCG AAACAAAGAG ATGATGGCAA AAGGATTAGT CAAAGATGTT GCAAGAAGAT TGCAAACTTT GAGAAAAGAG AGAGGATACA ATCCAACTGA TGTTTTAGGA GTCGCATCAA TTCTTGATTT AGATGAAGAA TCACTTGAAA TGATCAAAGA AAAATCAGAA GACTTGGCTT TCTTGGTAAG AGTAAAGCAA GTCAACTTTA CAGAATCATG TAAAGAATAC AAAGATGACG ACATTGATGG TCAGAAGATT AGAATTTCAG TAGAGTAA
|
Protein sequence | MELSTKFDAK AIESEIKEYV KSIDLEKQIF ASDKPEKIRF IEGPPTMNGI PHAGHLRGRV IKDLWYRYNT LQGKKIEFNG GWDTQGLPVE LQVEKELGVT GGKTEAIKQF GVERIVSECK KVVEKYNKTW VEVDELLGMS FNHDKAYWTF RDEFIEREWQ VLKKAHENGI LEEDFTVIAY CPSCQTSLSH AEVNQGYEEV KDPSLYYKVK LVDEDVFLIV WTTMPFTLVT DAMVGLQPEE DYAYVKVENE TWVVGKTRLE EFMTEVKIED YKIEKTVKGS EFEGKKYIHP LLDLIPELNE CSKADNFHVA VSESFVDAST GSGLVHLSPA NGEEDIKIAN KRKVKIFSPI DDEVKFTSQA GKYQGMFVRD ADRPIVEDLK ECNALVKIGK IKHKYPLCWR SHHPIVWLAR KGWFYKLDRL DNKAIDAAES VEYYFEQPKN RFLGIIKERH PWCISRERIW GCPLPVWACE ECNERNWFFT RKEIVESADN LPDGPDFELH RPWIDNITIK CKKCGSTKTK REEYVLDTWH NSGSAPYSSL TDEEYTNEIP APFFTEGIDQ TRGWAYTLLI ENVILNNGPT PPYKSFLFQG HVLDEKGGKM SKSKGNVLEG IELLEKYPAD LIRFYFMWKA SPIEPLSFST EELMSRPYQV INTLFNLHLY FKQNSQYDNF EKENTIEWAK QKNLLTSPDI WLLSKLQKLI SKITDRNDSC KFHEGAKAID DFIINNLSQI YIPITRGELW DEDEDKKERR LAIYAVLEEV LRTLDILIHP FCPFTSEHLY QTVFDGKQSI LLDKWPKSQE SLVNEEIEES FDIMKDVVSV SSAARMKGKL KRRWPLNEAK ICVKKGLKSK LESLSELLQS QLNVEKFSIA ETEKESGLEQ ILELKQLGLP AKPIVELERK RIGPKAKQHM GKLVAKFSET NPDEIISSLQ NNSKFDFDID DETISLDNED FVVDFDADEN YAMSKRDDYV VFISTSRNKE MMAKGLVKDV ARRLQTLRKE RGYNPTDVLG VASILDLDEE SLEMIKEKSE DLAFLVRVKQ VNFTESCKEY KDDDIDGQKI RISVE
|
| |