Gene Nmar_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1120 
Symbol 
ID5773659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1022506 
End bp1025703 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content35% 
IMG OID641316762 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_001582454 
Protein GI161528628 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.439306 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTAT CTACAAAGTT TGATGCAAAA GCAATCGAAT CAGAGATTAA AGAATATGTA 
AAATCAATTG ATTTAGAAAA GCAGATTTTT GCATCAGACA AGCCTGAAAA GATCAGATTC
ATTGAAGGTC CACCTACAAT GAACGGAATT CCTCACGCAG GACATCTTAG AGGCAGAGTC
ATCAAAGATT TGTGGTATAG GTACAATACA CTCCAAGGGA AGAAGATTGA ATTTAATGGA
GGATGGGACA CACAAGGACT TCCTGTAGAA CTACAAGTTG AAAAAGAGCT GGGAGTTACA
GGTGGAAAGA CTGAAGCAAT CAAACAATTT GGGGTAGAGA GAATTGTATC TGAATGCAAA
AAAGTTGTTG AAAAATACAA CAAGACATGG GTAGAAGTTG ATGAATTACT TGGGATGTCA
TTTAATCATG ATAAAGCATA TTGGACTTTT AGAGATGAAT TTATTGAAAG AGAATGGCAA
GTTCTAAAAA AAGCACACGA GAATGGAATT TTAGAAGAAG ATTTTACAGT TATTGCATAT
TGTCCTAGTT GTCAGACATC ACTTAGTCAT GCAGAAGTTA ATCAAGGGTA TGAAGAAGTA
AAGGACCCAT CACTATACTA TAAAGTAAAA CTGGTAGATG AAGATGTATT TTTGATTGTA
TGGACTACAA TGCCATTTAC ACTAGTTACT GATGCAATGG TAGGATTACA GCCAGAAGAA
GATTATGCTT ATGTCAAAGT AGAAAATGAA ACTTGGGTTG TTGGAAAAAC AAGATTAGAA
GAATTCATGA CTGAAGTAAA AATTGAAGAT TACAAAATCG AAAAGACTGT CAAAGGTTCA
GAATTTGAAG GGAAAAAATA CATCCATCCA TTATTGGATT TGATTCCAGA GTTAAACGAA
TGCTCAAAGG CAGACAATTT CCATGTAGCA GTATCAGAAT CGTTCGTTGA TGCTAGTACT
GGTAGTGGTC TTGTACATCT GTCTCCAGCA AACGGTGAGG AAGATATCAA GATTGCAAAC
AAGAGAAAAG TCAAGATTTT CAGCCCCATA GATGACGAGG TAAAGTTCAC TTCTCAGGCA
GGAAAATACC AAGGAATGTT TGTCAGAGAT GCGGACAGAC CAATTGTAGA AGATTTGAAA
GAGTGTAATG CTCTAGTAAA GATTGGCAAA ATCAAACACA AGTATCCACT TTGTTGGAGA
TCACACCATC CAATTGTATG GCTTGCAAGA AAGGGTTGGT TCTACAAACT AGACAGACTA
GATAACAAGG CAATTGATGC AGCAGAAAGT GTAGAGTATT ATTTTGAACA ACCAAAAAAC
AGATTCTTAG GAATTATCAA AGAGAGACAT CCTTGGTGTA TCTCAAGGGA GAGAATTTGG
GGATGTCCAT TGCCAGTATG GGCATGTGAA GAATGCAATG AAAGAAACTG GTTTTTTACA
AGAAAAGAGA TTGTAGAATC AGCTGACAAT CTTCCTGATG GCCCAGACTT TGAATTACAC
AGACCATGGA TTGATAACAT TACAATAAAG TGTAAAAAAT GTGGCAGTAC AAAAACAAAG
AGAGAAGAAT ATGTTTTAGA TACTTGGCAC AATAGTGGTT CTGCACCATA TTCATCATTA
ACTGATGAAG AGTACACAAA CGAAATTCCA GCACCATTCT TCACTGAAGG AATTGATCAA
ACTAGAGGAT GGGCATATAC ACTACTCATT GAAAATGTAA TTCTAAACAA CGGACCAACC
CCACCATACA AGTCATTCTT GTTCCAAGGA CATGTACTTG ATGAGAAAGG AGGCAAGATG
AGCAAGAGTA AAGGCAATGT TTTGGAAGGA ATTGAATTAC TAGAAAAATA TCCTGCAGAT
TTGATTAGAT TTTATTTCAT GTGGAAGGCT AGTCCAATTG AACCACTTAG TTTCAGTACA
GAAGAGTTAA TGTCAAGACC ATATCAAGTA ATCAACACAC TATTCAATTT ACACTTGTAC
TTTAAGCAAA ACAGCCAGTA TGATAACTTT GAAAAAGAAA ACACGATAGA GTGGGCAAAA
CAAAAGAATT TGTTAACATC ACCAGACATT TGGCTCTTAT CAAAACTTCA AAAATTAATC
TCAAAAATTA CAGACCGCAA TGATTCTTGC AAATTTCATG AAGGTGCAAA AGCAATTGAC
GACTTTATCA TTAACAATCT AAGTCAAATC TACATTCCAA TTACAAGAGG AGAATTGTGG
GATGAAGACG AGGATAAAAA GGAGAGAAGG CTTGCAATTT ATGCAGTACT AGAGGAAGTT
CTAAGAACAT TAGATATCTT GATTCATCCA TTTTGTCCAT TTACCAGTGA GCATCTGTAC
CAAACAGTCT TTGATGGAAA ACAAAGCATA CTACTAGACA AATGGCCAAA ATCACAAGAG
TCACTAGTCA ATGAAGAGAT TGAAGAATCA TTTGACATTA TGAAAGATGT AGTATCAGTT
TCATCAGCTG CAAGAATGAA AGGCAAACTC AAAAGAAGAT GGCCATTAAA TGAAGCAAAA
ATTTGTGTTA AGAAAGGACT AAAGTCAAAG TTAGAATCAC TATCAGAACT ACTCCAGTCT
CAGCTAAATG TAGAGAAATT CAGCATTGCT GAAACTGAAA AAGAATCAGG ATTAGAACAA
ATTTTAGAAT TAAAACAACT AGGACTGCCT GCAAAGCCAA TTGTTGAATT GGAAAGAAAG
AGAATCGGGC CAAAAGCAAA ACAACACATG GGAAAACTTG TTGCAAAGTT CTCTGAAACA
AATCCTGATG AAATAATTTC ATCTTTACAA AATAATTCAA AGTTTGATTT TGATATTGAT
GATGAAACGA TTTCACTTGA CAATGAAGAT TTTGTTGTAG ACTTTGATGC TGATGAAAAT
TATGCAATGT CAAAAAGAGA TGATTATGTA GTGTTTATCT CAACATCGCG AAACAAAGAG
ATGATGGCAA AAGGATTAGT CAAAGATGTT GCAAGAAGAT TGCAAACTTT GAGAAAAGAG
AGAGGATACA ATCCAACTGA TGTTTTAGGA GTCGCATCAA TTCTTGATTT AGATGAAGAA
TCACTTGAAA TGATCAAAGA AAAATCAGAA GACTTGGCTT TCTTGGTAAG AGTAAAGCAA
GTCAACTTTA CAGAATCATG TAAAGAATAC AAAGATGACG ACATTGATGG TCAGAAGATT
AGAATTTCAG TAGAGTAA
 
Protein sequence
MELSTKFDAK AIESEIKEYV KSIDLEKQIF ASDKPEKIRF IEGPPTMNGI PHAGHLRGRV 
IKDLWYRYNT LQGKKIEFNG GWDTQGLPVE LQVEKELGVT GGKTEAIKQF GVERIVSECK
KVVEKYNKTW VEVDELLGMS FNHDKAYWTF RDEFIEREWQ VLKKAHENGI LEEDFTVIAY
CPSCQTSLSH AEVNQGYEEV KDPSLYYKVK LVDEDVFLIV WTTMPFTLVT DAMVGLQPEE
DYAYVKVENE TWVVGKTRLE EFMTEVKIED YKIEKTVKGS EFEGKKYIHP LLDLIPELNE
CSKADNFHVA VSESFVDAST GSGLVHLSPA NGEEDIKIAN KRKVKIFSPI DDEVKFTSQA
GKYQGMFVRD ADRPIVEDLK ECNALVKIGK IKHKYPLCWR SHHPIVWLAR KGWFYKLDRL
DNKAIDAAES VEYYFEQPKN RFLGIIKERH PWCISRERIW GCPLPVWACE ECNERNWFFT
RKEIVESADN LPDGPDFELH RPWIDNITIK CKKCGSTKTK REEYVLDTWH NSGSAPYSSL
TDEEYTNEIP APFFTEGIDQ TRGWAYTLLI ENVILNNGPT PPYKSFLFQG HVLDEKGGKM
SKSKGNVLEG IELLEKYPAD LIRFYFMWKA SPIEPLSFST EELMSRPYQV INTLFNLHLY
FKQNSQYDNF EKENTIEWAK QKNLLTSPDI WLLSKLQKLI SKITDRNDSC KFHEGAKAID
DFIINNLSQI YIPITRGELW DEDEDKKERR LAIYAVLEEV LRTLDILIHP FCPFTSEHLY
QTVFDGKQSI LLDKWPKSQE SLVNEEIEES FDIMKDVVSV SSAARMKGKL KRRWPLNEAK
ICVKKGLKSK LESLSELLQS QLNVEKFSIA ETEKESGLEQ ILELKQLGLP AKPIVELERK
RIGPKAKQHM GKLVAKFSET NPDEIISSLQ NNSKFDFDID DETISLDNED FVVDFDADEN
YAMSKRDDYV VFISTSRNKE MMAKGLVKDV ARRLQTLRKE RGYNPTDVLG VASILDLDEE
SLEMIKEKSE DLAFLVRVKQ VNFTESCKEY KDDDIDGQKI RISVE