Gene Nmag_1744 details

Gene Information

Locus tag: Nmag_1744
Symbol:
ID: 8824584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 1777528
End bp: 1780740
Gene Length: 3213 bp
Protein Length: 1070 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: isoleucyl-tRNA synthetase
Protein accession: YP_003479880
Protein GI: 289581414
COG category:
COG ID:
TIGRFAM ID:
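
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, so treat both as assumptions. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1777528
END_BP = 1780740


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3213
print(protein_length(length_bp))  # 1070
```

Under these assumptions both results agree with the listed Gene Length (3213 bp) and Protein Length (1070 aa).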


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAGGT TTAGCGAGGT CGACGACCAG TACGACCCCG ATGCCGTCGA GCAACGGGTG 
TTCGACTACT GGGACGACGT CGATGCCTAC GAGCAAACTG TCGAGCACCG ATCGGACGGT
GAATCGTTCT TCTTCGTCGA CGGCCCGCCG TACACGTCGG GCTCGGCACA CATGGGGACC
ACCTGGAACA AGTCCCTCAA GGACGTCTAC ATCCGCTTCC ACCGGATGCA GGGCTACGAC
GTCACCGACC GGCCGGGCTA CGACATGCAC GGGCTCCCGA TCGAAACCCG CGTCGAGGGC
CAACTCGGCT TCGAGAACAA GAAGGACATC GAGGAGTACG GCGAGGAGAA CTTCATCGAG
GCCTGCAAGG AGTACGCAGA CGAGCAACTC GAGGGCCTCC AGTCTGACTT CCAGGACTTC
GGCGTCTGGA TGGACTGGGA GAACCCGTAT CGGACGGTTA GCCCCGAGTA CATGGAGGCA
GCCTGGTGGG GCTTCTCGAA GGCTGCAGAG CGCGGCTTAG TCGAGAAAGG CCACCGCTCG
ATTTCACAGT GTCCGCGCTG TGAGACGGCA ATCGCGAACA ACGAGGTCGA GTACGAGGAC
GTCGAGGACC CCTCGATCTA CGTGAAGTTC GACCTCGCCG ACCGCGAGGG CTCGATCGTC
ATCTGGACGA CGACGCCGTG GACTATCCCG GCGAACACCT TCGTCGCAGT CGACGAGGAG
GGCGACTACG TCGGCGTCCG CGCCGAGAAG GACGGCGAGG AGGAACTGCT GTACGTCGCC
GAGGCCAAAC ACGAGGACGT GCTGAAGACG GGCCGCTACG ACGACTACGA GGTCGTCGAG
GAGGTCGCCG GCGAAGAGAT GATCGGCTGG GCGTACGAGC ACCCACTCGC CGAGGAAGTG
CCAGACACCG TCGACGCAGA GGGAACCCAC GAGGTCTACG CTGCCGACTA CGTGGACACG
GGCGGCGACG GCACCGGTCT CGTCCACTCC GCACCGGGTC ACGGTGAGGA GGACTTCGAG
CGCGGAACGG AGCTTGGCTT CCCGATCTTC TGTCCCGTCG ATGGCGACGG CGTCTACACC
GAGGAGGCCG GCAAGTACGA AGGCGAGTTC GTCAAGGACG CAGACCCAGA GATCACGGCC
GATCTCGAGG ACAACGGCGC GCTGCTCGCC TCCGAGACGG TTTCCCACAG CTACGGTCAC
TGCTGGCGGT GTGATACGGG CATCATCCAG ATCGTTACCG ACCAGTGGTT CATCACGATC
ACGGACGTGA AAGACGAGCT CCTCGAGAAC ATCGAGGACA GCCAGTGGCA CCCAGACTGG
GCGCGAGACA ACCGCTTCCG CGACTTCGTC GAGGAGGCAC CCGACTGGAA CGTCTCCCGA
CAGCGCTACT GGGGTATTCC GCTGCCCGTC TGGACGCCAG AGGACCGTGA TGATGATGAA
GACATGATCG TCATCGGCGA GCGCGAGGAA CTCGTCGACC GCGTCGATCA GGATATCGAC
GTCGATACGG TCGACCTGCA CAAGGACACC GTCGACGACC TCACGATCAC CGAGGACGGC
ACCACCTACA CTCGCGTGCC CGACGTGTTC GACGTCTGGC TCGACTCTTC GGTCGCCTCC
TGGGGAACCC TGAACTACCC CTCGGACGAC AGCCAGTTCG ACGACCTCTG GCCCGCAGAC
TTCATCCTCG AAGCCCACGA CCAGACCCGC GGCTGGTTCT GGTCCCAGCT CGGCATGGGC
ACCGCCGCAC TCGGCGACAT TCCCTATGAA GAGGTCCTCA TGCACGGCCA CGCGCTCATG
CCCGACGGCC GCGCAATGTC CAAGTCCAAG GACATTCTGG TCGACCCCCA CGAGGCCATC
GACCGCCACG GCCGGGACGT GATGCGTGCG TTCTTGCTGT CGAACAACCC GCAGGGCGAC
GACATGCGCT TCTCCTGGGA GGGCATGCAG ACGATGGAGA ACCACCTCCG GACGCTGTGG
AACGTCTTCC GGTTCCCGCT GCCGTACATG CGGCTCGATG AGTTTGATCC GCAGGCGACG
ACCGTCGAAG ACGTGCAAGC GGATCTCGAA CTCATCGACG AGTGGGTGCT CGCCCGCCTG
CAGTCCACCA AGGACGAGAT GACCGCCCAC TTCGACGAGC GCCGCCAGGA CAAGGCCCTC
AACGCGCTCA TCGACTTCGT CGTCGAGGAC GTCTCACGGT TCTACGTCCA GGCCGTCCGC
GAGCGTATGT GGGAAGAAGA GGACAGCGCC TCCAAGGAAG CCGCCTACGC GACCATCTAC
CGTGTGCTCC GCGAAACCGT CGCGCTGCTC GCTCCCTACG CGCCGTTCAT CAGTGAGGAA
ATCTACGGCA CGCTCACCGG CGACGCCGAA CACGACACCG TCCATATGTG CGACTGGCCC
ACCGTCGACG AGACGTTCGT CGACGAGCAA CTCGAGGAGG ACGTCGCATT CCTTCGCGCC
ATCGAAGAAG CCGGCGCGAA CGCTCGTCAG CAGGCCGGCC GCAAACTGCG CTGGCCCGTC
CCACGCGTCG TCGTCGCAGC CGACGACCAG CGCGTCGTCG ATGCTGTCGA GCGCCACACC
CCACTGCTCG AGGATCGCCT CAACGCCCGC GAGATCGAAC TCGTCTCGCC GGACGACCGC
TGGGGCGAAC TGAACTACAG TGCCGAAGCA GACATGAGCG AACTCGGACC GACGTTCGGC
GACCGCGCCG GCCAGGTCAT GAACGCGCTC AACGAGGCCC GGATCGACGA GCCGACACTC
GAGTCGATCG CAGCGGCTGT GGAGGATGTA CTCGAGTCCG GCGAGGAGAT CACCGAGGAG
ATGGTCTCGT TCGTCACCCA GACGCCGGAG GGTGTCGCCG GCACGGCCTT TGGGCTGAAC
GGCGACGACC GCGGGGTTGC GTACGTCGAC GCCTCGCTCA CTGACGATAT CGAGAGCGAG
GGGTACGCCC GCGAGGTTAT CCGCCGGGTA CAGGAGATGC GCAAGGACCT CGACTTAGAT
GTCGAGGAGC GAATTGCGCT GGAGCTCGAA ATCGAAGACG ACCGCGTTGC CTCGCTGGTC
GACGAGCGTG CGGATCTGAT CCGCGAGGAG GTTCGTGCGG ATGAGTTCGG CGGCGACGTT
GTCGACGATG GTCACCGCAA GGAGTGGGAG GTTGAAGGCG TGGCGATGGA GATCGCGATC
GAGTCGTTGG CAGCGCCGGA AGCGTCTGAA TAA
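
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sequence.count("G") + sequence.count("C")
    return 100.0 * gc_count / len(sequence)


# First line of the gene sequence from this record, used as sample input.
first_line = (
    "ATGAGCAGGT TTAGCGAGGT CGACGACCAG TACGACCCCG ATGCCGTCGA GCAACGGGTG"
)
sequence = clean_sequence(first_line)
print(len(sequence))
print(round(gc_percent(sequence), 1))
```

Running the same functions over the full gene sequence would give a value to compare against the GC content field listed in the record.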
 
Protein sequence
MSRFSEVDDQ YDPDAVEQRV FDYWDDVDAY EQTVEHRSDG ESFFFVDGPP YTSGSAHMGT 
TWNKSLKDVY IRFHRMQGYD VTDRPGYDMH GLPIETRVEG QLGFENKKDI EEYGEENFIE
ACKEYADEQL EGLQSDFQDF GVWMDWENPY RTVSPEYMEA AWWGFSKAAE RGLVEKGHRS
ISQCPRCETA IANNEVEYED VEDPSIYVKF DLADREGSIV IWTTTPWTIP ANTFVAVDEE
GDYVGVRAEK DGEEELLYVA EAKHEDVLKT GRYDDYEVVE EVAGEEMIGW AYEHPLAEEV
PDTVDAEGTH EVYAADYVDT GGDGTGLVHS APGHGEEDFE RGTELGFPIF CPVDGDGVYT
EEAGKYEGEF VKDADPEITA DLEDNGALLA SETVSHSYGH CWRCDTGIIQ IVTDQWFITI
TDVKDELLEN IEDSQWHPDW ARDNRFRDFV EEAPDWNVSR QRYWGIPLPV WTPEDRDDDE
DMIVIGEREE LVDRVDQDID VDTVDLHKDT VDDLTITEDG TTYTRVPDVF DVWLDSSVAS
WGTLNYPSDD SQFDDLWPAD FILEAHDQTR GWFWSQLGMG TAALGDIPYE EVLMHGHALM
PDGRAMSKSK DILVDPHEAI DRHGRDVMRA FLLSNNPQGD DMRFSWEGMQ TMENHLRTLW
NVFRFPLPYM RLDEFDPQAT TVEDVQADLE LIDEWVLARL QSTKDEMTAH FDERRQDKAL
NALIDFVVED VSRFYVQAVR ERMWEEEDSA SKEAAYATIY RVLRETVALL APYAPFISEE
IYGTLTGDAE HDTVHMCDWP TVDETFVDEQ LEEDVAFLRA IEEAGANARQ QAGRKLRWPV
PRVVVAADDQ RVVDAVERHT PLLEDRLNAR EIELVSPDDR WGELNYSAEA DMSELGPTFG
DRAGQVMNAL NEARIDEPTL ESIAAAVEDV LESGEEITEE MVSFVTQTPE GVAGTAFGLN
GDDRGVAYVD ASLTDDIESE GYAREVIRRV QEMRKDLDLD VEERIALELE IEDDRVASLV
DERADLIREE VRADEFGGDV VDDGHRKEWE VEGVAMEIAI ESLAAPEASE
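
The protein sequence can be checked against the gene sequence by translating codons. The sketch below is a hand-rolled illustration, not the database's own method. It builds the codon-to-amino-acid mapping that translation table 11 shares with the standard code. Table 11 also permits alternative start codons, which this sketch ignores; that is safe here because the gene starts with ATG. `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are the same for the
# standard code and translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
print(translate("ATGAGCAGGT TTAGCGAGGT CGACGACCAG"))  # MSRFSEVDDQ
```

The output matches the first ten residues of the protein sequence above, and the last 33 bases of the gene translate to the final ten residues (ESLAAPEASE) followed by a TAA stop codon.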