Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_2983 |
Symbol | |
ID | 8825843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | - |
Start bp | 3069797 |
End bp | 3072643 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Protein of unknown function DUF1998 |
Protein accession | YP_003481097 |
Protein GI | 289582631 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACGA ACGAGGACGG AGATGGGGGA GATGAGGACG ATCAGGACGA ACAGGGCGAC CGGGGCGACC AACAAGCGAT CCCGATCACT GGCGAGGAGT TGCGAAAGAC GTTCCCGCGC GCAGGGGCTC GCGAGTCCGG TGACATCTCC GTCCTCGAAC TCCCCGGCCG GGACGCAGCG ACCGTTTCGA ACGCCGACGT CCTCCGCCCG GAACTCGCCG AGCAGTTAGA CCACGACCTC TACGCTCACC AGGCCGAGGC GCTCGAGGAA CTCGAGTGTG GGCAGAACGT CTGCGTCGCG ACGAGCACTG CCTCGGGGAA GACACGAATC TATGCGCTCC AGATCGCGAG ACACTATCTG GATGCGCGCG AGCGGGCCGC AGAGACAGGC GGGAAAAAGG AAAGACAGAA CGGAGGCGAG AAGGCGGGTG CACGTGCAGC GTCGGCTCCC ACGCCGACCG CCTACATCTG CTACCCCACG AAAGCTCTCT CGCGCGACCA GGAGCGCGAA CTGAACGACT TCTACGACGA TCTCGGTCTC GATATCACCG TCCGCGTCTA CGACGGCGAC ACCGAACGCG GGGAGACGCG CCGCCAGATC CGCGAGGAAG CGGACGTCAT CATCACCAAC TTCGCCGGCG TGAACACCTA CTTGCACGAC CACGACCGCT GGGCACGCTT CTTCTCCGCC TGCGACCTCG TGGTCATCGA CGAATCCCAC ACCTACACCG GCGTCCACGG CATGCACGTC GCCTGGATTA TCCGCCGACT GAAGCGCGTG CTCGCCTACT ACGACGCCGA TCCGCAGTTC GTCCTCACCA GCGCAACGAT CGGAAATCCG GGCGCACACT CGAGTACCCT GATCGACGAC CCGGTAACGG TGGTCGACCA GGACGGTTCC CCGACGGGGC CGCGGGAACT GGTGCTGTGG AATCCGCCGC CGCAGTCGAG CGAGAAAGAC GGGGACAGAG CCGGTGCCGA AGACGCCGTC ACCGACCGCG TCCCTGCCAC CGTCGAAGCA CCGCGGCTCC TCTCGCATCT GACCTACCAC GACGCCCAGA CGCTGCTGTT TACGCCCTCC CGAAAGCTCG CCGAACTCTC GGTCAAGCGC GCAGCGAAAC ACCGCCGCGA CCGGTCGCGA TACTACACGA ACCCGGACCG GGGCAGCGCC ATCGAACCCT ACCACGCGGG CCACTCGCGG CGGAATCGAC ACGGCACGGA ACACCAGCTC AAGACCGGCG TGCTCGACGG CGTCGCCTCG ACGAATGCAC TCGAGTTGGG GATCAACGTT GGCAAGATGG ACGCCACCGT CCAGTTGGGC TACCCCGGCC AGCGCCAGTC GTTCTGGCAG CAGATCGGTC GCGCTGGCCG TGGGGCAAAC CGCGCGCTCT CGGTGCTCGT CGCGGGACAC CGTACCCTCG ATCAGTACGT CGTGAGTAAT CCGGACTACC TCCTCGAGAA CGACGTCGAG GACGCCGTCG TCGACACCGA GAACGACGCG GTCTTCGCAC AGCACCTGCT CTGTGCGGCG GCCGAACTCG CGCTCGACGA GCGCGACGCC GGTGCAGACG GACTCGCCGA CCGTGAGCGA CTCGATCGGG CGATCGAGAT GTGGCGACGC GCAGGGAAGC TGACGGGCCA CCTCGAAACG GGCGTCTCCT ACACCGGTCC GCCCAGACCG CAGGGATCGA TCTCGCTGTA CGCCACGACC GGCGAGGAGT ATACGGTCGA ACTCGCTGAC GGCGTCGACG AGCGCCACGA CCCCGAGATG GAACCGCTCG CCGAGGAGCG CGTGCTGCGG GACTTCCACG AGGGCGCGGT CAGGCTCCAC GAGGGCCAGC AGTACGAGGT CTGTGCGGTC GATCACACGA CGCCGCGGCC CTCGGTGACG CTGCGCCCGA CGGATGTAGC GTACCACACA CGGACGCGCA CGGACGTGAC GGTTCTCGAC GCCGTCTCAG AGGAGTCGCG CGAGATTGGG TCCTTTACGC TCCACTTCGG CCGCGGGCGA GTGCTCGTCT ACCACGACAC CTACGACGAG GTCGCGATCC ACGGCGGCAA AAAGAAAGCA CAGCAACTCC CCACCGGAAA CCCGCCGCTC TCGATGGAGA CGCAACTGTG CTGGCTCGAA GTCCCCGAGC ACGTCGAGTC CGCGCTGGTC GAGCGGTACC GAGACTTTTC CGTGCCGGGA CTGGACAGCG ACCTCGCAGA TACCGCCCAT CTCGGCTACG CGGGCGGCCT CCACGCCGCC GAGCACGCGA CGATCGGTGT CGCCCCACTG GAACTGATGG TCGACAAGCG CGACCTCGGC GGACTGGCGA CGCTGTCGAT CGACTCGCAT CTGGCCAGCG CCGCATCGGA CGGGAACGGT GACGACGAGA CGAGCGTCGG CGAAGATCCC GCCCCGCAGA ACATCGCCGC TGCCGAGGCC ACCGTCAGGG AACTCGCGAT GGGCCTCGAC CGCAAACCCG CGAGCGGCTG GTTCATCTAC GACGGAATCG ACGGCGGACT CGGTTTCTCG CGGGCGATCT ACGAGAACTT CGAGGCCGTC GCTCGGCGGG CGCGCGCCCT GATCGAAACC TGTGACTGCG GCCGCATCGA CGGCTGTCCC GCTTGCGTGA TGGACGACCA GTGCGGGAAC GACAACCAAC CACTACACCG CGAGGCGGCC GTCGACGTGT TAGACCTCCT CCTAGCGAGT GCGAGTGGGG ATGGTAAAAA GAGTGTACTC GAGTCCGATC TGCTGGTAGA CGACGGTGAG AGCGGAGATG GGGAGAGAGG TAGAGATCGC GACAAGCATG ACACTCGTGA GACCGATGAC AGACGGCCGC CGCTATTCTA CGCCTGA
|
Protein sequence | MATNEDGDGG DEDDQDEQGD RGDQQAIPIT GEELRKTFPR AGARESGDIS VLELPGRDAA TVSNADVLRP ELAEQLDHDL YAHQAEALEE LECGQNVCVA TSTASGKTRI YALQIARHYL DARERAAETG GKKERQNGGE KAGARAASAP TPTAYICYPT KALSRDQERE LNDFYDDLGL DITVRVYDGD TERGETRRQI REEADVIITN FAGVNTYLHD HDRWARFFSA CDLVVIDESH TYTGVHGMHV AWIIRRLKRV LAYYDADPQF VLTSATIGNP GAHSSTLIDD PVTVVDQDGS PTGPRELVLW NPPPQSSEKD GDRAGAEDAV TDRVPATVEA PRLLSHLTYH DAQTLLFTPS RKLAELSVKR AAKHRRDRSR YYTNPDRGSA IEPYHAGHSR RNRHGTEHQL KTGVLDGVAS TNALELGINV GKMDATVQLG YPGQRQSFWQ QIGRAGRGAN RALSVLVAGH RTLDQYVVSN PDYLLENDVE DAVVDTENDA VFAQHLLCAA AELALDERDA GADGLADRER LDRAIEMWRR AGKLTGHLET GVSYTGPPRP QGSISLYATT GEEYTVELAD GVDERHDPEM EPLAEERVLR DFHEGAVRLH EGQQYEVCAV DHTTPRPSVT LRPTDVAYHT RTRTDVTVLD AVSEESREIG SFTLHFGRGR VLVYHDTYDE VAIHGGKKKA QQLPTGNPPL SMETQLCWLE VPEHVESALV ERYRDFSVPG LDSDLADTAH LGYAGGLHAA EHATIGVAPL ELMVDKRDLG GLATLSIDSH LASAASDGNG DDETSVGEDP APQNIAAAEA TVRELAMGLD RKPASGWFIY DGIDGGLGFS RAIYENFEAV ARRARALIET CDCGRIDGCP ACVMDDQCGN DNQPLHREAA VDVLDLLLAS ASGDGKKSVL ESDLLVDDGE SGDGERGRDR DKHDTRETDD RRPPLFYA
|
| |