Gene Information |
Locus tag | Nmag_1938 |
Symbol | |
ID | 8824779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1971599 |
End bp | 1973533 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | hydrolase CocE/NonD family protein |
Protein accession | YP_003480071 |
Protein GI | 289581605 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
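The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and assumes the CDS length includes one terminal stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 1971599
END_BP = 1973533
GENE_LENGTH_BP = 1935
PROTEIN_LENGTH_AA = 644


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions the record is internally consistent: 1973533 - 1971599 + 1 = 1935 bp, and 1935 / 3 - 1 = 644 aa.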
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGGAG ATGACATACC ACACGATGGG GAGTCGAGCG AACACGGACT TACACGCCGT GAGGCACTCG GAACGTACGC CGCTGCCGGA ATCGGCGCGG CGATGGTGCC GGGAGCAGCC ACCGCAACTG CGGCGACAGA CGCCGACGGC CCCGTCTTCG AGGACGGTCG CGCCCAGCCC GTCTTCGACG AGGACGACGT CCTCCGCGAG GAGTTCTGGG TCGAAACGGA GACCGATACG ACCAACACCG GCGACCTGGA CCGGATTCAC GTCGAAATTG CCAGGCCCGA ATCGACTGTA GACAGCGACG TCGCGCTGCC GGTGATTATG GAACCCAGTC CGTACTTCGG TGGGCTGGAT ATGAGCACGG CTGACCTCTA CGACGTCGAC GTTTCGCTGT ACGAACCAGA TAAACCGGGT CGCGATACGC AACCGCGGAG CAACACGGCG ACCGAGCAGA CGATCGACAC CGACGATCTG ACGGCGTTCA GCGGCAGTGC AACCGATTGG ATCGGGCCGA GCACCTACGA GGAGTACTTC GTTCCCAGAG GATTCGTCTT CGCGTACGCC TCCTCGCGCG GCACGCACAA GTCGACCGGT GCGAACACCT GTGGCGACGA ACACGAGGTG AACGGCATCA AAGCCGTCGT CGACTGGCTG AACGGCCGCG CGACGGCCTA CGACTCCCGC TCCGGCGGCG ACCCAGTCGA GGCCGAGTGG ACGACCGGAA AGACCGGCAT GATCGGCGCG TCGTACAACG GCACGCTCCC GAACGGCGTC GCAGCGACCG GCGTCGACGG TCTCGAGGCC ATCGTCCCCG AAGTCGCAAT CTCGAGCTGG TATGACTACT TCAGAGCGAA CGGCCACGTC GTCGCACCCG GCGGCTGGCA GGGCGAGGAT GCCTACCAGC TCGCCGCCTG GGTCACGACC CGGGAGGATC GGGAGGTCGC CGAACCGATC CTCGAGCAGA TCGAAGCCGA CCAGGGCCGC GAGACTGGCA ACTACAACGA GTTCTGGGAC GCCCGCAACT ACGTCCACGA TGCCGACAAC GTCGAGGCCG CCGTCCTCAT TACCCACGGG CTCAACGACG ACAACGTCAA AACCAAGCAG TTCGCCCAGT GGTACGACGC ACTGCGAGAC GCCGACGTCT CGCGCAAAAT CTGGCTCCAC CAGGGCGGGC ACTCGAGTCC ACTTAGCCAC CGGCCGGAGG AGTGGCTCGA CGAGTTGAAC CTGTGGTGGA CGCGCTGGCT CTTCGGCGTC GAGAACGACG TGATGGATGG GCCGACGGCG ACGGTCCAGC GGGAGGACGA CTCCTGGACC ACGTACGACG AGTGGCCGGT TCCGGGGACG AGCGAGGCCG AACTCAACTT CACGCCGGGC GGCCGGACGT CGGGTGGGCT CACACTCGAG CACACGCGTG GTCGACCGGT CACCGAGACG GTCGTCGACC CGGCGGAACC GGAAACGCCG GCGGATGACC TGATCGCGGC CGAGGAGTCA GAACACAGGT TGCTGTATAC GACGGCGCAA CTCGAGGAGG ACGTACATCT GAGCGGCACG GTCGAACTCG ACGTGCGACT CTCGTTTGAC TCGGAATCGG CGAACGTGAC GGGTGTACTG GTCGACGTTG GGCCGGACGG GGAGACTGAA ATCATCAACC GGGGGTGGAT GAACCCCCAG AACCGAAAGT CGGACTCGGA GACGTTCGCT ATCCATCCCG GCACGCCATA CCGGCTTTCG TTCGACCTCC AGCCGGACGA CCACGTCTTC GCGCCGGACC ACCGGATCGG TATCGCCGTC 
CTCTCGACGG ACTACGACTT CACGCAGCGG CCACCGGAGG AGAAAGAACT CACGCTCGAC GTGAAACAGA GCGCTGCCCG ACTGCCGGTC GTCGGCGGTG CGGATGCGCT GAGTGACGCG CTTTCGGACG ACTGA
|
Protein sequence | MSGDDIPHDG ESSEHGLTRR EALGTYAAAG IGAAMVPGAA TATAATDADG PVFEDGRAQP VFDEDDVLRE EFWVETETDT TNTGDLDRIH VEIARPESTV DSDVALPVIM EPSPYFGGLD MSTADLYDVD VSLYEPDKPG RDTQPRSNTA TEQTIDTDDL TAFSGSATDW IGPSTYEEYF VPRGFVFAYA SSRGTHKSTG ANTCGDEHEV NGIKAVVDWL NGRATAYDSR SGGDPVEAEW TTGKTGMIGA SYNGTLPNGV AATGVDGLEA IVPEVAISSW YDYFRANGHV VAPGGWQGED AYQLAAWVTT REDREVAEPI LEQIEADQGR ETGNYNEFWD ARNYVHDADN VEAAVLITHG LNDDNVKTKQ FAQWYDALRD ADVSRKIWLH QGGHSSPLSH RPEEWLDELN LWWTRWLFGV ENDVMDGPTA TVQREDDSWT TYDEWPVPGT SEAELNFTPG GRTSGGLTLE HTRGRPVTET VVDPAEPETP ADDLIAAEES EHRLLYTTAQ LEEDVHLSGT VELDVRLSFD SESANVTGVL VDVGPDGETE IINRGWMNPQ NRKSDSETFA IHPGTPYRLS FDLQPDDHVF APDHRIGIAV LSTDYDFTQR PPEEKELTLD VKQSAARLPV VGGADALSDA LSDD
|
| |
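The sequence fields above are printed in space-separated blocks of ten characters. A minimal sketch for checking them follows: it strips the whitespace, computes the GC fraction, and translates the CDS. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1; alternative start codons are not handled, which is an acceptable simplification here because this gene starts with ATG.

```python
# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGTCTGGAG ATGACATACC ACACGATGGG"))  # MSGDDIPHDG
```

Passing the full Gene sequence value to `translate` and comparing the result with the cleaned Protein sequence value, or passing it to `gc_fraction` and comparing with the GC content field, gives a quick integrity check on a copied record.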