Gene Nmag_1938 details

Gene Information

Locus tag: Nmag_1938
Symbol:
ID: 8824779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 1971599
End bp: 1973533
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: hydrolase CocE/NonD family protein
Protein accession: YP_003480071
Protein GI: 289581605
COG category:
COG ID:
TIGRFAM ID:
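
The coordinates and lengths in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length = gene_length(1971599, 1973533)
print(length, protein_length(length))
```

With the values from this record, the computed lengths agree with the listed 1935 bp and 644 aa.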


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGGAG ATGACATACC ACACGATGGG GAGTCGAGCG AACACGGACT TACACGCCGT 
GAGGCACTCG GAACGTACGC CGCTGCCGGA ATCGGCGCGG CGATGGTGCC GGGAGCAGCC
ACCGCAACTG CGGCGACAGA CGCCGACGGC CCCGTCTTCG AGGACGGTCG CGCCCAGCCC
GTCTTCGACG AGGACGACGT CCTCCGCGAG GAGTTCTGGG TCGAAACGGA GACCGATACG
ACCAACACCG GCGACCTGGA CCGGATTCAC GTCGAAATTG CCAGGCCCGA ATCGACTGTA
GACAGCGACG TCGCGCTGCC GGTGATTATG GAACCCAGTC CGTACTTCGG TGGGCTGGAT
ATGAGCACGG CTGACCTCTA CGACGTCGAC GTTTCGCTGT ACGAACCAGA TAAACCGGGT
CGCGATACGC AACCGCGGAG CAACACGGCG ACCGAGCAGA CGATCGACAC CGACGATCTG
ACGGCGTTCA GCGGCAGTGC AACCGATTGG ATCGGGCCGA GCACCTACGA GGAGTACTTC
GTTCCCAGAG GATTCGTCTT CGCGTACGCC TCCTCGCGCG GCACGCACAA GTCGACCGGT
GCGAACACCT GTGGCGACGA ACACGAGGTG AACGGCATCA AAGCCGTCGT CGACTGGCTG
AACGGCCGCG CGACGGCCTA CGACTCCCGC TCCGGCGGCG ACCCAGTCGA GGCCGAGTGG
ACGACCGGAA AGACCGGCAT GATCGGCGCG TCGTACAACG GCACGCTCCC GAACGGCGTC
GCAGCGACCG GCGTCGACGG TCTCGAGGCC ATCGTCCCCG AAGTCGCAAT CTCGAGCTGG
TATGACTACT TCAGAGCGAA CGGCCACGTC GTCGCACCCG GCGGCTGGCA GGGCGAGGAT
GCCTACCAGC TCGCCGCCTG GGTCACGACC CGGGAGGATC GGGAGGTCGC CGAACCGATC
CTCGAGCAGA TCGAAGCCGA CCAGGGCCGC GAGACTGGCA ACTACAACGA GTTCTGGGAC
GCCCGCAACT ACGTCCACGA TGCCGACAAC GTCGAGGCCG CCGTCCTCAT TACCCACGGG
CTCAACGACG ACAACGTCAA AACCAAGCAG TTCGCCCAGT GGTACGACGC ACTGCGAGAC
GCCGACGTCT CGCGCAAAAT CTGGCTCCAC CAGGGCGGGC ACTCGAGTCC ACTTAGCCAC
CGGCCGGAGG AGTGGCTCGA CGAGTTGAAC CTGTGGTGGA CGCGCTGGCT CTTCGGCGTC
GAGAACGACG TGATGGATGG GCCGACGGCG ACGGTCCAGC GGGAGGACGA CTCCTGGACC
ACGTACGACG AGTGGCCGGT TCCGGGGACG AGCGAGGCCG AACTCAACTT CACGCCGGGC
GGCCGGACGT CGGGTGGGCT CACACTCGAG CACACGCGTG GTCGACCGGT CACCGAGACG
GTCGTCGACC CGGCGGAACC GGAAACGCCG GCGGATGACC TGATCGCGGC CGAGGAGTCA
GAACACAGGT TGCTGTATAC GACGGCGCAA CTCGAGGAGG ACGTACATCT GAGCGGCACG
GTCGAACTCG ACGTGCGACT CTCGTTTGAC TCGGAATCGG CGAACGTGAC GGGTGTACTG
GTCGACGTTG GGCCGGACGG GGAGACTGAA ATCATCAACC GGGGGTGGAT GAACCCCCAG
AACCGAAAGT CGGACTCGGA GACGTTCGCT ATCCATCCCG GCACGCCATA CCGGCTTTCG
TTCGACCTCC AGCCGGACGA CCACGTCTTC GCGCCGGACC ACCGGATCGG TATCGCCGTC
CTCTCGACGG ACTACGACTT CACGCAGCGG CCACCGGAGG AGAAAGAACT CACGCTCGAC
GTGAAACAGA GCGCTGCCCG ACTGCCGGTC GTCGGCGGTG CGGATGCGCT GAGTGACGCG
CTTTCGGACG ACTGA
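
The gene sequence is printed in space-separated blocks of ten bases, so whitespace has to be removed before any computation. As a minimal sketch, the helper below strips the block formatting and computes GC content as a percentage; the result for the full sequence can be compared against the GC content listed in the record. The function names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence above, as a small usage example.
print(gc_percent("ATGTCTGGAG ATGACATACC"))
```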
 
Protein sequence
MSGDDIPHDG ESSEHGLTRR EALGTYAAAG IGAAMVPGAA TATAATDADG PVFEDGRAQP 
VFDEDDVLRE EFWVETETDT TNTGDLDRIH VEIARPESTV DSDVALPVIM EPSPYFGGLD
MSTADLYDVD VSLYEPDKPG RDTQPRSNTA TEQTIDTDDL TAFSGSATDW IGPSTYEEYF
VPRGFVFAYA SSRGTHKSTG ANTCGDEHEV NGIKAVVDWL NGRATAYDSR SGGDPVEAEW
TTGKTGMIGA SYNGTLPNGV AATGVDGLEA IVPEVAISSW YDYFRANGHV VAPGGWQGED
AYQLAAWVTT REDREVAEPI LEQIEADQGR ETGNYNEFWD ARNYVHDADN VEAAVLITHG
LNDDNVKTKQ FAQWYDALRD ADVSRKIWLH QGGHSSPLSH RPEEWLDELN LWWTRWLFGV
ENDVMDGPTA TVQREDDSWT TYDEWPVPGT SEAELNFTPG GRTSGGLTLE HTRGRPVTET
VVDPAEPETP ADDLIAAEES EHRLLYTTAQ LEEDVHLSGT VELDVRLSFD SESANVTGVL
VDVGPDGETE IINRGWMNPQ NRKSDSETFA IHPGTPYRLS FDLQPDDHVF APDHRIGIAV
LSTDYDFTQR PPEEKELTLD VKQSAARLPV VGGADALSDA LSDD
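
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGTCTGGAG ATGACATACC ACACGATGGG"))
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the protein sequence listed above.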