Gene Nmag_1995 details


Gene Information

Locus tag: Nmag_1995
Symbol:
ID: 8824837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 2026710
End bp: 2031155
Gene Length: 4446 bp
Protein Length: 1481 aa
Translation table: 11
GC content: 61%
IMG OID:
Product: DNA polymerase II, large subunit DP2
Protein accession: YP_003480128
Protein GI: 289581662
COG category:
COG ID:
TIGRFAM ID:
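
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2026710
end_bp = 2031155
gene_length_bp = 4446
protein_length_aa = 1481

# Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be the stop.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```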


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGCGG AAGACGAACG ATACTTCGAG CAACTCGAGT CCCAGCTCGA GGAGGCGTTC 
GACGTCGCCG AGCGAGCCAA AGAGCGCGGC GGCGACCCAG AACCCGACGT CGAGATTCCG
ACTGCGCGGG ACATGGCCGA CCGCGTCGAG AACATCCTGG GGATCGACGG CGTCGCCGAG
CGCGTCAGGG AACTCGAGGG CGAGATGTCT CGCGAAGAGG CAGCCCTCGA ACTCGCGGAG
GATTTCGCAG AAGGGCGCGT CGGTGACTAC GAGACGAAAG CCGGCAAGGT CGAGGGCGCG
GTTCGAACGG CGGTTGCCCT GCTTACGGAG GGTGTCGTCG CAGCACCGAT CGAGGGGATC
GACCGGGTCG AGATTCTGAC GAACGACGAC GGGACGGAGT TCGTCAACGT CTACTACGCC
GGCCCGATCC GGTCTGCGGG TGGGACGGCA CAGGCACTCT CCGTGCTGGT CGCAGACTAC
ACACGCGCGC TCGTCGGCAT CGAGCAGTAC GAAGCCAGAC AGGAGGAGGT CGAACGGTAC
GCGGAGGAAA TCTCCCTCTA CGACAAAGAG ACCGGCCTGC AGTACTGCCC GAAGGACAAG
GAGGCGAAAT TCATCGCCAA ACACCTCCCG ATCATGCTGG ACGGAGAGGC GACCGGCGAC
GAGGAGGTCT CTGGCTTTCG CGACCTGGAG CGAGTCGACA CCAACAGCGC CCGCGGCGGG
ATGTGTCTCG TGCTTGGTGA GGGGATCGCG CTCAAAGCCC CCAAGATCCA GCGCTACACC
CGAAATCTGG ACGAGATCGA CTGGCCGTGG CTGCAGGATC TGATCGACGG TAATTACGCA
GCTGACGCAA GTGATGGAGA CGACACAGCT GACGCAGCTG GCTCAGATGA CGGAAGCAAC
GAAGCAGATA CGGACGAGAA CGCTGAGAAC GACGGCGATG ATACCGACGG TGACGAAACA
GAAGCCGACG AAAGCAGCGA CGAAGCCGCT GTCACCGACG AGACAGATAC CTCCGACAGC
GAACCGGTCG GCCCACCCCG GGCCGAACCG TCGAAGAAGT TCCTCCGGGA CCTGATCGCC
GGCCGCCCAG TCTTCACGCA CCCCAGTTCG CCGGGTGGTT TCCGACTCCG GTACGGCCGC
GCGCGAAACC ACGGCTTCGC GACCGGCGGC ATCCACCCCG CGACGATGCA CCTCGTCGAC
GACTTCCTCG CGACGGGGAC CCAGATCAAG ACGGAACGAC CCGGAAAGGC CCACGGGATC
ATCCCCGTCG ACAGTATCGA CGGCCCCACG GTCAAACTCG CAAACGGCGA CGTTCGCCGA
ATCGACGACC CCGAAGAGGC CAAAGAGATC AGAAACGGCG TCGAGAAAAT CCTCGATACC
GGCGAGTATC TGGTCAACTA CGGCGAGTTC GTCGAGAACA ACCATCCGCT CGCCCCCGCC
TCCTACGTCT ACGAGTGGTG GATCCAGGAC ATGGCGACCG CCGGCGCGGA CGTGCAAGCC
CTCGAGGACG ACCCCCGAAT TGACCTCGAG TTTCCCGACC CAGCGGAGGC ACTCGAGTGG
GCCACGGAAT ACGACGCACC GTTGCACCCC CAGTACACCT ACCTCTGGCA CGACATCTCG
GTCGAGTCGT TTTGCACCCT TGCCGACGCT GTCGCGGAGG GGTGGGTCGA GGCTGAACCC
AGGGAGGGTG CAGGTACGGG AGCGAACGAT GACACCCTGG TACTCGAGTG CACCGAGTCA
GTGCAGGAGG CACTCGAGAC GCTCATCCTC GAACACCGCC AGCGACCGGA CGAGAATCGA
ATCGAGATCG ACGACTGGCG GCCGTTCGCT CGAACGCTCG GCTGTGAGCC ACGGCAGGCG
GTTGCGGACG GTGCGGCGCT GGATTCTGAT GTGGGGGATG CAGATTCGGA CCAGGCCTCG
GACTCGAACT CGAACTCGAA CTCGGACTCG AACCCGAACG CAGCCCACGA CAACACGCTC
CCCATCGAAC TCGAGCGCAC CTGGGACGAC GACGACCTCT CGGAACGCGC CCGCACCTGG
GGCCACGAGG ACGAAGCCGA CGGCGCCAAC GCGATCGAGG CAGTCAACGA GGTTGCCCCC
TTCGAGGTTC GCGAACGCGC ACCCACACGA ATCGGGAACC GGATGGGTCG TCCGGAGAAA
TCCGAGCGCC GCGACCTCAG CCCACCGGTC CACACCCTGT TCCCCATCGG CGAGGCCGGC
GGCGCACAGC GCAACGTTGC CGACGCGGCT AAACACGCCG AAACCATGTC GGACACGCCC
GGCATCGTCG AGATCCAGAT CGGCCGCCAG CAGTGTCCCA GCTGTGGGAC GGAGACGTTC
AAAAACCGCT GTCCCGACTG CGAAACGCGA ACCGAACCCG ACTACCGCTG TCCCGACTGC
GACCAGTCGC TCGAGCCAGA CGATTCCGGC CGCGTCGAAT GCGGCCGCTG TGAGATCGAA
GGCACCTGTG TCGAACCCCG CGAGATCGAC ATCAACGACG AATTCCGGAG TGCACTCGAG
TCCGTCGGCG AGCGAGAGAA CGCCTTCGAG ATCCTGAAGG GTGTCAAGGG GCTGACCTCG
CAGAACAAGA TTCCCGAGCC GATCGAGAAG GGGATCTTGC GCGCCAAACA CGACGTCTCG
GCGTTCAAGG ACGGGACTGT CCGCTACGAC ATGACGGACT TGCCAGTTAC GTCCGTCCGT
GCAAACGAAC TCGATATCGA CGTCGGGCAG CTACAGGCGC TCGGCTACGA GGAGGATATC
CACGGTGACC CGCTCACCCA CGAGGATCAG TTGGTCGAAC TCAACGTACA GGACATCGTC
CTCTCGGACG GCGCAGCCGA ACACATGATG CAGACGGCCG ACTTCATCGA CGACCTGCTG
GAGCAGTACT ACGGACTCGA GCCGTTCTAC GAGATCGATG ACCGCCAGGA ACTCGTCGGC
GAGTTGGTGT TCGGGATGGC ACCCCACACG AGTGCTGCAA CTGTCGGTCG GGTGATTGGT
TTCACGAGCG CAGCGGTCGG ATACGCTCAT CCGTACTTTC ACGCCGCGAA ACGCCGGAAC
TGCTTCCATC CCGAGACGGA GATCGTTTAT CGTGAAGGGG AGTCGCGGTC TATTTACGAC
AAGCAGCAGC TCGAATCCGA TGACGGGTTG CCACCCGAGG AACGTCCACA ACGGGACAGT
ATTCGCACCT TCGTCGAGGA TCGACTCGAG AACCCGGAGC AAGACGATTT CGGTACGGAA
TATCAGGAGC TCGATGACGA TGTCTGGGTA CTCTCGTACA CAGGTGTGTG CTGTCCGAAA
CGCGTGACGA CCGTCTCGAA GCATCCGGCA CCGGATCATC TTCTCTCGAT CACGACCGAA
AGCGGCCGCA AACTCCGAGT GACACCGGAT CACACGATGC TTCGGTTCGA CGAAGATTCC
TCGGATCGCC TCTCACAAGA ATACTGGAAG GCAGAGACGG TATCGGCTCA GGAACTCTCC
CCTGGTGACG AACTTCCAAC GCCGTCGCCT GTCGGGGAGC GTACAGTCGA TGACTTCCTG
TTCGGTCCCG TCAAGGAGAT CAACGACTTC CCGCCTGCTG TCACAGGCTG GGATGTGATC
TCTGATATCG AAATTATTGA ATCGGATACT GATTACGTAT ATTGTCTGGA AGTAGAGAGT
TCGAACACGC TCTCAGCGAA TGGAATAATC ACAGGACAGT GTGATGGCGA CGAAGACTGT
GTAATGTTAT TGCTCGACGG ACTATTGAAT TTCAGTAAAT CATTCTTGCC TGATAAGAGG
GGTGGTAAGA TGGACGCACC CCTGGTCATG TCCTCCCGGA TCGATCCCTC CGAAATCGAC
GACGAGGCCC ACAACATGGA CATCGTCTCC CAGTACCCTC GCGAGTTTTA CCTCGCAACC
CGCGAACAGG CCGATCCGGA GGAAGTCGAT ATCCAGATTG GCGAGGACAC GCTCGGAACC
GACGGCGAGT ACACCGGCTT CGAGCACACC CACGACACCA CCGACATCGC GATGGGACCG
GACCTCTCGG CGTACAAGAC GCTCGGCTCG ATGATGGACA AGATGGACGC CCAGCTCGAA
CTCTCGCGGA AACTCGCCGC CGTCGACGAA ACTGACGTGG CCGAGCGAGT CATCGAATAC
CACTTCCTTC CGGACCTCAT CGGAAACCTT CGCGCGTTCT CTCGGCAGGA AACCCGCTGT
CTCGACTGCG GCGAGAAGTT CCGACGGATG CCACTGACCG GCGACTGTCG GGAGTGTGGC
GGTCGCGTCA ATCTCACCGT CCACCAGGGA TCGGTCAACA AGTACATGCA GACGGCGATT
CAGGTCGCCG AGGAGTACGA CTGCCGGGAC TATACCAAGC AGCGACTGGA AGTACTCGAG
AAGTCACTCG AGAGTATCTT CGAGAACGAC AAGAACAAAC AGTCCGGTAT CGAGGACTTC
ATGTAA
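
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction like the one reported in the table. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the sample uses only the first and last lines of the sequence, so its GC fraction is not the gene-wide value.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that split the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence, used only as a short sample.
sample = """ATGCGCGCGG AAGACGAACG ATACTTCGAG CAACTCGAGT CCCAGCTCGA GGAGGCGTTC
ATGTAA"""

seq = join_blocks(sample)
print(len(seq))           # 66
print(seq[:3], seq[-3:])  # ATG TAA
print(round(gc_fraction(seq), 2))
```

Pasting the full sequence into `sample` would give a value to compare against the reported GC content of 61%.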
 
Protein sequence
MRAEDERYFE QLESQLEEAF DVAERAKERG GDPEPDVEIP TARDMADRVE NILGIDGVAE 
RVRELEGEMS REEAALELAE DFAEGRVGDY ETKAGKVEGA VRTAVALLTE GVVAAPIEGI
DRVEILTNDD GTEFVNVYYA GPIRSAGGTA QALSVLVADY TRALVGIEQY EARQEEVERY
AEEISLYDKE TGLQYCPKDK EAKFIAKHLP IMLDGEATGD EEVSGFRDLE RVDTNSARGG
MCLVLGEGIA LKAPKIQRYT RNLDEIDWPW LQDLIDGNYA ADASDGDDTA DAAGSDDGSN
EADTDENAEN DGDDTDGDET EADESSDEAA VTDETDTSDS EPVGPPRAEP SKKFLRDLIA
GRPVFTHPSS PGGFRLRYGR ARNHGFATGG IHPATMHLVD DFLATGTQIK TERPGKAHGI
IPVDSIDGPT VKLANGDVRR IDDPEEAKEI RNGVEKILDT GEYLVNYGEF VENNHPLAPA
SYVYEWWIQD MATAGADVQA LEDDPRIDLE FPDPAEALEW ATEYDAPLHP QYTYLWHDIS
VESFCTLADA VAEGWVEAEP REGAGTGAND DTLVLECTES VQEALETLIL EHRQRPDENR
IEIDDWRPFA RTLGCEPRQA VADGAALDSD VGDADSDQAS DSNSNSNSDS NPNAAHDNTL
PIELERTWDD DDLSERARTW GHEDEADGAN AIEAVNEVAP FEVRERAPTR IGNRMGRPEK
SERRDLSPPV HTLFPIGEAG GAQRNVADAA KHAETMSDTP GIVEIQIGRQ QCPSCGTETF
KNRCPDCETR TEPDYRCPDC DQSLEPDDSG RVECGRCEIE GTCVEPREID INDEFRSALE
SVGERENAFE ILKGVKGLTS QNKIPEPIEK GILRAKHDVS AFKDGTVRYD MTDLPVTSVR
ANELDIDVGQ LQALGYEEDI HGDPLTHEDQ LVELNVQDIV LSDGAAEHMM QTADFIDDLL
EQYYGLEPFY EIDDRQELVG ELVFGMAPHT SAATVGRVIG FTSAAVGYAH PYFHAAKRRN
CFHPETEIVY REGESRSIYD KQQLESDDGL PPEERPQRDS IRTFVEDRLE NPEQDDFGTE
YQELDDDVWV LSYTGVCCPK RVTTVSKHPA PDHLLSITTE SGRKLRVTPD HTMLRFDEDS
SDRLSQEYWK AETVSAQELS PGDELPTPSP VGERTVDDFL FGPVKEINDF PPAVTGWDVI
SDIEIIESDT DYVYCLEVES SNTLSANGII TGQCDGDEDC VMLLLDGLLN FSKSFLPDKR
GGKMDAPLVM SSRIDPSEID DEAHNMDIVS QYPREFYLAT REQADPEEVD IQIGEDTLGT
DGEYTGFEHT HDTTDIAMGP DLSAYKTLGS MMDKMDAQLE LSRKLAAVDE TDVAERVIEY
HFLPDLIGNL RAFSRQETRC LDCGEKFRRM PLTGDCRECG GRVNLTVHQG SVNKYMQTAI
QVAEEYDCRD YTKQRLEVLE KSLESIFEND KNKQSGIEDF M
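
As a quick consistency check between the two sequences, the first 30 bases of the gene can be translated and compared with the first 10 residues of the protein. The following is a sketch only: the codon table is deliberately partial and covers just the codons that occur in this prefix, and the function name `translate` is hypothetical. A full translation would need the complete table for translation table 11.

```python
# Partial codon table: only the codons found in the first 30 bases of the gene.
CODONS = {
    "ATG": "M", "CGC": "R", "GCG": "A", "GAA": "E", "GAC": "D",
    "CGA": "R", "TAC": "Y", "TTC": "F", "GAG": "E",
}


def translate(seq: str) -> str:
    """Translate complete codons in seq; raises KeyError for codons not in the partial table."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence and first block of the protein sequence.
gene_start = "ATGCGCGCGG AAGACGAACG ATACTTCGAG".replace(" ", "")
protein_start = "MRAEDERYFE"

print(translate(gene_start) == protein_start)  # True
```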