Gene Nmag_1149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1149 
Symbol 
ID8823980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1169578 
End bp1171725 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content68% 
IMG OID 
Productglucan 14-alpha-glucosidase 
Protein accessionYP_003479295 
Protein GI289580829 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTC ACAGCGCCCT GAACGACGTG AAACGCTCGT GGGGTGACGA CCGCCGATTC 
CCCGGCGAGC GCCGGTCGAC GACGGGCCGC TTCTCCGGCT TCGACGACAG GCTCGTCCAC
GTCGCACCGA ACGGTGCACT CAGGGACTAC TCCTACCCGC TCTCGGGACT GGTCGGTATC
GACCATTCCC GGTTCGGTCT CGAACTCGAC GACACGCTCT ACTGGCTCGA TAAACCGGGC
GACCAGCGCT ACGTCGACGA CACAGCGCTC GTGGAGACGG CACACGAGGT CGCCGGCCAC
ACGCTCACGC AGTACGATCT CACGCTCGGC CGCCTCCACC TCACCCACTT CTCCCTCTCT
ACCGACGAGC ACGACGACAC CGACCCGGAC GCGACTCTCC ACGCCTGCGT CGCCTTCGCC
CCCGAAGACC GCGTCAGCCG CATCGGCCAG TTGCTCCACG GTGACGCCGT CGAAGTCCAC
CACGACCGCG AACACGACTT CCTGACCGCC GCGACCGACA TCGAGGTCAC CGGCCAGATC
CCCGCGACAT TTGCGGAGCT ACTCGAGTCC GAACCAACGG AACTCCCCCG TGGCGAGGAC
GATGGCCGGT ACGAGGAGGC ACGACTGAGC CCGATCACGC TGGCCGAAGT CGACCTTTCG
GAATCGGACG CCTCGACAAC GGTGGCGACG TTGCTCGCCG ATTCGGAGGC AGACGAGGAG
TCCCGAACTG GTGCGCTCAA ACGCGCCCGC GCTGGCGCTG CGGAGCACGC GACTCGTGAC
GGACTCCTCG AGGCAGCCCG AGCGCAGGCC GAAACCGCGT TCGAGGGCGT TCGAGCGACT
GCAGACGACG GCGAGACTGT CGATATCGCC GATCCGATCG CCGACTTGCG AACGCTGCGC
CTGCTTCGCG CGCCAACCGG GGCGCGCATC GCCGGTCCCG AGTTCGACCC GTTCTACCGC
TACTCGGGCG GCTACGGCTA CACGTGGTTC CGAGACGACG CCGAAATCGC CGGCTTCCTG
CTCGCGGCCG ACCGTCGCGC CGGACTCGGG CTCGAACAGT GGCACCGCCA GAGCGCACAC
TTCTACACCA CGAGTCAGCT CGCAGACGGC ACCTGGCCTC ACCGCGTCTG GCCGCGAAAC
GGACGGCTCG CTCCCGGCTG GGCACACGGC CGCGTCGAGG AGGCCGCTGA CTCGACGGAC
TACCAGGCCG ACCAGACTGC GAGCGTCGCC GCCTATCTCG CGACGTACCT TCGCACCGTC
GACGCCGACG ACGAGCAGGT CCGGGAGGCG CTCGTCGCCG CACTCGACGG CCTCGACGCG
ACGCTCGCCG ACGACGGCCT GCCGGAGCGC GTCCAGAACG CCTGGGAGAA CATGACCGGG
CGATTCACTC ACACCGCCGC GACGTTCCTC GAGGCCTACG CGGCGATCGC TCGCGCTCCC
ATCGCGGACG AGCACCGCGA ACGTGCACGC GAACGGGCTC GCACCGTCTA CAGCGCGCTC
GACGACCTCT GGGTCGCCGA CCGCGGCTGC TACGCGCTCC GACTCGACGA TGGAGTACTG
GACGAGCGCC TCGACGGGAG CACCTTCGCG CTCGCTGCAG CCCATCGTGA GTTCGACGTA
CTCGAGTCTG AGCAATCAGC CGCTGACGGA ACTGGCGAGA ACGGCGGCGA CGGCGTCGGG
GTCGACGCCG ATCGCCTCGA CCGACTGGTG ACACACATCG AGACGACCAT CGACGGGCTC
TACCGCGATC CCGAGGGGGC ACTCGAGGGC GTCGCTCGCT TCGAGGACGA TCCGTGGCGT
GTCGACGATC AGGCTGACGC CAAGATCTGG TCCGTGACGA CGGCGTGGGG AGCACACGCC
GCGGCCGAGA TGGGATCGCT GCTCGCCGCA CACGACCACG AGACCGCCTC GCGTTTCGAC
GAGCGAGCGC GTGAATTGCT CGCGCTCGTT GCGCCCGGTG GGAAACTCCG TCGCGACGGG
GAGTATCTCC CCGAGCAGTT CTTCGACGAC GGAACGGCCG ACAGTGCGAC GCCGCTGGGC
TGGCCGCACG CGTTGCGGCT CGCAACGGCG GCGGAGCTGG CGACGGGTGC GCAGGAAGGA
ACAGCAGCGC CGGTAGTGGA CTCAGATCGC GCGCCGGTTC AGGACTGA
 
Protein sequence
MKLHSALNDV KRSWGDDRRF PGERRSTTGR FSGFDDRLVH VAPNGALRDY SYPLSGLVGI 
DHSRFGLELD DTLYWLDKPG DQRYVDDTAL VETAHEVAGH TLTQYDLTLG RLHLTHFSLS
TDEHDDTDPD ATLHACVAFA PEDRVSRIGQ LLHGDAVEVH HDREHDFLTA ATDIEVTGQI
PATFAELLES EPTELPRGED DGRYEEARLS PITLAEVDLS ESDASTTVAT LLADSEADEE
SRTGALKRAR AGAAEHATRD GLLEAARAQA ETAFEGVRAT ADDGETVDIA DPIADLRTLR
LLRAPTGARI AGPEFDPFYR YSGGYGYTWF RDDAEIAGFL LAADRRAGLG LEQWHRQSAH
FYTTSQLADG TWPHRVWPRN GRLAPGWAHG RVEEAADSTD YQADQTASVA AYLATYLRTV
DADDEQVREA LVAALDGLDA TLADDGLPER VQNAWENMTG RFTHTAATFL EAYAAIARAP
IADEHRERAR ERARTVYSAL DDLWVADRGC YALRLDDGVL DERLDGSTFA LAAAHREFDV
LESEQSAADG TGENGGDGVG VDADRLDRLV THIETTIDGL YRDPEGALEG VARFEDDPWR
VDDQADAKIW SVTTAWGAHA AAEMGSLLAA HDHETASRFD ERARELLALV APGGKLRRDG
EYLPEQFFDD GTADSATPLG WPHALRLATA AELATGAQEG TAAPVVDSDR APVQD