Gene Nmag_2331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2331 
Symbol 
ID8825183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2384244 
End bp2385989 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content61% 
IMG OID 
ProductAldehyde Dehydrogenase 
Protein accessionYP_003480455 
Protein GI289581989 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.222051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGAAC CCAGACTGGA ACCCGAATCT GGTGCCGATA CCGATGTCGA TGCTGATACC 
GACGCCGATG CCGCTACCGA TGTCGATGCT GATACCGACG CCGATGCCGC TACCGATGTC
GATGCCGATG CCGATGCAAA CCTGGGCACC CACCAGCTTC GAGATGCTGC CACCAGCAGC
GTTTTTACGT CGCTCGTTGT CCACTCGAGT ATGAGCCAGC AAGCAACAGA GCAGGTGAAC
CAACACTACA TCGGTGGTGA GTGGACCGAC GGCAGCGGCT CGGAAACGTT CGAAAGCGAG
AACCCGGCGA CAGGGGAGGC GCTGGCAGTG TTCCAGCGCG GTACGGAAGA CGATGTCGAC
GCTGCGTTGG CTGCAGCGAA CGACGCGTTC GAGGAGTGGC GAGAACTCTC CCACATCGAC
CGCGCGGAGT ACCTCTGGGA CATCTACCAC GAACTGCGCG AGCGCCACGA CGAGTTGGGT
GAGATCATCT CCAAGGAGTG TGGCAAGGAG ATCTCCGAAG GGAAAGCAGA CGTCGCTGAG
GCCTGGCACA TGGTCGAGTG GGCGGCAGGC AACGCGCGCC ACCCACACGG GGACGTGGTA
CCGAGCGAAG TCGGCTCGAA AGATGCGTAC ATGCGCCGGA AACCTCGTGG CGTTATCGGC
TGCATCACGC CGTGGAACTT CCCGGTTGCA ATTCCGTTCT GGCACATGGC CATTGCCCTC
GTCGAGGGCA ACACGGTCGT CTGGAAGCCA GCCGAACAAA CGCCATGGTG TGGCCAGATC
ATCGCCGAGA TGTTCGAGGA CGCCGGCATT CCGGACGGTG TCTTCAACAT GGTTCAGGGC
TTCGGTGACG CCGGTGCGGC CATCACGGAC GACGAGCGCG TCAACACAGT CCTGTTCACC
GGCTCTGCCG AGGTTGGCCA GGAAATCGCC AGCAAGGTCG GCGGCGAACC CGGCAAGCTC
GCAGCCTGTG AGATGGGCGG CAAGAACGGC ATCATCGTCA CCGAGGAAGC CGACCTCGAC
ATCGCCGTTC ACTCGGCGAT CATGTCAAGC TTCAAGACGA CCGGCCAGCG CTGTGTCTCG
AGCGAGCGCC TGATCGTCCA CACCGACGTC TACGACGAGT TCAAGGAGCG CTACGTCGAC
ATTGCCAAGG ATGTCGCCGT CGGCGACCCA CTGCAGGAAA ACACCTTCAT GGGGCCGGCC
ATCGAGCCAG AGCACGTCGA GAAGATCAAG AAGCACAACG AGTTGGCAGA ACAGGAAGGC
GCAGACGTGC TCGTCGACCG CTTCGAACTC GAGGACGACG AGATTCCTGA GGGCCACGAG
GACGGCAACT GGGTCGGCCC GTTCGTCTAC GAAATCGAGT ACGATACCGA CCTGCGCTGT
CTGAAAGAGG AGTGTTTCGG CCCACACGTC GCACTACTCG AGTACGACGG CGACATCGAG
GACGCAGTCG AAATCCACAA CGACACGCCG TACGGGCTTG CCGGCGCCAT TATCTCCGAG
GACTACCGCC AGATCAACTA CTTCCGCGAC CGCGCGGAGA TCGGGCTTGC GTACGCGAAT
CTACCGTGTA TCGGCGCAGA GGTCCAGCTT CCGTTCGGCG GCGTGAAGAA GTCCGGAAAC
GGCTACCCGT CCGCGCGTGA GGCCATCGAG GCCGTTACCG AGCGGACTGC GTGGACGATG
AACAACTCCA AAGAAATCGA GATGGCACAG GGCCTCTCGG CTGACATCGT CACTCGAGAC
GACTGA
 
Protein sequence
MLEPRLEPES GADTDVDADT DADAATDVDA DTDADAATDV DADADANLGT HQLRDAATSS 
VFTSLVVHSS MSQQATEQVN QHYIGGEWTD GSGSETFESE NPATGEALAV FQRGTEDDVD
AALAAANDAF EEWRELSHID RAEYLWDIYH ELRERHDELG EIISKECGKE ISEGKADVAE
AWHMVEWAAG NARHPHGDVV PSEVGSKDAY MRRKPRGVIG CITPWNFPVA IPFWHMAIAL
VEGNTVVWKP AEQTPWCGQI IAEMFEDAGI PDGVFNMVQG FGDAGAAITD DERVNTVLFT
GSAEVGQEIA SKVGGEPGKL AACEMGGKNG IIVTEEADLD IAVHSAIMSS FKTTGQRCVS
SERLIVHTDV YDEFKERYVD IAKDVAVGDP LQENTFMGPA IEPEHVEKIK KHNELAEQEG
ADVLVDRFEL EDDEIPEGHE DGNWVGPFVY EIEYDTDLRC LKEECFGPHV ALLEYDGDIE
DAVEIHNDTP YGLAGAIISE DYRQINYFRD RAEIGLAYAN LPCIGAEVQL PFGGVKKSGN
GYPSAREAIE AVTERTAWTM NNSKEIEMAQ GLSADIVTRD D