Gene Nmag_0401 details


Gene Information

Locus tag: Nmag_0401
Symbol:
ID: 8823223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 394725
End bp: 396062
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 63%
IMG OID:
Product: Hydroxymethylglutaryl-CoA synthase
Protein accession: YP_003478551
Protein GI: 289580085
COG category:
COG ID:
TIGRFAM ID:
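
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is an illustration in Python and is not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not translated. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 394725
end_bp = 396062

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon
# that does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # compare with the "Gene Length" field
print(protein_length_aa)  # compare with the "Protein Length" field
```

Under these assumptions the results agree with the listed Gene Length (1338 bp) and Protein Length (445 aa).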


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGCAG TCGGTATCGA TGCCATCGAG ATCTGGACCG GGAACCTCAA ACTCGACCTT 
CCCGGGACGT TCGCGCCGCA GAAGGGCGAA GACCCCGAAA AGTACACGAA AGGGCTCGGC
CTCAACGCGA GTTCGTTCCC CGACAGTTAC GAGGACATCG TCACCATGGG CGCAAACGCG
GCCCACCGCC TGATGGAGCG CAAGGGCCTC GAACCCGACG ATATCGGCCG TATCGACGTC
GCAACCGAGA GCTCGTTCGA CAACTCGAAG CCAGTTTCGA CGTACGTCGC TGGCTGCCTC
GAATCAGTCT ACGACGGCGA CTTCCACCAC GCGAACAAGG GCGAGCGCAA GTTCGCCTGC
ATCGCCGGCA CGCAGAGTCT GGACGACGCG TTCAACTGGA TCCGTGCGGG TCGCAACCGC
GGCCGCGGCG CGCTCGTCAT CGCCACTGAC ACCGCACTCT ACGCCCGCGG CGACGCCGGC
GAGGCAACCC AGGGCGCGGG TGCCGTCGCG ATGTACATCG ACGAAGACCC CGACCTGATC
GAACTCTCCG CCGAACAGGG CTACGGCTCG GCCGACGAAA CCGACTTCCT CAAACCGAAT
CAGCAGTTCC CCTCGGTCGA CGGCAAGCGC TCCGTGCAGG TCTACCTCGC ACGCATGCGT
GAAGCTCTGG AGGACTACGA GAGCGTCGCG GGCGACGTCC ATCCCGACGA TTTCGTGTTC
GCGCCGTTCC ACACGCCGTT CCCAGGTATG GTGCGCAAGG CAGCGATGCT CGCGTATCGC
CACGTTACGC GTGATACGGC GGTCGAAGAG GAACTCGCCG AAGAGATCGG TCGACAGCCC
CGTAGAGAGG CGTTCGACGA CGAAGAGGCG TTCCGCGATG CCGTTCGCGA GTACATGGAC
GCGCTCAAGG AGACCGACCG GTACCAGGAG TGGTACGCCG AGACGATCGA TCCCACACTG
GCGCTCTCGC GTGAGGTCGG CAACTGGTAC ACTGGTTCGG TTCACATCGC CCGCGCAAGC
GCGCTGAAGC AGGCCCTCGA GTCCGGTCGC GATCTGACGG GCGAGACGCT ACTGATCGGC
TCCTACGGGA GCGGTGCGCA GGCCGAGATT CACTCAGAAA TCGTCCAAGA CGGCTGGGAG
GAAGAAATCG AGGCGCTGAA CGTCGACGAG CAACTCGAGG CGCGCTACGA TATGGACTGG
GCGGATTACG AGCAGATCCA CGACGCGCAC AACCACGAGA TGGACATCGA CGTCGAGGAG
TTCACGACGC CCGAAGACGA GTTCGTCTTC GACGGCTGGG GTCGGATGGG CGAGCGGAAA
TACCGCTACG TCGAGTAA
 
Protein sequence
MTAVGIDAIE IWTGNLKLDL PGTFAPQKGE DPEKYTKGLG LNASSFPDSY EDIVTMGANA 
AHRLMERKGL EPDDIGRIDV ATESSFDNSK PVSTYVAGCL ESVYDGDFHH ANKGERKFAC
IAGTQSLDDA FNWIRAGRNR GRGALVIATD TALYARGDAG EATQGAGAVA MYIDEDPDLI
ELSAEQGYGS ADETDFLKPN QQFPSVDGKR SVQVYLARMR EALEDYESVA GDVHPDDFVF
APFHTPFPGM VRKAAMLAYR HVTRDTAVEE ELAEEIGRQP RREAFDDEEA FRDAVREYMD
ALKETDRYQE WYAETIDPTL ALSREVGNWY TGSVHIARAS ALKQALESGR DLTGETLLIG
SYGSGAQAEI HSEIVQDGWE EEIEALNVDE QLEARYDMDW ADYEQIHDAH NHEMDIDVEE
FTTPEDEFVF DGWGRMGERK YRYVE
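
The sequences above are printed in space-separated blocks of ten characters. The following Python sketch is an illustration only and is not part of the original record. It shows one way to strip that grouping, compute a GC fraction, and translate the DNA. It assumes that translation table 11 assigns codons to amino acids the same way the standard genetic code does, which is how the lookup string below is laid out. The function names (clean, gc_fraction, translate) are hypothetical, and only the first line of the gene sequence is used as input.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(grouped):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGACAGCAG TCGGTATCGA TGCCATCGAG ATCTGGACCG GGAACCTCAA ACTCGACCTT"
fragment = clean(first_line)

print(len(fragment), round(gc_fraction(fragment), 3))
print(translate(fragment))
```

The translated fragment, MTAVGIDAIEIWTGNLKLDL, matches the first 20 residues of the protein sequence listed above. The GC fraction printed here describes only this 60 bp fragment, not the whole gene.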