Gene Nmag_3894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3894 
Symbol 
ID8826764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp290473 
End bp291621 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content62% 
IMG OID 
Productpyruvate dehydrogenase (acetyl-transferring) E1 component, alpha subunit 
Protein accessionYP_003481997 
Protein GI289583587 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.361042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGC AAAACGAAGC GGACCAGCAA CCAGACAGTG CAACGGCACA CGGCACGGCC 
GATCCACTCC AGATTCTGGA CGCGGACGGA ACCGTGCTGT CGAACGCGAC GGTTCCAGAC
CTTTCGGACG GCGACCTGAT CGCGATGTAC GAAGACATCA AACTCGCTCG CCGATTCGAT
CAGCGGGCGA TCAGCCTGCA ACGACAGGGA CGGATCGCGA CGTACGCGCC GATGACAGGA
CAGGAAGGAG CACAGGTCGC AACCGGGTAC GCGTTGGCAG CGCAAGACTG GCTCCTCCCG
ACGTATCGAG AGCACGCCGC CAAGTACGTC CACGGAATGG ATCTCGCATC GCTGTTGAAG
CCACTGTGTG GTCTGCGGGA AGGGTACGCG ATTCCCGACG ACGTAAACGT CATGCCAGAA
TATATTCCGA TCGCAACGCA GGTACCACAG GCCACCGGTA TGGCCTGGGG GAAGCAACGA
CAGGGAGAGA CGGATACTGC CGTCCTCTGT CACTTCGGCG ACGGGGCGAC CTCCGAAGGC
GACTTCCACG AGGGCCTCAA CTTCGCCGGC GTCTTCGACG TCCCCACCGT CTTCGTCTGT
AACAACAACC AGTGGGCGAT TTCGGTCCCT CGCGAACACC AGACTGCCAG TGAAACCATC
GCCCAGAAGG CCGCAGCGTA CGGAATAGAG GGGGTCCGAG TCGACGGCCT CGACCCGCTC
GCCGTCTACG CAGTAACGCG TGCAGCACTC CAGAAGGCGA AGAACCCGGC CGACGACGAA
CGGCGGCCCA CGCTCATCGA GGCCGTCCAG TACCGCTACG GCGCACACAC GACCGCCGAC
GACCCATCAA CGTACCGCGA GGAAGACGAG GCCGAGGACT GGCGCGAGAA AGACCCGCTC
GACCGAATGC AGAACTTCCT CACCAACAGG GGACTGCTCG ACGACGACCT GGAAGCCGAA
ATCGACGAAC GGATCGAGAC ACAGCTCACC GAGGCGGTCG AGTCCGTCGA AGCAGCAACG
ACAGACCCGG CGACGATGTT CGATCACGTC TACGACGTAC TTCCTGCTCG CCTTCGTGAG
CAGCGAGCCG AACTCGAGTC CCTCCGCGAG AAGTACGGCG ACGACGCGTT CCACGAGGTG
TTAGAATGA
 
Protein sequence
MSTQNEADQQ PDSATAHGTA DPLQILDADG TVLSNATVPD LSDGDLIAMY EDIKLARRFD 
QRAISLQRQG RIATYAPMTG QEGAQVATGY ALAAQDWLLP TYREHAAKYV HGMDLASLLK
PLCGLREGYA IPDDVNVMPE YIPIATQVPQ ATGMAWGKQR QGETDTAVLC HFGDGATSEG
DFHEGLNFAG VFDVPTVFVC NNNQWAISVP REHQTASETI AQKAAAYGIE GVRVDGLDPL
AVYAVTRAAL QKAKNPADDE RRPTLIEAVQ YRYGAHTTAD DPSTYREEDE AEDWREKDPL
DRMQNFLTNR GLLDDDLEAE IDERIETQLT EAVESVEAAT TDPATMFDHV YDVLPARLRE
QRAELESLRE KYGDDAFHEV LE