Gene Nmag_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1997 
Symbol 
ID8824839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2032562 
End bp2033962 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content60% 
IMG OID 
ProductD-lactate dehydrogenase (cytochrome) 
Protein accessionYP_003480130 
Protein GI289581664 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAC GTTTGGAACA GATCTGTGAG GAACTGGATC CCGGACAGTA TTCGTTCACC 
GATGCGGATC GATTCGAACG GTCCCACGAC TGGGGGACTG ACGAGGAAGA CGGCGTCTAT
CCGGACGTCG TCATCTGGCC CGAAAGCACT GCCGACGTTT CGGCCGTTCT CGCAGCGGCA
AACGATGCGG GGATCCCAGT GACACCGTAC GCGGCCGGGA CGAGCCTCGA AGGAAACGCC
GTTCCACTGC ACGGCGGCAT TAGCCTCGAC CTGACTCGGA TGGACGCAAT ACACGATATC
CGGCCGGATG CGCTGCAGAT CGATGTTGGA CCCGGAATCT ACGGCGACGA GATCAACGCC
GCACTCGAAA ATCACGGACT GATCCTTCCG TCGCTTCCGT CCTCCGGGAA GATATCCACG
ATCGGCGGAA TGATCGCAAA CGACGCCTCC GGCATGAAGA CGGTGAAGTA CGGTGAAGTC
GCAGATTGGC TGCTCGAGGC CGAGGCTGTG CTCCCCTCTG GCGAGGTACT CACAGTTGGG
AGCAAGGCAG CAAAGACTTC CTCGGGGTAC AACGTTCTCG ATCTCCTCGT CGGGAGCGAG
GGTACCCTCG CGGTCGTCAC CCGTCTCACG CTTCGCTTGA CCGGCCGACC CGAACAGATC
TGGGCTGGCC GCGCAACCTT TTCGAATCTG CACGACGCTG CTGACGCCGT GTTCGACGCC
ATCCGTTCGG GCGTCGACGT GGCCAAGATC GAACTGATCG ACTCACTCAG TGCCGAGATT
GCAAACACAC GACTCGATAC CGATCTCCCC AACTCGCCGA TGGTCTTTCT CGAGTTCCAC
GCGAATCAGC ACATCGAAGC CGAAGTCGAC TTCTGCCGGA CGGTGTTCGA CGCACACGAC
ATCGACTCGT TCGAAATCGC CGAACAGGAT CAGGAGATGG CGGCGCTCTG GGAAGCGCGC
CGCGAACTGG CCGACGCGGT TGAGCCGTAC GACCCTGACC TCTCGCCGCT CACCCCCGGC
GACGTCACCG TTCCGATCGA CCGGTTGCCC GATATCGTCG ACTACATCAA AACGCTTGGC
GAGGAACACG ACATCATGAT TCCTTGCTTC GGGCACGCTG GTGATGGGAA CATCCACTAC
TTCGTGATGG TAGATCCTGA CGATCCGGCG ATGGTCTCGA CCGGGCAGGA CGTGTCCAAG
CAGATCGTCG CTCGTGCAGT CGAAATGGGG GGAACTGCGA CCGGGGAACA CGGCATCGGC
ATTGGAAAGC GGGAGTACGT TCCGACCGAG CACGACGAGG CACTGGTCGC CACGATGCGC
TCGATCAAGT CGACGTTCGA CCCGAACGGA ATTCTCAACC CGGGGAAGAT TTTCCCTGAT
GAGTCGGACT CTCAGCGGTA G
 
Protein sequence
MSERLEQICE ELDPGQYSFT DADRFERSHD WGTDEEDGVY PDVVIWPEST ADVSAVLAAA 
NDAGIPVTPY AAGTSLEGNA VPLHGGISLD LTRMDAIHDI RPDALQIDVG PGIYGDEINA
ALENHGLILP SLPSSGKIST IGGMIANDAS GMKTVKYGEV ADWLLEAEAV LPSGEVLTVG
SKAAKTSSGY NVLDLLVGSE GTLAVVTRLT LRLTGRPEQI WAGRATFSNL HDAADAVFDA
IRSGVDVAKI ELIDSLSAEI ANTRLDTDLP NSPMVFLEFH ANQHIEAEVD FCRTVFDAHD
IDSFEIAEQD QEMAALWEAR RELADAVEPY DPDLSPLTPG DVTVPIDRLP DIVDYIKTLG
EEHDIMIPCF GHAGDGNIHY FVMVDPDDPA MVSTGQDVSK QIVARAVEMG GTATGEHGIG
IGKREYVPTE HDEALVATMR SIKSTFDPNG ILNPGKIFPD ESDSQR