Gene Ndas_0366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0366 
Symbol 
ID9244201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp446930 
End bp448336 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content71% 
IMG OID 
ProductMalate dehydrogenase (oxaloacetate-decarboxylating) 
Protein accessionYP_003678320 
Protein GI297559346 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.697833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCACCC TTCCCAGCGT TTCCTACTCC ATCACCGTCC GCCTCGAACT GGACGCTGGG 
GGCAGTGCGG TCGGCAGTCT CACCAACGCC GTCGAGCAGG TGGGCGGCAT GATCACCGCG
CTGGACGTGG CCGCCGCCGG ACACGAGCGC ATCCGCATCG ACGTCACCTG CGCCGCCCGC
GACACCGAGC ACGCCCAGGC CATCGTCGAC GCCCTGGGCG CCGTCGAGGG CGTGGTCGTG
CACAAGGTCA GCGACCGCAC CTTCCTCATG CACCTGGGCG GCAAGATCGA GATGAAGTCC
AAGGTGCCGC TGCGCAACCG CGACGAGCTC TCCATGGCCT ACACCCCCGG CGTCGCCCGC
GTCTCCCAGG CCATCGCCGC CAACAAGGAC GACGCCCGGC GCCTGACCAT CAAGCGCAAC
AGCGTCGCCG TGGTCACCGA CGGCTCCGCG GTCCTGGGCC TGGGCAACAT CGGCCCCGAG
GCCGCCATGC CCGTCATGGA GGGCAAGGCC GCCCTGTTCA AACGCTTCGC CGACATCGAC
GCCTGGCCCA TCGCCCTGGA CACCCAGGAC GTGGACGAGA TCGTGCGCAC CGTGCAGGTG
CTGGCGCCCG GGTTCGGCGG CATCAACCTG GAGGACATCT CCGCCCCCCG CTGCTTCGAG
GTCGAGGCCC GCCTGCGCGA GCTGCTCGAC ATCCCCGTCT TCCACGACGA CCAGCACGGC
ACCGCCATCG TCGTGCTCGC CGCCCTGCGC AACGCTCTGC GCGTGGTGGG CAAGAAGCTC
GGCGAGGTCC GCATCGCCAT GTCCGGCGCG GGCGCGGCCG GAACCGCGAT CCTCAAGCTC
CTCATGCACG CCGGGGCGCG CGACGTCATC GTCAGCGACG TGCACGGCGC CGTGCACGCC
GGGCGCGAGG ACCTCGACCC CAACCTGCGG TGGATCGCGG AGCACACCAA CCCCGAGGGC
TACAGCGGCG ACCTGCGCGG TGCCGTGGCC GGGGCGGACG TCTTCATCGG CGTCTCGGCC
CCCAACCTCC TCAACGGCGA CGACATCGCC GAGATGAACG AGGACGCCAT CATCTTCGCG
CTGGCCAACC CCGACCCCGA GGTCGACCCG GACGTGGCCC ACCTGCACGC CTCCGTGGTG
GCCACCGGCC GCAGCGACTA CCCCAACCAG ATCAACAACG TGCTGGTCTT CCCCGGCTTC
TTCCGCGGTC TGCTGGACGC CCAGAGCCAC GACGTCACCT CGGACATGAT GGTCGCCGCG
GCGGAGGCCC TGGCCGACGT CGTCACCGAG GACGAGCTGG GCCCCAACTA CATCATCCCC
AGCGTGTTCC ACTCCGACCT GTCCACGCAC GTGGCCACGG CCGTGCGCGA GGTCGCCCAG
CGCGGCCAGG CGGCCGCACA GGCATAA
 
Protein sequence
MATLPSVSYS ITVRLELDAG GSAVGSLTNA VEQVGGMITA LDVAAAGHER IRIDVTCAAR 
DTEHAQAIVD ALGAVEGVVV HKVSDRTFLM HLGGKIEMKS KVPLRNRDEL SMAYTPGVAR
VSQAIAANKD DARRLTIKRN SVAVVTDGSA VLGLGNIGPE AAMPVMEGKA ALFKRFADID
AWPIALDTQD VDEIVRTVQV LAPGFGGINL EDISAPRCFE VEARLRELLD IPVFHDDQHG
TAIVVLAALR NALRVVGKKL GEVRIAMSGA GAAGTAILKL LMHAGARDVI VSDVHGAVHA
GREDLDPNLR WIAEHTNPEG YSGDLRGAVA GADVFIGVSA PNLLNGDDIA EMNEDAIIFA
LANPDPEVDP DVAHLHASVV ATGRSDYPNQ INNVLVFPGF FRGLLDAQSH DVTSDMMVAA
AEALADVVTE DELGPNYIIP SVFHSDLSTH VATAVREVAQ RGQAAAQA