Gene Ndas_3685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3685 
Symbol 
ID9247554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4424116 
End bp4425303 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content72% 
IMG OID 
ProductMalate dehydrogenase (oxaloacetate-decarboxylating) 
Protein accessionYP_003681589 
Protein GI297562615 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACTG TTTCGCGTTC CCAGAACCCG CGTCGTGACG ACGATCCCGC CTTCGCCATG 
CACCGCGGCG GCAAGCTCCA CGTGACCTCG TCCGTGGACG TCCGCGACCA GGAGGGACTC
TCCCTCGCCT ACACCCCCGG CGTCGCCCGC GTCTGCGACG CCATCGCCGA GTCCCCCGAA
CTCGTCGACA CCTACACCTG GAAGAGCAAC CTGGTCGCCG TCGTCACCGA CGGCACCGCT
GTCCTGGGCC TGGGCGACAT CGGCGCCGAG GCCTCCCTGC CGGTCATGGA GGGCAAGTCG
CTGCTGTTCA AGCAGTTCGC CGGCGTCGAC TCCGTGCCGA TCGCGCTGGG CTGCACCGGG
GTCGACGAGA TCGTCGAGAC CGTCGTGCGC ATGGCGCCCT CCTTCGGCGG GATCAACCTG
GAGGACATCT CCGCGCCCCG CTGCTTCGAG ATCGAACGGC AGCTGCGCGA GCGCCTGGAC
ATCCCGGTCT TCCACGACGA CCAGCACGGC ACCGCCATCG TCACCGTGGC CGCGCTGCGC
AACGCCGCCC GGTTCACCGG GCGCACCCTG GCCGACCTGC GCGCCGTGGT CTCCGGCGCG
GGCGCCGCGG GCGTGGCCGT CACCAAGATG CTCGTCGACG GCGGGATCGG CGACATCGCC
GTGGCCGACT CCAAGGGCAT GATCTACGAA GGCCGCGAGG GCCTGACCCC GGTCAAGCGG
GAGCTGGCCT CCATCAGCAA CCGGGCGGGG CTGCGCGGGT CCATCGAGAG CGCGCTGGAG
GGGGCCGACG TGTTCATCGG CCTGTCCGCG GGCGAGGTCC CCGAGTCCGT GGTCGCCACC
ATGGCCGACG ACGCGATCAT CTTCGCGATG GCCAACCCCA ACCCGGAGGT GCACCCGGAC
GTCGCCCGCA GGCACGCGAG CGTGGTCGCC ACCGGGCGCA GCGACTTCCC CAACCAGATC
AACAACGTCC TGGCCTTCCC GGGCGTGTTC CGGGGCGCCT TCGACGCGGG GGCCACGGAC
ATCACCGAGA ACATGAAGCT GGCCGCCGCC ACCGCGCTGG CCGACCTGGT GGGCGACAAG
CTGTCCGCCG ACTACATCAT CCCCAGCCCC TTCGACGAGA GGGTGGCCCC GGCGGTCGCC
GCCGCGGTCG CCGCCCAGGC CCGTGAGGAC GGCGTCGTCC GCGGCTAG
 
Protein sequence
MTTVSRSQNP RRDDDPAFAM HRGGKLHVTS SVDVRDQEGL SLAYTPGVAR VCDAIAESPE 
LVDTYTWKSN LVAVVTDGTA VLGLGDIGAE ASLPVMEGKS LLFKQFAGVD SVPIALGCTG
VDEIVETVVR MAPSFGGINL EDISAPRCFE IERQLRERLD IPVFHDDQHG TAIVTVAALR
NAARFTGRTL ADLRAVVSGA GAAGVAVTKM LVDGGIGDIA VADSKGMIYE GREGLTPVKR
ELASISNRAG LRGSIESALE GADVFIGLSA GEVPESVVAT MADDAIIFAM ANPNPEVHPD
VARRHASVVA TGRSDFPNQI NNVLAFPGVF RGAFDAGATD ITENMKLAAA TALADLVGDK
LSADYIIPSP FDERVAPAVA AAVAAQARED GVVRG