Gene Ndas_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1446 
Symbol 
ID9245296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1771575 
End bp1772708 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content78% 
IMG OID 
ProductMalate/L-lactate dehydrogenase 
Protein accessionYP_003679384 
Protein GI297560410 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0751418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCG CGGCCCCCAC CCAGGCCCCG CCGGAACGCG AGGCCGTACG GGTACGCCAC 
GACGACCTGG TCGCGTTCGC CGCCGGGGTG TTCACCGACC GCGGCCTGCC CCCCGACCGG
GCGGCCGAGG CGGCGCGCGC TCTGTGCCAC GGCGACCTCG CCGGGCCGCG TTCGCACGGT
CTGGCCAACC TGACCCGCCT CTACCTGCCG CTCCTCGACG AGGGCAGGGC CGACCCCGCC
GCGGAGCCGC GCGTCCTCGC CGACCTCGGC GCCGCCGTGC TCTGGGACTC CCGGCGGGCC
CTGGGCCTGT GGGCGGCGAG CGAGGCCATG GACCTGGCCG CCGAGCGCGC CGCGCGCCAC
GGCATCGGGC TGGTGTCCGT GCGCGGCGCC ACCCACCTGG GCTGCGCCGG GTACCACGCG
CTGCGCGCGG CCGAACGCGG CATGGTGGGC CTGGTGGCCA GCAACTGCGG ACGCCAGCGC
ATCGCCCGCC CGCCCGGCGG CGCGGTCGCG ATGCTGGGCA CCAACCCGCT CAGCGTCGCC
GCCCCGGCCG GGGAGCACCC GCCGTTCCTG CTCGACATGA GCACCACGGC CGCGCCCACC
GGCCGGATCC GCCAGGCCGC CCGCGAGGGC CTCGCCCTGC CCGAAGGCCT GCTGTGCGAC
GACACCGGCG CGCCCGTCAC CGACCCCGCC GCCTTCGACG CCGGGCGCGC GCACCTGATG
TGGCTGGGCG GCGAAGCGGG ACGCTACAAG GGCTTCGGCC TCGGACTCAT GGTCGAGGTG
CTCTCCGCAC TGGTCCCGGG GGCCGGGACG GGCCCCCACC CCGACGCCCT GGACGGGGAC
GGCGGCCCGA GCGGACGCGA CGACGACATC GGCTTCTTCG TGGCCGCGAT CGCGCCCGGC
GCCCTGCGGC AGGGCGCCGA CGACGACGCG CGGGAGCTGT TCGGCGCGCT GCTGGCCTGT
CCGCCCACCG ACCCGGACGC GCCGGTGCGC TACCCCGGCT GGCACGAGTA CCACCGGGCG
CGGGAACTGC GCCTGGCGGG CGTGCCGCTG GAGGCGGAGC TGTACGCCGA GCTGGCGGAG
CTGGCCGACC GGACCGGCCT GCCCTTCGAG GCGATGCGGG AGGAGACGCG ATGA
 
Protein sequence
MTTAAPTQAP PEREAVRVRH DDLVAFAAGV FTDRGLPPDR AAEAARALCH GDLAGPRSHG 
LANLTRLYLP LLDEGRADPA AEPRVLADLG AAVLWDSRRA LGLWAASEAM DLAAERAARH
GIGLVSVRGA THLGCAGYHA LRAAERGMVG LVASNCGRQR IARPPGGAVA MLGTNPLSVA
APAGEHPPFL LDMSTTAAPT GRIRQAAREG LALPEGLLCD DTGAPVTDPA AFDAGRAHLM
WLGGEAGRYK GFGLGLMVEV LSALVPGAGT GPHPDALDGD GGPSGRDDDI GFFVAAIAPG
ALRQGADDDA RELFGALLAC PPTDPDAPVR YPGWHEYHRA RELRLAGVPL EAELYAELAE
LADRTGLPFE AMREETR