Gene Ndas_3623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3623 
Symbol 
ID9247492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4344757 
End bp4345914 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content70% 
IMG OID 
Product1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 
Protein accessionYP_003681529 
Protein GI297562555 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.772778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGTCG ATCTCGGTAT TCCCGCCGCA CCGCCGCGCC CGCTGGCGAC CCGCCGCAAG 
ACGCGGCAGA TCATGGTCGG AAACGTCCCC GTCGGCGGGG ACGCCCCGGT GTCCGTGCAG
TCGATGACGA CCACCCGTAC CTCCGACATC AACGCGACCC TCCAGCAGAT CGCGGAGCTG
ACCGCCGCGG GCTGCCAGAT CGTCCGCGTG GCCGTGCCCA CCAACGACGA CGCCGACGCC
CTGCCGATCA TCGCCAGGAA GTCGCAGATC CCGGTGATCG CCGACATCCA CTTCCAGCCC
AAGTACGTGT TCCAGGCGAT CGACGCCGGA TGCGCCGCCG TGCGCGTCAA CCCGGGCAAC
ATCAAGAAGT TCGACGACAA GGTCGCCGAG ATCGCCAAGG CGGCCGGTGA GGCCGGGACG
CCGATCCGCA TCGGCGTCAA CGCCGGTTCG CTGGACAAGC GCCTGCTCCA GAAGTACGGC
AAGGCCACGC CTGAGGCCCT GGTCGAGTCG GCTCTGTGGG AGTGCTCGCT GTTCGAGGAG
CACGGCTTCC GCGACATCAA GATCTCGGTC AAGCACAACG ACCCCGTGGT CATGGTCAAC
GCCTACCGCC AGCTCGCCGC GGCCTGCGAC TACCCGCTGC ACCTGGGCGT GACCGAGGCC
GGTCCCGCCT TCCAGGGCAC CATCAAGTCC GCCGTGGCCT TCGGCGCTCT GCTCTCGGAG
GGCATCGGCG ACACCATCCG CGTGTCCCTG TCCGCGCCCC CCGCGGAGGA GGTCAAGGTC
GGCAACCAGA TCCTGGAGTC GCTCGGGCTG CGCGAGCGCG GCCTGGAGAT CGTCTCCTGC
CCCAGCTGCG GCCGGGCCCA GGTGGACGTG TACACGCTCG CCGAGGAGGT CACCGCGGGT
CTGGAGGGCA TGGAGGTGCC GCTGCGCGTG GCCGTCATGG GTTGCGTCGT CAACGGCCCC
GGCGAGGCCC GCGACGCCGA CCTGGGCGTG GCCTCCGGCA ACGGCAAGGG CCAGATCTTC
GTCAAGGGCG AGGTCATCAA GACCGTGCCC GAGTCCAAGA TCGTGGAGAC CCTCATCGAG
GAGGCCATGC GCATCGCCGA GGAGATGGGC GAGTCCGGCG CCGAGTCGGG CGCGCCCACG
GTCTCCGTGG CAGGCTGA
 
Protein sequence
MTVDLGIPAA PPRPLATRRK TRQIMVGNVP VGGDAPVSVQ SMTTTRTSDI NATLQQIAEL 
TAAGCQIVRV AVPTNDDADA LPIIARKSQI PVIADIHFQP KYVFQAIDAG CAAVRVNPGN
IKKFDDKVAE IAKAAGEAGT PIRIGVNAGS LDKRLLQKYG KATPEALVES ALWECSLFEE
HGFRDIKISV KHNDPVVMVN AYRQLAAACD YPLHLGVTEA GPAFQGTIKS AVAFGALLSE
GIGDTIRVSL SAPPAEEVKV GNQILESLGL RERGLEIVSC PSCGRAQVDV YTLAEEVTAG
LEGMEVPLRV AVMGCVVNGP GEARDADLGV ASGNGKGQIF VKGEVIKTVP ESKIVETLIE
EAMRIAEEMG ESGAESGAPT VSVAG