Gene Ndas_3625 details


Gene Information

Locus tag: Ndas_3625
Symbol:
ID: 9247494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4347599
End bp: 4348852
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: 1-deoxy-D-xylulose 5-phosphate reductoisomerase
Protein accession: YP_003681531
Protein GI: 297562557
COG category:
COG ID:
TIGRFAM ID:
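
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming Start bp and End bp are 1-based inclusive positions on the replicon and that the last codon of the CDS is a stop codon (the page states neither explicitly). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4347599, 4348852)
print(length, protein_length(length))  # prints: 1254 417
```

Under those assumptions the result agrees with the Gene Length (1254 bp) and Protein Length (417 aa) fields listed above.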


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGAGCG TGCGAGAAGA ACAGCGAACA GCAGTGATCC TCGGTTCCAC CGGCTCGATC 
GGGACCCAGG CCGTCGACAT CGTCCAGCGC AACCCCCGAC GCTTCCGCGT GGTGGGCCTG
GCCGCGGGCG GCGGGCGCGT GGACCTGCTC GCCAAGCAGG CGGCCGAGCT CGACGTCCCG
CTGGTCGCCG TCGCCGACCC CGACCGCGCG GGCGAACTCG CGCGCGCCCT CGCCGGACAC
GGCAGCCGCG CCACCGTCCT GGCCGGGCCC GAGGGGGTCG CCGAGCTGGC CGGGAGCGAG
TGCGACGTCG TCCTCAACGG CATCACCGGC GCTTTGGGCC TGGAGTCCAC CCTCGCCGCG
CTGCGCGCCG GGCGCACCCT CGCGCTGGCC AACAAGGAGT CCCTCATCAT CGGCGGACCG
CTCGTGCGCG GACTCGCCAG GCCCGGCCAG ATCGTCCCGG TCGACTCCGA GCACTTCGCC
ATCGCCCAGT GCTTCCCCCA CCTGCCCGAG GGCGAACTCG CCAGCCCGGC GCAGGGCCAC
GTCCACGCCC GCCGCGACGA GGTGCGCCGC CTGGTCGTCA CCGCCAGCGG GGGCCCCTTC
CGCGGCCGGG GCCGCGACGA ACTGGCCTCG GTCACCCCCG CAGACGCCCT CAACCACCCC
ACGTGGAGCA TGGGCCCGGT CATCACCGTC AACTCCGCCA CCCTGGTCAA CAAGGGGCTG
GAGGTCATCG AGGCCAACCT GCTCTTCGAC GTGCCCTTCG ACCGGATCGA CGTGGTCGTC
CACCCGCAGT CGGTCGTCCA CTCCATGGTC GAGTACGTGG ACGGCTCCAC GATCGCCAAG
GCCAGCCCGC CCAGCATGAT GATCCCCATC GCCTACGGGC TCGGCGCGCC CGACCGCGTA
CCCGACGCCG CGCCCGGCAT GGACTGGACC CGCGCCCACA CCTGGACCTT CGAGCCCCTG
GACCACGAGG CCTTCCCCGC CGTCCGCCTG GCGTGCGAGG TCGGTAGCGC GGGCGGGACC
GCGCCCGCGG TCTACAACGC GGCCAACGAG GAGGCGGTCG ACGCCTTCCT CCGGGGAAAC
CTCGCGTTCC CGGCCATCAT GGACACCGTC TCCCGGGTAG TCTCGGAGCA CCAGCGAACA
GAACGTGCCG GGGGAGCGTC CTCCCACAGA GGACACCTGA GCCTGGACGA CGTGTACGCC
GCTGACACCT GGGCCCGCAC GCGTGCGCGC GAGCTGCTCG CGCGCCAGGC CTGA
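
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any base counting. A small Python sketch of one way to do that and to compute a GC fraction is shown here. The helper names are hypothetical, and the page does not say how its own GC content figure was calculated, so this is only an assumed definition (G plus C over total bases).

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short example input.
first_line = "ATGGGGAGCG TGCGAGAAGA ACAGCGAACA GCAGTGATCC TCGGTTCCAC CGGCTCGATC"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence instead of the first line would give a value to compare with the GC content field.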
 
Protein sequence
MGSVREEQRT AVILGSTGSI GTQAVDIVQR NPRRFRVVGL AAGGGRVDLL AKQAAELDVP 
LVAVADPDRA GELARALAGH GSRATVLAGP EGVAELAGSE CDVVLNGITG ALGLESTLAA
LRAGRTLALA NKESLIIGGP LVRGLARPGQ IVPVDSEHFA IAQCFPHLPE GELASPAQGH
VHARRDEVRR LVVTASGGPF RGRGRDELAS VTPADALNHP TWSMGPVITV NSATLVNKGL
EVIEANLLFD VPFDRIDVVV HPQSVVHSMV EYVDGSTIAK ASPPSMMIPI AYGLGAPDRV
PDAAPGMDWT RAHTWTFEPL DHEAFPAVRL ACEVGSAGGT APAVYNAANE EAVDAFLRGN
LAFPAIMDTV SRVVSEHQRT ERAGGASSHR GHLSLDDVYA ADTWARTRAR ELLARQA
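
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal Python sketch of such a translation follows. It assumes the amino acid assignments of NCBI translation table 11 are the same as those of the standard code (the two tables differ in permitted start codons), and it does not handle alternative start codons such as GTG or TTG, which is acceptable here only because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code layout;
# assumed to match the amino acid assignments of translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGGGAGCG TGCGAGAAGA ACAGCGAACA"))  # prints: MGSVREEQRT
```

The output matches the first ten residues of the protein sequence listed above.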