Gene Ndas_0878 details

Gene Information

Locus tag: Ndas_0878
Symbol:
ID: 9244723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1075642
End bp: 1076742
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: 3-dehydroquinate synthase
Protein accession: YP_003678828
Protein GI: 297559854
COG category:
COG ID:
TIGRFAM ID:
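
The gene and protein lengths in this record follow from the coordinates. The sketch below shows the arithmetic, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1075642
END_BP = 1076742


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1101 366
```

Both results agree with the Gene Length and Protein Length fields of the record.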


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.365925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGTGA CCCGGATCGG TGTGGGCGAC TCCGCCGGCC GCTACGACGT GGTGGTCGGC 
AGCGGGGTCC TCTCCGAACT CCCCTCCCTG GTCGGCGACG CCGCCCAGGT GGCCGTCATC
CACCCCGAGA GCCTGGACGG GATCGCCCGC CCCGTGGTCG GCGCCCTGGA GGCCGCCGGT
TACCGGGCGC ACTCCATGCC CGTCCCGGAC GGCGAGGCCG CCAAGACCGC GGCCGTCGCC
GCCGACCTGT GGTCCCGGCT CGGGCAGCGC AACTTCACCC GCAGCGACGC CGTCGTCGGC
GTGGGCGGGG GAGCCGCCAC CGACCTGGCC GGGTTCGTCG CCGCCACCTG GCTGCGCGGC
GTGCGCTCGG TCCTGGTGCC CACCACCCTG CTCGGCATGG TCGACGCCGC CGTCGGCGGC
AAGACCGGCA TCAACACCCC CGAGGGCAAG AACCTCGTCG GCGCCTTCCA CCCCCCGGCC
GGGGTCCTGT GCGACCTCGC GACCCTGCCG AGCCTGCCCC GCGCCGACTA CATCGGCGGC
CTCGCCGAGA TCGTCAAGGC CGGGTTCATC GACGACCCCG TCATCTGCGA CCTGGTCGAG
GACGACCCCG AGGGCGCGGC CGAACCCGGG GGCAGGCACA CCCGCGAGCT CATCGAGCGC
GCCATCCGGG TCAAGGCCGA CGTCGTCTCC GGCGACCTGC GCGAGAGCGG CCGCCGCGAG
ATCCTCAACT ACGGCCATAC CCTGGGCCAC GCCATCGAGC GCGCCGAGAA CTACACCTTC
CGGCACGGCT ACGCGGTCTC CATCGGCATG GTCTACGCCG CCGAACTCGC CCGCCTGGAC
GGCCGCGTGG GCGACGACCT GGTCCAGCGC CACCGCTCGC TGCTCTCCTC GGTCGGCCTG
CCCGTCTCCT ACGCCCCCGA GGCCTGGCCC GAACTGCGCG CCGCCATGAG CGTGGACAAG
AAGGCCCGCG GGGCCACCCT GCGCTTCGTC GTCCTGGACG GCCTGGCGCG GCCCACCATC
CTCAGCGGCC CCGCGCCCGA ACTGCTGGAC GAGGCGTACC GCGCGGTCAC GGGCGACACC
CGGGTTCCGC GAAGCCACTA G
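
The GC content field can be checked against the gene sequence with a short helper. This is a minimal sketch: the function name is hypothetical, and it assumes GC content means the fraction of G and C among the A, C, G and T characters, ignoring the spaces and line breaks used to lay out the sequence.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C among the A, C, G, T characters of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # skips spaces and newlines
    if not bases:
        raise ValueError("sequence contains no A, C, G or T characters")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base block of the gene sequence: 6 of 10 bases are G or C.
print(gc_content("GTGACCGTGA"))  # prints: 0.6
```

Passing the full gene sequence to the same function gives a value that can be compared with the 74% reported in the record.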
 
Protein sequence
MTVTRIGVGD SAGRYDVVVG SGVLSELPSL VGDAAQVAVI HPESLDGIAR PVVGALEAAG 
YRAHSMPVPD GEAAKTAAVA ADLWSRLGQR NFTRSDAVVG VGGGAATDLA GFVAATWLRG
VRSVLVPTTL LGMVDAAVGG KTGINTPEGK NLVGAFHPPA GVLCDLATLP SLPRADYIGG
LAEIVKAGFI DDPVICDLVE DDPEGAAEPG GRHTRELIER AIRVKADVVS GDLRESGRRE
ILNYGHTLGH AIERAENYTF RHGYAVSIGM VYAAELARLD GRVGDDLVQR HRSLLSSVGL
PVSYAPEAWP ELRAAMSVDK KARGATLRFV VLDGLARPTI LSGPAPELLD EAYRAVTGDT
RVPRSH
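
The gene begins with GTG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG can serve as an initiation codon and is then read as methionine. The sketch below illustrates this with a small standard-library translator. The amino acid assignments are those of the standard code, which table 11 shares. `START_CODONS` is only a subset of the table 11 initiation codons, and the function name and `first_is_start` parameter are illustrative assumptions.

```python
BASES = "TCAG"
# Standard-code amino acids in NCBI order (first base slowest, third fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    dna = "".join(seq.upper().split())  # remove spaces and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("GTGACCGTGA CCCGGATCGG TGTGGGCGAC"))  # prints: MTVTRIGVGD
```

The output matches the first ten residues of the protein sequence. Translating the last 21 bases with `first_is_start=False` gives RVPRSH, the final six residues, with the closing TAG read as the stop codon.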