Gene Ndas_3866 details


Gene Information

Locus tag: Ndas_3866
Symbol:
ID: 9247737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4637062
End bp: 4638129
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: 3-dehydroquinate synthase
Protein accession: YP_003681769
Protein GI: 297562795
COG category:
COG ID:
TIGRFAM ID:
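
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. Neither convention is stated in the record, and the function name is purely illustrative.

```python
def check_gene_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and one terminal stop codon is not counted in the protein.
    """
    span = end_bp - start_bp + 1      # number of bases covered by the gene
    codons = gene_length_bp // 3      # whole codons, including the stop
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and codons - 1 == protein_length_aa
    )


# Values copied from the Gene Information fields for Ndas_3866.
consistent = check_gene_lengths(4637062, 4638129, 1068, 355)
print(consistent)
```

Under these assumptions the span is 4638129 - 4637062 + 1 = 1068 bp, which is 356 codons, or 355 amino acids plus one stop codon.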


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.948172
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCTAC TAGCCCGCAT GCTCCCCTCC CCGCTGGCGA TCGACGTGCG CCGGGGAGCC 
GTCTCCTCCC TCGGGACGCT GCTCGCCGAC CGCAGGATCG CCACCGAGGG CCGCATCGCC
GTGGCCGTCG GCCCGGGGCA GGGGGCGCAG ATCGCCTCCG AGCTGGACCT GCCCAACTGC
GAGGTCTTCC ACGTCGAGGA GGGCACCGTC GACGCCGCGA CCGAGCTGGG AAAGAAGCTC
CGCTCGGGCG CCTACGAGGC GGTCGCCGGG ATCGGCGGCG GCAAGACCAT CGACGTGACC
AAGTTCGCCG CCACCATGGC GGGGATCCCC ATGGTGGCCG TGGCCACCAA CCTCGCGCAC
GACGGCATCG CCTCGCCGGT CAGCTCGCTG GAGCACGAGG GCGGCAAGCC CTCCATCGGC
GTGACCATGC CCATCGCCGT GGTCATCGAC GTCGACTACG TCCGGGCGGC CCCCTCGCAC
CTGGTGCGCT CGGGCATCGG CGACGTGGTC AGCAACATCT CCGCCATCGA GGACTGGGAG
CTGGCGGGCC GGGTCAACGG CGAGCCGGTG GACGGCATGT CCGTCACCTT CGCCAGGGTC
GCGGCCGAGG CGGTCCTGCA CCGCCCGGAC TCGGTGGAGT CCGAGGCCTT CCTCACGGTG
CTGGCCGAGG GCCTGGTGCT CTCGGGGATG GCGATGTCGG TGGCCGGGTC CAGCCGCCCC
GCCAGCGGCG CGTGCCACGA GATCCTGCAC GCGGTCACCC AGCTCCACCC GGGCACCAGC
AACCACGGCG AGCTCGCCGG GCTGGGCGCG CTGTACGCGT CCTTCCTGCG GGTGCGGCAC
CTGGACTGGT CGCAGGCGCG GATGAACGAG ATCCGCGACT GCCTGATCCG TCACGAGCTG
CCCGTCGTGC CCTCCGACGT CGGACTCGAC GAGGCGGAGT TCGCACGGGC GGTGGTCCAC
GCCCCGGACA CCCGTCCGGG CCGGTTCACC ATCCTGGAAC ACCTGAACCT CTCCGAGGAC
GAGATCGGAC GGAGCGTCAA GGACTATGTC GAAGCCGTCG GTCGCTGA
 
Protein sequence
MPLLARMLPS PLAIDVRRGA VSSLGTLLAD RRIATEGRIA VAVGPGQGAQ IASELDLPNC 
EVFHVEEGTV DAATELGKKL RSGAYEAVAG IGGGKTIDVT KFAATMAGIP MVAVATNLAH
DGIASPVSSL EHEGGKPSIG VTMPIAVVID VDYVRAAPSH LVRSGIGDVV SNISAIEDWE
LAGRVNGEPV DGMSVTFARV AAEAVLHRPD SVESEAFLTV LAEGLVLSGM AMSVAGSSRP
ASGACHEILH AVTQLHPGTS NHGELAGLGA LYASFLRVRH LDWSQARMNE IRDCLIRHEL
PVVPSDVGLD EAEFARAVVH APDTRPGRFT ILEHLNLSED EIGRSVKDYV EAVGR
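
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the Python below strips that spacing, translates DNA to protein, and computes GC content. It relies on the fact that the sense-codon assignments of translation table 11 match the standard genetic code; the helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any database tooling. Only the first 60-base line of the gene sequence is used in the demonstration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as printed in the record.
first_line = "ATGCCGCTAC TAGCCCGCAT GCTCCCCTCC CCGCTGGCGA TCGACGTGCG CCGGGGAGCC"
print(translate(first_line))  # MPLLARMLPSPLAIDVRRGA
```

The printed peptide matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.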