Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3866 |
Symbol | |
ID | 9247737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4637062 |
End bp | 4638129 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | 3-dehydroquinate synthase |
Protein accession | YP_003681769 |
Protein GI | 297562795 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.948172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCTAC TAGCCCGCAT GCTCCCCTCC CCGCTGGCGA TCGACGTGCG CCGGGGAGCC GTCTCCTCCC TCGGGACGCT GCTCGCCGAC CGCAGGATCG CCACCGAGGG CCGCATCGCC GTGGCCGTCG GCCCGGGGCA GGGGGCGCAG ATCGCCTCCG AGCTGGACCT GCCCAACTGC GAGGTCTTCC ACGTCGAGGA GGGCACCGTC GACGCCGCGA CCGAGCTGGG AAAGAAGCTC CGCTCGGGCG CCTACGAGGC GGTCGCCGGG ATCGGCGGCG GCAAGACCAT CGACGTGACC AAGTTCGCCG CCACCATGGC GGGGATCCCC ATGGTGGCCG TGGCCACCAA CCTCGCGCAC GACGGCATCG CCTCGCCGGT CAGCTCGCTG GAGCACGAGG GCGGCAAGCC CTCCATCGGC GTGACCATGC CCATCGCCGT GGTCATCGAC GTCGACTACG TCCGGGCGGC CCCCTCGCAC CTGGTGCGCT CGGGCATCGG CGACGTGGTC AGCAACATCT CCGCCATCGA GGACTGGGAG CTGGCGGGCC GGGTCAACGG CGAGCCGGTG GACGGCATGT CCGTCACCTT CGCCAGGGTC GCGGCCGAGG CGGTCCTGCA CCGCCCGGAC TCGGTGGAGT CCGAGGCCTT CCTCACGGTG CTGGCCGAGG GCCTGGTGCT CTCGGGGATG GCGATGTCGG TGGCCGGGTC CAGCCGCCCC GCCAGCGGCG CGTGCCACGA GATCCTGCAC GCGGTCACCC AGCTCCACCC GGGCACCAGC AACCACGGCG AGCTCGCCGG GCTGGGCGCG CTGTACGCGT CCTTCCTGCG GGTGCGGCAC CTGGACTGGT CGCAGGCGCG GATGAACGAG ATCCGCGACT GCCTGATCCG TCACGAGCTG CCCGTCGTGC CCTCCGACGT CGGACTCGAC GAGGCGGAGT TCGCACGGGC GGTGGTCCAC GCCCCGGACA CCCGTCCGGG CCGGTTCACC ATCCTGGAAC ACCTGAACCT CTCCGAGGAC GAGATCGGAC GGAGCGTCAA GGACTATGTC GAAGCCGTCG GTCGCTGA
|
Protein sequence | MPLLARMLPS PLAIDVRRGA VSSLGTLLAD RRIATEGRIA VAVGPGQGAQ IASELDLPNC EVFHVEEGTV DAATELGKKL RSGAYEAVAG IGGGKTIDVT KFAATMAGIP MVAVATNLAH DGIASPVSSL EHEGGKPSIG VTMPIAVVID VDYVRAAPSH LVRSGIGDVV SNISAIEDWE LAGRVNGEPV DGMSVTFARV AAEAVLHRPD SVESEAFLTV LAEGLVLSGM AMSVAGSSRP ASGACHEILH AVTQLHPGTS NHGELAGLGA LYASFLRVRH LDWSQARMNE IRDCLIRHEL PVVPSDVGLD EAEFARAVVH APDTRPGRFT ILEHLNLSED EIGRSVKDYV EAVGR
|
| |