Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0878 |
Symbol | |
ID | 9244723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1075642 |
End bp | 1076742 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | 3-dehydroquinate synthase |
Protein accession | YP_003678828 |
Protein GI | 297559854 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.365925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGTGA CCCGGATCGG TGTGGGCGAC TCCGCCGGCC GCTACGACGT GGTGGTCGGC AGCGGGGTCC TCTCCGAACT CCCCTCCCTG GTCGGCGACG CCGCCCAGGT GGCCGTCATC CACCCCGAGA GCCTGGACGG GATCGCCCGC CCCGTGGTCG GCGCCCTGGA GGCCGCCGGT TACCGGGCGC ACTCCATGCC CGTCCCGGAC GGCGAGGCCG CCAAGACCGC GGCCGTCGCC GCCGACCTGT GGTCCCGGCT CGGGCAGCGC AACTTCACCC GCAGCGACGC CGTCGTCGGC GTGGGCGGGG GAGCCGCCAC CGACCTGGCC GGGTTCGTCG CCGCCACCTG GCTGCGCGGC GTGCGCTCGG TCCTGGTGCC CACCACCCTG CTCGGCATGG TCGACGCCGC CGTCGGCGGC AAGACCGGCA TCAACACCCC CGAGGGCAAG AACCTCGTCG GCGCCTTCCA CCCCCCGGCC GGGGTCCTGT GCGACCTCGC GACCCTGCCG AGCCTGCCCC GCGCCGACTA CATCGGCGGC CTCGCCGAGA TCGTCAAGGC CGGGTTCATC GACGACCCCG TCATCTGCGA CCTGGTCGAG GACGACCCCG AGGGCGCGGC CGAACCCGGG GGCAGGCACA CCCGCGAGCT CATCGAGCGC GCCATCCGGG TCAAGGCCGA CGTCGTCTCC GGCGACCTGC GCGAGAGCGG CCGCCGCGAG ATCCTCAACT ACGGCCATAC CCTGGGCCAC GCCATCGAGC GCGCCGAGAA CTACACCTTC CGGCACGGCT ACGCGGTCTC CATCGGCATG GTCTACGCCG CCGAACTCGC CCGCCTGGAC GGCCGCGTGG GCGACGACCT GGTCCAGCGC CACCGCTCGC TGCTCTCCTC GGTCGGCCTG CCCGTCTCCT ACGCCCCCGA GGCCTGGCCC GAACTGCGCG CCGCCATGAG CGTGGACAAG AAGGCCCGCG GGGCCACCCT GCGCTTCGTC GTCCTGGACG GCCTGGCGCG GCCCACCATC CTCAGCGGCC CCGCGCCCGA ACTGCTGGAC GAGGCGTACC GCGCGGTCAC GGGCGACACC CGGGTTCCGC GAAGCCACTA G
|
Protein sequence | MTVTRIGVGD SAGRYDVVVG SGVLSELPSL VGDAAQVAVI HPESLDGIAR PVVGALEAAG YRAHSMPVPD GEAAKTAAVA ADLWSRLGQR NFTRSDAVVG VGGGAATDLA GFVAATWLRG VRSVLVPTTL LGMVDAAVGG KTGINTPEGK NLVGAFHPPA GVLCDLATLP SLPRADYIGG LAEIVKAGFI DDPVICDLVE DDPEGAAEPG GRHTRELIER AIRVKADVVS GDLRESGRRE ILNYGHTLGH AIERAENYTF RHGYAVSIGM VYAAELARLD GRVGDDLVQR HRSLLSSVGL PVSYAPEAWP ELRAAMSVDK KARGATLRFV VLDGLARPTI LSGPAPELLD EAYRAVTGDT RVPRSH
|
| |