Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1760 |
Symbol | |
ID | 9245610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2146590 |
End bp | 2147690 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 80% |
IMG OID | |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_003679694 |
Protein GI | 297560720 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.122533 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00990126 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCGCG TGGGCGTACT CGGCGCCGCG GGGGCGGTCG GTTCCGCGCT GCTGGCGCGG CTGGCCGGTA CCGGGGCGCG CCTGACGGCG GGCGTGCGCG ACCCCGGCCG CCTGTCCGGC CCTCCCCCCG GGGCGGCCGT GCGCGTGGTC GACGCCGAGG ACCCCGCGGG GCTGGCGGAG TTCTGCGCCT CCCACGACGT GGTGGTCAAC TGCGCGGGCC CCTCCGCGCT CCTGGGGGAC CGGGTCCTGC GGGCCGCGAC CGCGTCCGGC GCCCACTACG TCTCGGTGGG GGACGACGGA CGCGACCACC TCTCCCCCGC GGGACCGGAC GACCCCGGCC CGGCGCCGGG CCGCTGCGCC CTGCTGGGGG CGGGCCTGCT GCCGGGGCTG AGCACACTGC TGCCCCGCGT GCTGGCCGAC GGCTTCGACC GGGTGACGGA CATGACAGTC CACTCCGGCG GCCTGGAGCG CTTCACCCCG GCGGCCGCGC GCGACTACGT CGCGGGGCTG GCCTCCGGCG CGGACCGCTC GCTGGCGGCC TGGCGCGGCC GCCGTGTCGC GGGCGCCCTG CGGCCCGAGG CGGACGCGCG GCTGCCCTTC CTGCCCCGGC CCGTGTCCCT GCACCCCTTC CTGAGCCCCG AGGCCGAACG CCTGGCACGC GCCCTGTCCC TGGAGCGCCT GGACTGGTGG CACGTCTTCG AGGGGACGCG CACCACCGAC GCGCTCGCCG GGACGCGGGG CCGGGGCGTC ACCGACCCCG ACGCCCTGGC GGACCTGCTC GTGCGCGCCT CCGGCCTGGA GGTGTTCGGC CGCACCCAGT ACCAGGCGCT GGTGCTGCGG GCCGGGGGCC GGATCGGCGG CCGTGAGCGC ACCCGGGTCC TCGCGCTCAC CGGCGCGGGT CCGGCCCTGA GCGCCGAGGC CGCGGCCCTG GCGGTGCGGT TCGCGGCCGG GGGCGGGGCG GCGGACGGAA CGCACTGGGC GGGCGAGGCG CTGCCGACCG CCGGGGTCCT CGACGGCCTG CGGGACGCGC CCGGCGTCGC GTTCCTGCGC CTCACCGACG ACGATGACGC GCACTCCGGA GTCGAGGAGG GGGTCCTGTG A
|
Protein sequence | MTRVGVLGAA GAVGSALLAR LAGTGARLTA GVRDPGRLSG PPPGAAVRVV DAEDPAGLAE FCASHDVVVN CAGPSALLGD RVLRAATASG AHYVSVGDDG RDHLSPAGPD DPGPAPGRCA LLGAGLLPGL STLLPRVLAD GFDRVTDMTV HSGGLERFTP AAARDYVAGL ASGADRSLAA WRGRRVAGAL RPEADARLPF LPRPVSLHPF LSPEAERLAR ALSLERLDWW HVFEGTRTTD ALAGTRGRGV TDPDALADLL VRASGLEVFG RTQYQALVLR AGGRIGGRER TRVLALTGAG PALSAEAAAL AVRFAAGGGA ADGTHWAGEA LPTAGVLDGL RDAPGVAFLR LTDDDDAHSG VEEGVL
|
| |