Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4061 |
Symbol | |
ID | 9247933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4857265 |
End bp | 4858269 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Xylose isomerase domain protein TIM barrel |
Protein accession | YP_003681963 |
Protein GI | 297562989 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGAC CAGTCACCCT GTTCACCGGA CAGTGGGCCG ACCTGCCCTT CGAGGAGGTG TGCTCCCTGG CCTCCTCGTG GGGCTACGAC GGTCTGGAGG TCGCCTGCTG GGGCGACCAC CTCGACGTCA CCCGGGCCGC CGTGGACGAC GACTACGTGC GCGACCGGCT GGACACGCTC AAACGCCACG GGCTGGGCCT GTGGGCGATC TCCAACCACC TGCTGGGCCA GGCCGTCTGC GACGACCCGA TCGACCACCG CCACCGCGAC ATCCTGCCCG CGCGTGTGTG GGGCGACGGC GAGCCCGAGG GCGTGCGCCG GCGGGCCGCC GAGGAGATGA AGACCACCGC CCGCGCGGCC GCCAGGCTCG GCGTGTCCAC CGTGGTGGGC TTCACGGGCT CGGCCGTCTG GAAGTACGTG GCGATGTTCC CGCCGGTGGG CGAGGACGTC ATCGAGGCCG GGTACCGCGA CTTCGCCGAC CGGTGGAACC CGATCCTGGA CGTGTTCGAC GAGGTGGGGG TGCGTTTCGC GCACGAGGTG CACCCCTCCG AGATCGCCTA CGACTACTGG TCAACCCAGA GGGCGCTGGA GGCGGTGGGG CACCGTCCGG CCTTCGGCCT GAACTGGGAC CCCAGCCACA TGGTGTGGCA GGACATCGAC CCGGTCGGGT TCCTGTGGGA CTTCCGGGAC CGGATCTACC ACGTGGACTG CAAGGACGCC CGCAAGCGGG TGGGCAACGG CCGCAACGGG CGGCTGGGCT CCCACCTGCC GTGGGGTGAT CCGCGCCGGG GCTGGGACTT CGTCTCGACC GGGCACGGCG ACGTGCCCTG GGAGGACTGC TTCCGCACGC TGAACTCGAT CGGCTACCAG GGGCCGATCT CGGTGGAGTG GGAGGACGCG GGCATGGACC GGCTGGTCGG GGCGGCCGAG GCGGTGGAGT TCGTGCGCTC CCACGCCTAC GACCCGCCGG AGGCGTCGTT CGACTCCGCC TTCGGGTCGG ACTGA
|
Protein sequence | MPRPVTLFTG QWADLPFEEV CSLASSWGYD GLEVACWGDH LDVTRAAVDD DYVRDRLDTL KRHGLGLWAI SNHLLGQAVC DDPIDHRHRD ILPARVWGDG EPEGVRRRAA EEMKTTARAA ARLGVSTVVG FTGSAVWKYV AMFPPVGEDV IEAGYRDFAD RWNPILDVFD EVGVRFAHEV HPSEIAYDYW STQRALEAVG HRPAFGLNWD PSHMVWQDID PVGFLWDFRD RIYHVDCKDA RKRVGNGRNG RLGSHLPWGD PRRGWDFVST GHGDVPWEDC FRTLNSIGYQ GPISVEWEDA GMDRLVGAAE AVEFVRSHAY DPPEASFDSA FGSD
|
| |