Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4560 |
Symbol | |
ID | 9248441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5404716 |
End bp | 5405861 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_003682453 |
Protein GI | 297563479 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.321101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCTT CAGCGCGCGC GAGCGCGGCG CTCCGCGCCG CCGCCGTCCT CGGCTCCCTC GTCCTCCTCG CCGGGTGCGG CGCCCAGGGG ACCACCGGCG CGGAGGGGGA CGGGACGGGC GGAACGGACG GCCAGCGGGA GCCGGGCGGA CAGGACGGCT CCGCCGGTGC CGACCCCGGC CGGCCCGGGG AGGTCGCCGC GGGCCTGGAG GCGCCGTGGG GGCTGGCCTT CCTCCCCGAC GGCTCGGCCC TGGTCGCCCA GCGCGACTCG GCGGAGGTCG TGCGCGTGTC CCCCGACGGC TCGGTCACCG GCGTCGGCAC CGTCGAGGGG GCCGCGCCGA ACGGTGAGGG CGGTCTGCTG GGACTGGCCG TGGACCCGGA CTTCCCCGGG GAACCGTACG TGTACGCCTA CCACACCGCG GCCTCCGACA ACCGGATCAG CCGTCTGGAG TACGCCCCCG ACGGCGGGGG CTTCGGGGAC ACCGAGGTCG TCCTCGACGG CATCCCCTCC GCCTCCTACC ACAACGGCGG CCGGATCGAG TTCGGCCCCG ACGGCCTGCT GTACGTCGGC ACCGGGGACG CGGGCCAGCA GGGACTGTCC CAGGACACCG GCTCGCTGGG AGGCAAGATC CTGCGGATCA CCGCCGACGG GGACCCGGCG CCGGACAACC CCTTCGGCAA CCCGGTCTAC AGCTACGGCC ACCGCAACGT CCAGGGGCTG GCGTGGGACG ACGAGGGGAA CCTGTACGCC ACCGAGTTCG GGCAGAACGA GTTCGACGAG GTCAACCTGA TCGAACCGGG CGGCAACTAC GGCTGGCCCG AGGTGGAGGG CGCCGGGGGA GGCGACGAGT ACGTCGACCC CCTGCTGACG TGGAGGCCCG CGGAGGCCTC GCCCAGCGGC GCGGCGGTCG CGGGCGGTTC CCTGTGGGTG GCGGCGCTGC GCGGCGAACG CCTGTGGGAG GTGCCGCTCG CCGGTGACGG CGGCGTGGGC GAGCCCGTGG ACCACCACCA GGGGGAGTAC GGCCGCCTGC GCACGGTGGT GACCGCACCC GGCGGCGACG CGCTGTGGCT GGCCACCAGC AACCTCGACG GGATCGGGGA ACCGGCCGAG GGAGGCGACA GGATCCTGCG GGTGCCGCTG GAGTAG
|
Protein sequence | MEPSARASAA LRAAAVLGSL VLLAGCGAQG TTGAEGDGTG GTDGQREPGG QDGSAGADPG RPGEVAAGLE APWGLAFLPD GSALVAQRDS AEVVRVSPDG SVTGVGTVEG AAPNGEGGLL GLAVDPDFPG EPYVYAYHTA ASDNRISRLE YAPDGGGFGD TEVVLDGIPS ASYHNGGRIE FGPDGLLYVG TGDAGQQGLS QDTGSLGGKI LRITADGDPA PDNPFGNPVY SYGHRNVQGL AWDDEGNLYA TEFGQNEFDE VNLIEPGGNY GWPEVEGAGG GDEYVDPLLT WRPAEASPSG AAVAGGSLWV AALRGERLWE VPLAGDGGVG EPVDHHQGEY GRLRTVVTAP GGDALWLATS NLDGIGEPAE GGDRILRVPL E
|
| |