Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3852 |
Symbol | |
ID | 9247723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4624092 |
End bp | 4625453 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | nucleotide sugar dehydrogenase |
Protein accession | YP_003681755 |
Protein GI | 297562781 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0199691 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.463935 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATGCCG CAGCCTCCAA CCCAGCCCTT GTCTCCGAGC AGTCCGCCGG TGCGGTCAGC GACCTCGTCG TTCTCGGCCT CGGCTACGTC GGCCTGCCGC TGGCGGCCGA GGCCGTGTCC GCCGGCCTGA AGGTCACCGG TCTGGACGTG AGCGACCGCG TCGTCGACGG GCTCAACAAC GCCGTCTCGC ACGTGGACGA CCTCTCCGCG GAGGACGTGC GCCGGATGCT GGACCGCGGT TTCACCGCGA CCACCGACCC GGCCTGCCTG GCCGCCGCGC GCACCATCGT GATCTGCGTG CCCACGCCGC TGTCGGCCGA GGGCGGCCCC GACCTGGGCG CGGTGACGTC CGCGGCCGAG GCGATCGCCG CCCAGCTGCA GCCCGGCACC CTGGTCATCC TGGAGTCGAC CACCTACCCC GGCACCACCG AGGAGGTGGT CCGCCCGCTG CTGGAGAAGT CCGGCCTGGT CGCGGGCGCC GACTTCCACC TGGCCTTCTC GCCCGAGCGC ATCGACCCCG GCAACCCGAC CTTCGGCGTG GCCAACACGC CCAAGGTGGT CGGCGGCCTG ACGCAGGAGT GCGGCGAGGC GGCGGCGGAG TTCTACGGCG CCTTCGTGAA CACGGTGGTG CGGGCGCGCG GCACCCGCGA GGCCGAGATG GCCAAGCTGC TGGAGAACAC CTACCGCCAC GTCAACATCG CCCTGGTCAA CGAGATGGCC ATCTTCTGCC AGGAGCTGGG CGTGGACCTG TGGGACTCCA TCGCCGCGGC GGCCACCAAG CCGTTCGGCT TCCAGGCCTT CTACCCGGGC CCGGGCGTGG GCGGCCACTG CATCCCCATC GACCCGAACT ACCTGTCGTA CAAGGTCAAG ACCCTCGGCT ACCCGTTCCG GTTCGTGGAG CTGGCCCAGG AGATCAACGG CCGCATGCCC TCCTACGTCA TCCAGCGGGC GCAGGAGCTG CTCAACGACT CCGGCCTGGC CCTGTCGCGC TCCAAGGTGC TGCTGCTGGG CGTCACCTAC AAGGCCGACA TCGCCGACCA GCGCGAGTCC CCGGCCCGGC CGGTCGCGCG CAAGCTGGCC GCCAAGGGCG CCACGCTGAC CTACCACGAC CCGCACGTGG AGTCCTGGCA GGTCGACGGC GTGGACGTGC CCAGGTCCAC CGACCTGGAC CGCGCCCTGG CCGAGGCCGA CCTGACCATC CTGCTCACCG ACCACGCCGA GTACCGGCCC AAGCGGCTGG AGGAGTACGC GCGGCTGCTC CTGGACACCC GGGGCGTGCT GCGCCGCCCC GACCCCGAGG ACTCCGCGGT CCCCTCGCAG GTGCGGCGCC ACGTCACACG GGAGGGCATC GAGGTCCTGT GA
|
Protein sequence | MDAAASNPAL VSEQSAGAVS DLVVLGLGYV GLPLAAEAVS AGLKVTGLDV SDRVVDGLNN AVSHVDDLSA EDVRRMLDRG FTATTDPACL AAARTIVICV PTPLSAEGGP DLGAVTSAAE AIAAQLQPGT LVILESTTYP GTTEEVVRPL LEKSGLVAGA DFHLAFSPER IDPGNPTFGV ANTPKVVGGL TQECGEAAAE FYGAFVNTVV RARGTREAEM AKLLENTYRH VNIALVNEMA IFCQELGVDL WDSIAAAATK PFGFQAFYPG PGVGGHCIPI DPNYLSYKVK TLGYPFRFVE LAQEINGRMP SYVIQRAQEL LNDSGLALSR SKVLLLGVTY KADIADQRES PARPVARKLA AKGATLTYHD PHVESWQVDG VDVPRSTDLD RALAEADLTI LLTDHAEYRP KRLEEYARLL LDTRGVLRRP DPEDSAVPSQ VRRHVTREGI EVL
|
| |