Gene Information |
Locus tag | Ndas_3799 |
Symbol | |
ID | 9247670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4562221 |
End bp | 4563288 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_003681703 |
Protein GI | 297562729 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
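The coordinate and length fields in the Gene Information table are consistent with each other, and the arithmetic can be sketched as below. This is a minimal check, assuming the Start bp and End bp values are 1-based inclusive positions on the replicon and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and do not come from the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4562221, 4563288)
print(length)                  # 1068
print(protein_length(length))  # 355
```

Both results match the Gene Length (1068 bp) and Protein Length (355 aa) fields reported for Ndas_3799.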
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0477666 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCGCTG TCACCGGTGC CGCCTCCGGG GTGGGCCGGC TCCTCGTGCA ACGCCTGGCC GCGCCGGAGA GTTCCGCGCG CCTGCGTGAG GTCGTCGCCA TCGACGCCGA GCTCGCCGAC CTGCGCGGCG TCACCTGGCG CATCGCCGAC GTGTGCGACC CCGGGCTGGT CTCCCGCCTC GGCGGGGTGG ACGTGCTCGT GCACACCGCC GACGACCGCT CCCTGGAGAC CCGGCCCGCG CGGCGCCGCG CGCACAACAT CCGCGCGGCC CAGACGGTCC TGACCGCCGC CGCGGCCTCG GGGGTCCCCC GCGTCATCCT GGTCACCAGC ACCATGGTCT ACGGGGCCGC CCCCGACAGC CCCGTGCCCC TGGCGGAGAA CGCGCCGCGC GTGTCCGACA ACAGCGAGGG GCTCATGGGC GACTTCGCCG AGATCGAGGC CCTCGCCGAG CGCGCCCGCC GGGCCCACCC CAGCCTCACC GTCACCGTCG TGCGCCCCGC CCCGCTCGTG GGGCCCGGGC TCGACACCCT CCTCAGCCGC CACTTCTCCG CCCCGCGCCT GCTCACCGTC AAGGGGCACG AACAGCACTG GCAGTTCTGC CACGAGGACG ACCTGGTCAG CGCCCTGGCC TTCTGCGCCC TGCACGGGGT GGACGGGCCC GAGGGCGTGG TCGCCGTGGC CAGCGAGGGC TCCCTCACCC AGGACGAGGT GGAGGCCGTC TCCGGGATGA AGCACTTCGA GGTGCCCGCC AACCTGGCCT TCGGCGCCGT CCGCCGCCTC CACCAGGCCC GGATCACCCC GGCGGCCGAG GGCGAGCTGA AGTTCCTCGT CTACCCGTGC GTGGTGGACT GCGCGGTGCT GCGCGAGGCC GGGTGGAAGC CCGCCCACGA CAACGAGTCC GCCCTGGCAG CCCTGCTGGA GTCCCGCAGC GGCAGGCCCG CGCTGGTCGG CCGCAGCCTG GGCCGCAAGG AGGTCACCAT CACCGCCGCG GGTGCCGCCG GAGCCGCCGC GGCGGCCATC GGCACCGCCG CAGCCATCCG CCACCTGCGC AAACGCAAGG GGGCGTGA
|
Protein sequence | MVAVTGAASG VGRLLVQRLA APESSARLRE VVAIDAELAD LRGVTWRIAD VCDPGLVSRL GGVDVLVHTA DDRSLETRPA RRRAHNIRAA QTVLTAAAAS GVPRVILVTS TMVYGAAPDS PVPLAENAPR VSDNSEGLMG DFAEIEALAE RARRAHPSLT VTVVRPAPLV GPGLDTLLSR HFSAPRLLTV KGHEQHWQFC HEDDLVSALA FCALHGVDGP EGVVAVASEG SLTQDEVEAV SGMKHFEVPA NLAFGAVRRL HQARITPAAE GELKFLVYPC VVDCAVLREA GWKPAHDNES ALAALLESRS GRPALVGRSL GRKEVTITAA GAAGAAAAAI GTAAAIRHLR KRKGA
|
| |
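The GC content field can be recomputed from the gene sequence as printed in the Sequence table. The sketch below is one hedged way to do that, assuming the sequence is plain text grouped in space-separated blocks of ten bases, as it appears in the record. The helper name `gc_content` is hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full 1068 bp.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence in the record.
example = "GTGGTCGCTG TCACCGGTGC"
print(f"{gc_content(example):.0%}")  # 70%
```

Passing the full gene sequence string to the same function gives the value to compare against the GC content field of the record.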