Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4110 |
Symbol | |
ID | 9247984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4906874 |
End bp | 4907884 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | 2-dehydropantoate 2-reductase |
Protein accession | YP_003682012 |
Protein GI | 297563038 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.661509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.957383 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAATCG CCGTACTCGG CGCCGGAGCG ATCGGCGCCT ACGCGGGAGC CGCGCTGCAC AGGGGCGGGG CCGACGTGCA CCTCATCGCG CGCGGCGAGC ACCTGCGGGC CCTGCGCGAG CGGGGAGTCC GGGTCCTCAG CCCGCGCGGG GACTTCGACG CGCACCCGCA CGCGACCGAC GACCCCACCG AGGTCGGGCC CGTGGACGCC GTCATCCTCG GCCTGAAGGC CCAGCACTAC GCGGCCTGCG GCCCGCTGCT GCGCCCCCTG ATGGGCCCCT CCACCATGGT CGTCGCCGCC CAGAACGGCA TCCCCTGGTG GTACTTCCAC GGGATCGAGG GGCCCCTGGC CGGGCACCGG ATCGAGAGCG TGGACCCCGG CGGCGCGGTC AGCCGGACGA TCCCGGTCGA GCGCGCCGTG GGCTGCGTGG TCTACGCGGC CACCGAGATC GCCGGACCCG GCGTGATCCG GCACATCGAG GGGACCCGGT TCTCCATCGG CGAGCCGGAC CGCTCCTCCT CCCGGCGCTG CCGCGACCTC CAGGCCGCGA TGGAGGCGGG CGGGCTCAAG TGTCCGATCG AGCGGGACCT GCGCGAGGAC ATCTGGGTGA AGCTGATGGG CAACATCGTC TTCAACCCGC TGAGCGCGCT GAGCCGCTCC ACCATGGTGC AGATCTGCCG CAACCGCCCG ACCAGGGAAC TGGCCCGCAC CATGATGGCC GAGACCCTGG ACGTGGCCGC GCGGCTGGGC GTGCGCCCCG CCGTCTCCAT CGACCGGCGC CTGGCAGGGG CCGAGCGCGC CGGCGACCAC CGCACCTCGA CCCTGCACGA CCTGGAACGC GGCCGCCCCA TGGAACTGGA CGTCATCCTG TCCGCCGTGG TGGAGCTGGC CGACCTGACC GGCGCCCCCG CCCCCGCTCT CCGGGCGGTG GACGCCGTCG CGGGACTGCT CAACGCCCGG ATGCTCGACG CGCCCGCCCC CGCGCCCGCG GCACCCTCCG TCGCCGCCTA G
|
Protein sequence | MRIAVLGAGA IGAYAGAALH RGGADVHLIA RGEHLRALRE RGVRVLSPRG DFDAHPHATD DPTEVGPVDA VILGLKAQHY AACGPLLRPL MGPSTMVVAA QNGIPWWYFH GIEGPLAGHR IESVDPGGAV SRTIPVERAV GCVVYAATEI AGPGVIRHIE GTRFSIGEPD RSSSRRCRDL QAAMEAGGLK CPIERDLRED IWVKLMGNIV FNPLSALSRS TMVQICRNRP TRELARTMMA ETLDVAARLG VRPAVSIDRR LAGAERAGDH RTSTLHDLER GRPMELDVIL SAVVELADLT GAPAPALRAV DAVAGLLNAR MLDAPAPAPA APSVAA
|
| |