Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0799 |
Symbol | |
ID | 9244644 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 982877 |
End bp | 984046 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | dihydroorotate dehydrogenase |
Protein accession | YP_003678749 |
Protein GI | 297559775 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.795428 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTGT TCTACCAGAT GCTCTTCCGC TCGGTCCTGC GCCACATGGA CGCGGAGAAG GTCCACAGGC TCAGCTTCGC CGCGCTGCGC GGCGTGACCT CCCTGCCCGC CGTGGCCTCC GCCATGAAAG GCGTGCTCGG GCCGCGCGAG CCGGAGCTGA CCGTGCACGC CCTCGGGCAG GAGTTCCCCG GCCCGCTGGG GCTCGCGGCC GGTTTCGACA AGAACGCGGA GAGCCCCTCC GGGCTGGCGG CGCTCGGCTT CGGCTTCGTG GAGGTGGGCA CCGTCACCGC CCAGCCCCAG CCGGGCAACC CGCGGCCGCG CCTGTCCCGG CTGGTGGCCG ACCGCGCGAT CGTCAACCGC ATGGGCTTCA ACAACGAGGG GTCGGCCCTG GTCGCAGAAC GCCTCCACCA CCGGCGCGGC GGCCGCCGTC CCGTGCTCGG CGTCAACATC GGCAAGACCA AGGTCACGCC CGAGGAGGAG GCCCCCGCCG ACTACGCCCT CAGCGCCCGG CGCCTGGCCC GCTACGCCGA CTACCTGGTG GTCAACGTCA GCTCGCCCAA CACCCCCGGG CTGCGCAACC TCCAGGGCGT GGAGCGGCTG CGCCCGCTCC TGGCCGCCGT GCGCGAGGCC ATGGCCGAGG CCGGTCGCCC GGACCTGCCC CTCCTGGTGA AGATCGCCCC CGACCTCGCC GACGAGGACG TCGACGCCGT CGCCGACCTC GCGCTGGCCG AGGGACTCGA CGGCATCATC GCCACCAACA CCACCATCTC CCGTGAGGGC CTGACCACCC CCGCGGCGCA GGTGGAGGCG GCCGGTGCGG GCGGCCTGTC CGGCGCCCCC CTCAAGCGGC GCTCCCTGGA GGTGCTGCGC CGCCTGCGCG CCCGTGTGGG CGACCGGGTG ACCCTGATCG CCGTCGGCGG CATCGAGACG CCCCTGGACG CCTGGTTCCG CATCCGGGCG GGGGCCAGCC TCGTGCAGGG CTACACCGGC CTCATCTACG GCGGTCCGCT CTGGCCCCGC CGCATCAACC GCGGCCTCGC CCGGCTGGTG CGGGCCTCCG GCCACCGTTC CATCAACGAG GTCGTCGGAG CCGACGTCCC CTCCCCGGCC GCCCCGGCGG CCGACACCGG GCAGGGGGCG GACCCCGCCG CGGCCACAGC CAAGGGCTGA
|
Protein sequence | MTVFYQMLFR SVLRHMDAEK VHRLSFAALR GVTSLPAVAS AMKGVLGPRE PELTVHALGQ EFPGPLGLAA GFDKNAESPS GLAALGFGFV EVGTVTAQPQ PGNPRPRLSR LVADRAIVNR MGFNNEGSAL VAERLHHRRG GRRPVLGVNI GKTKVTPEEE APADYALSAR RLARYADYLV VNVSSPNTPG LRNLQGVERL RPLLAAVREA MAEAGRPDLP LLVKIAPDLA DEDVDAVADL ALAEGLDGII ATNTTISREG LTTPAAQVEA AGAGGLSGAP LKRRSLEVLR RLRARVGDRV TLIAVGGIET PLDAWFRIRA GASLVQGYTG LIYGGPLWPR RINRGLARLV RASGHRSINE VVGADVPSPA APAADTGQGA DPAAATAKG
|
| |