Gene Information |
Locus tag | Ndas_1015 |
Symbol | |
ID | 9244861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1242032 |
End bp | 1243333 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | amidase, hydantoinase/carbamoylase family |
Protein accession | YP_003678964 |
Protein GI | 297559990 |
COG category | |
COG ID | |
TIGRFAM ID | |
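The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene span includes the stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1242032, 1243333)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Both results match the Gene Length (1302 bp) and Protein Length (433 aa) rows. The strand does not affect this calculation, since a minus-strand gene covers the same coordinate span.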
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.736306 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAC ACGAACAGGA CCCGTCACAG GACGCGGCAC CCGCCGACGC CCGCTTCCTG GAGGACTTCC GGGCCATGAG CGCCTTCGGC GCCACGCCGT CGGGGGGCGT CGACCGGCAG GCCGCCACCG GGCCCGACAT CGCCCAGCGG CGGTGGCTGG AGGAACTGCT GGTCGAGCGG GGGCTGCGGG TGGTCTACGA CCGCATCGGC AACCAGTTCG GCCTGCTGGA GCGGGTACCG GGCGCCCCCT ACGTGGTCGT CGGCTCGCAC ATGGACTCCC AGCCCACGGC GGGGCGCTAC GACGGGGCCT ACGGCGTGCT GGCCGCCGCC CACGCCGTCT TCCGGATCGC CGAGCAGGAC GCCGCCGCCG GTGCCCCCGC GCCCGTGTAC AACCTCGCGG TCGTGAACTG GTTCAACGAG GAGGGCTCGC GCTTCACGCC CTCCATGATG GGCAGCGCCG TCTACACCGG CGCGCTCCCG CTGGAGACGG CACTGGCCGC GCGCGACGCG GCCGGGGTCA CGGTGGCCGA GGCCCTGGCC CCGACGGGCT TCCTGGGCGG GGACGACGGC CCCGAGGCCG CCTTCTGCGC CGAGATCCAC GTCGAGCAGG GGCGCGTCCT GGAGGAGTCC TCCACCACGA TCGGCCTGGT CAGCGCCAGC TGGGCCGCGC GCAAGTACGC GGTCACCGTC CACGGGGAGC AGGCGCACTC GGGGGCGACC GTGATGGCCG ACCGCCGCGA CGCGCTCGTG GGCGCCTCCA TGCTGGTCGT GGCCGCGCGC GAGCTGGCCG ACCGGTTCCC CGGGGTCCTG CACACGGCCG TGGGACAGTT CGACGTCTAC CCGAACTCGC CGGTGGTGGT GCCCTCGCGC GTCGAGCTGC TGCTCGACCT GCGCTCGCAC GACGAGGAGG TGCTCGCCGA GGCCGACCGG CTGTTCCAGG AGCAGGTCGC CCGCATCGAG GCGGCGGCCT CCGTCACCGT GGAGCAGACC CTCTCCCACT CCTGGGGCGT CAACCCCTAC CAGCCCGAGG GCGTGGCCCT GGCCCGCGCC AGCGCCCGGA GCCTGGGACT GAGCAGCGGC GAGGTCATGA CCGTGGCCGG GCACGACTCG ATCAACATGA AGGAGCGCGT GCCCACGGTC ATGCTCTTCG TGCCCTCGGT CGGGGGCGTG TCCCACAACG AGGGCGAGTA CACCGAGGAC TCCGACCTGG TCGCGGGCCT GGCGGTGCTC ACCGACGTGG TCCGGCGGCT CGGCGCGGGC GAACTGGCCG CCGACGGGTC CTACACGCCC GGACGCGCGT GA
Protein sequence | MTAHEQDPSQ DAAPADARFL EDFRAMSAFG ATPSGGVDRQ AATGPDIAQR RWLEELLVER GLRVVYDRIG NQFGLLERVP GAPYVVVGSH MDSQPTAGRY DGAYGVLAAA HAVFRIAEQD AAAGAPAPVY NLAVVNWFNE EGSRFTPSMM GSAVYTGALP LETALAARDA AGVTVAEALA PTGFLGGDDG PEAAFCAEIH VEQGRVLEES STTIGLVSAS WAARKYAVTV HGEQAHSGAT VMADRRDALV GASMLVVAAR ELADRFPGVL HTAVGQFDVY PNSPVVVPSR VELLLDLRSH DEEVLAEADR LFQEQVARIE AAASVTVEQT LSHSWGVNPY QPEGVALARA SARSLGLSSG EVMTVAGHDS INMKERVPTV MLFVPSVGGV SHNEGEYTED SDLVAGLAVL TDVVRRLGAG ELAADGSYTP GRA
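The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be stripped before any length or composition check. The following is a minimal sketch, assuming the sequence text has been copied out of the record as a plain string. The helper names are hypothetical, and the short example string is just the first two blocks of the gene sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First two display blocks of the gene sequence, used as a small example.
example = "ATGACCGCAC ACGAACAGGA"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the Gene Length row and `gc_fraction(...)` with the GC content row of the record.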