Gene Information |
Locus tag | Ndas_1777 |
Symbol | |
ID | 9245627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2175293 |
End bp | 2176552 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | Dyp-type peroxidase family |
Protein accession | YP_003679711 |
Protein GI | 297560737 |
COG category | |
COG ID | |
TIGRFAM ID | |
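The two length fields above follow from the coordinates. Below is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with that) and that the CDS ends in a single stop codon that is not counted in the protein. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids in an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2175293, 2176552)
print(length)                  # 1260
print(protein_length(length))  # 419
```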
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.822896 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.374196 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGGGG ACGCGCACGA ACCGCCCCGT TTCTCCCGGC GCGGTCTGCT CACCGCCGTG GGGGCGGCGG GGATCGCCGG AATCGGCGCC GGAGGGCTGA CGGGGTACGC CTCCGCCGCC GCGGGCGCGG AGGAACGGGA CGCCCCCGCC CTCGACCCCG CCCGCTCCAG GACCGGCTCG CGGGAGGGAC GCCCTCCCGC GCTGCTGACG CCCACGCCCG CGCACGTGCG GGTGGTGGCC GTGGACGTGA ACGCCCAAGA CGCGGCGGAC GTGCGCGTGG CGGCGCGCGA GGTCCTCGGC GCCTGGAGCC GTGAGGCCCG TTCCCTGCAC GAACGGGGCC CGGCCGCGCT CGGGGAGGGG GCGCCCTCCC AGGGCCTGCA CCCCGCCTCG CTGGGGGTCA CCCTCGGGCT GGGCCCCTCC CTGCTGGAAC GCGCGGGGCT GGCCGACCGG CGTCCGCCGC ACATGGAGGA CCTGCCCGCC TTCGACTCCG ACCGCCTCGA CCCCGCCTGG TGCGGCGGCG ATCTCATGCT GCACGTGGGA GCCGAGGACC CCCTGGTCCT CAGCTCCGCG GTCGACCACC TGCTGCGCGC GGCCCGGGGC CGGGTCGGGG TCCGCTGGTC GCTGTCCGCC TTCCAGCGGT CGGCCGCGGC CGCCGCCGAC CCCGCCGCCA CACCGCGCAA CCTCATGGGC CAGATCGACG GGACGGTCAA CCCGCGCCCC GACGAGGCCC TGTTCGCCAC CCAGGTCCTG GCCTCCCACA CCGAGCCGTC CCTGGCCTGG ATGGACGGGG GGTCCTACGT GGTCGTGCGG CGCATCCGCA TGCTGCTGGA CGACTGGTTC GCCCTGGAGA CCCGACGGCG CGAGGACGTC ATCGGACGCC GCCTGTCCGA CGGCGCGCCC CTGGGCGGGG ACCGCGAGCA CGACCGGCCC GACCTGTCGG CCAGGGACGG CGCGGGGGAG CCGGTCATCG CCCGTGACGC CCACATCAGG CTCGCCAGCC CCGAGAGCAC GCTGGGAGCA CGGATGCTGC GCCGGGGCTT CAGCTACGAC CTGGGCTGGG ACGCCGACGG CCGCAGGCAG GCGGGCCTGC TCTTCACCGC CTGGCAGGCC GATCCGCGCA CCGGGTTCAC GGCGGTGCAG CGCAACCTCG ACGAGGGCGG GGACGCGCTC GGCGCCTACG TCAGACACGA GGGCAGCGCA CTGTTCGCGG CGCCGCCGGT GCGGGAGGGG GAGCCCCGTG TGGCGCACAC CCTGCTGTGA
|
Protein sequence | MTGDAHEPPR FSRRGLLTAV GAAGIAGIGA GGLTGYASAA AGAEERDAPA LDPARSRTGS REGRPPALLT PTPAHVRVVA VDVNAQDAAD VRVAAREVLG AWSREARSLH ERGPAALGEG APSQGLHPAS LGVTLGLGPS LLERAGLADR RPPHMEDLPA FDSDRLDPAW CGGDLMLHVG AEDPLVLSSA VDHLLRAARG RVGVRWSLSA FQRSAAAAAD PAATPRNLMG QIDGTVNPRP DEALFATQVL ASHTEPSLAW MDGGSYVVVR RIRMLLDDWF ALETRRREDV IGRRLSDGAP LGGDREHDRP DLSARDGAGE PVIARDAHIR LASPESTLGA RMLRRGFSYD LGWDADGRRQ AGLLFTAWQA DPRTGFTAVQ RNLDEGGDAL GAYVRHEGSA LFAAPPVREG EPRVAHTLL
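The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The sketch below is a hedged illustration using only the Python standard library. It assumes the sequence is given on the coding strand, that spaces are only display grouping, and that an alternative start codon such as GTG is read as methionine in the first position. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Only the first 30 bases of the record are used here to keep the example short.

```python
from itertools import product

# Amino acid assignments of NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 allows as initiators; all are read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is decoded as methionine
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


prefix = "GTGACGGGGG ACGCGCACGA ACCGCCCCGT"  # first 30 bases of the gene sequence
print(translate(prefix))  # MTGDAHEPPR
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the protein sequence and the GC content field listed in the record.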