Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3579 |
Symbol | |
ID | 9247448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4291513 |
End bp | 4292718 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Dyp-type peroxidase family |
Protein accession | YP_003681486 |
Protein GI | 297562512 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.519759 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGTG AGTCCCGGCT CAGCCGCAGA GGGCTGCTCC TGGGCGGCGC GGCGGCCGGT GTCGGCGCGG CCGGGGGCGC GCTCGCCCAC CGGTGGGCCG CGGAACCTGC GGCGGCCCCC CAGGCGCCCC CGCTCAACGG CACGCTGACC GTACCGTTCC ACGGCGTGCG GCAGGCGGGC GTGGAGACCC CGCCGCAGAC GCACGGCACC TTCCTCGCGC TGGACCTGGA ACCGGGGACG GACGCGGACG GGGTCGGCCG GCTCCTGCGG CTGCTCAGCG ACGACGCGGC CCGCCTGTCC CGGGGCGAGC CCGCCCTGGC CGACACCGAG CCCGAACTCG CCCTGGTCCC GGCCCGGCTC ACCACCACGT TCGGCTTCGG GCGGGGTCTG GTGGAGCGGG TGGACCCCGA CGCGGTACCG GAGTGGCTCG GGCCACTGCC CGAGTTCGGG CACGACCGGC TCGACCCGGC CTGGGGCGGG GCGGACCTGC TGCTCCAGGT GTGCGCGGAC GACCCCGTCA CCGTCTCCCA CGCGGTGCGG ATGATGCTCA AGGACGCGCG GGCCTTCGCG CGGGTGCGGT GGACGCAGAG CGGGTTCCGC CGGGCCCACG GCTCCCAGCC CGAGGGCACC AGCATGCGCA ACCTGATGGG GCAGGTGGAC GGGACCGTCA ACCCGGCGCC GGGAACCGGG GACTTCGACC GGCTGGTCTG GGGCGGGAAC CCACCGCGGT GGCTCAGGGG AGGCACGAGC CTGGTGCTGC GCCGTATCGC CACCCACCTG GACACCTGGG ACGAGCTGGA CCGGCCCGCC CGCGAGGCGG TCATCGGCCG CCGCCTGGAC AACGGCGCGC CGCTGACCGG TACCGAGGAA CACGACGAGG CGGACCTGGA GGCCACGGAC GCCTCCGGGC TGACCGTCAT CGCGGACTTC GCGCACATCA GGCGGGCCCG CACCGACGAC CCGGACCAGC GGATCTTCCG GCGCGCCTAC AACTACGACG AGCGCGGCTC GGGCGGCGAG GAGGCGGGGC TGCTGTTCGC CTCCTTCCAG GCCGACCCGC TGCGCCAGTT CGTGCCCATC CAGCGGCGCC TGGACGAGCT GGACCTGCTC AACGAGTGGG TGACCGCCGT GGGCTCGGCG GTGTTCGCCG TCCCGCCGGG CTGCGAGGAG GGAGGGTACG TGGGGCAGGC CCTGCTGGAG GGGTGA
|
Protein sequence | MGGESRLSRR GLLLGGAAAG VGAAGGALAH RWAAEPAAAP QAPPLNGTLT VPFHGVRQAG VETPPQTHGT FLALDLEPGT DADGVGRLLR LLSDDAARLS RGEPALADTE PELALVPARL TTTFGFGRGL VERVDPDAVP EWLGPLPEFG HDRLDPAWGG ADLLLQVCAD DPVTVSHAVR MMLKDARAFA RVRWTQSGFR RAHGSQPEGT SMRNLMGQVD GTVNPAPGTG DFDRLVWGGN PPRWLRGGTS LVLRRIATHL DTWDELDRPA REAVIGRRLD NGAPLTGTEE HDEADLEATD ASGLTVIADF AHIRRARTDD PDQRIFRRAY NYDERGSGGE EAGLLFASFQ ADPLRQFVPI QRRLDELDLL NEWVTAVGSA VFAVPPGCEE GGYVGQALLE G
|
| |