Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0747 |
Symbol | |
ID | 9244589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 915815 |
End bp | 917065 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | deoxyguanosinetriphosphate triphosphohydrolase |
Protein accession | YP_003678698 |
Protein GI | 297559724 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.360686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.792853 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAACA ATCCCCGCGA GGGCGGAGCG CTCGGCTACA CCGACCAGGA CACCGAGCGC ATGGCCCTGG AGAACCGCAA GAACCGGGCC CGCGACCCCT TCGAGCGGGA CCGGGCCCGG GTCCTGCACA GCGCCGCCCT GCGCCGCCTG GCCGCCAAGA CCCAGGTCGT CCAGCCCGGT GTGAGCGACT TCCCGCGCAC CCGCCTCACC CACTCCCTGG AGTGCGCCCA GATCGGCCGC GAACTCGGCC AGGCCCTGGG CTGCGACCCC GACCTGGTGG AGGCGGCCTG CCTGTCCCAC GACCTGGGCC ACCCGCCCTT CGGCCACAAC GGCGAGCGCG CCCTGGACGA GGCCGCCGCC GACTGCGGCG GGTTCGAGGG CAACGCCCAG AGCCTGCGCC TGCTCGTGCG CCTGGAGGGC AAGGTCATCG ACCCCGACGG GCGCAGCGCG GGGCTCAACC TCACCCGCGC CACCCTCGAC GCCACCGTCA AGTACCCCTG GCTCCGGGGC GAGGGCGGCG ACACCCACAA GTTCAACTGC TACCCCGACG ACACCGAGGT GTTCGACTGG CTGCGCAGGG ACGCGCCCCC GGGCCGCACC TGCTTCGAGG CCCAGGTCAT GGACTGGGCC GACGACGTCG CCTACTCCGT GCACGACGTC GAGGACGCCC TGCACGCCGG GCTGGTGGAC CCCGCGGCCC TGCGCGGCGC CGCCGAGCGC GCCGAGGTCG TGCGGATCGC CGCCGCCGAC TACTGCGACG CCGACCCGGC CGAACTCGAC GAGGTCTTCA CCGACCTGAT CGCCCACCCC GCGTGGCCGC GCGAGTTCAC CGGGGACCTC GCCTCGCTCG CCGCGCTCAA GAACCTCACC AGCGGGCTCA TCGGCCGCTT CTGCCGGGCC GCGGAGGAGG CCACCCGCGC CGCGTACGGT CCCGGGCGCC TCACCCGTTA CGGCGGCGAC CTCATCGTGC CCCGCCGCCC CCTGCTGGAG TGCGCCCTGC TCAAGGCGGT CGCCGCCCAC TTCGTGCTCT CGCGGGAGGA GGCCCGGGTC TACCAGGCCG AGGAACGCCG CCTGATCACC GAGCTGGTCG GCCTCCTGTG GAAGAACGCC CCCGAGGGCC TGGACCCGCA GTTCCGCGCC GCCTTCACCG GGGCCGCCGA CGACTCCGCC GCCCTGCGCG TGGTCATCGA CCAGGTCGCC TCGCTCACCG ACACCTCCGC GACCGCGCTG CACGCCAGCC TCACGGGCTG A
|
Protein sequence | MTNNPREGGA LGYTDQDTER MALENRKNRA RDPFERDRAR VLHSAALRRL AAKTQVVQPG VSDFPRTRLT HSLECAQIGR ELGQALGCDP DLVEAACLSH DLGHPPFGHN GERALDEAAA DCGGFEGNAQ SLRLLVRLEG KVIDPDGRSA GLNLTRATLD ATVKYPWLRG EGGDTHKFNC YPDDTEVFDW LRRDAPPGRT CFEAQVMDWA DDVAYSVHDV EDALHAGLVD PAALRGAAER AEVVRIAAAD YCDADPAELD EVFTDLIAHP AWPREFTGDL ASLAALKNLT SGLIGRFCRA AEEATRAAYG PGRLTRYGGD LIVPRRPLLE CALLKAVAAH FVLSREEARV YQAEERRLIT ELVGLLWKNA PEGLDPQFRA AFTGAADDSA ALRVVIDQVA SLTDTSATAL HASLTG
|
| |