Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3234 |
Symbol | |
ID | 9247091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3866205 |
End bp | 3867500 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_003681146 |
Protein GI | 297562172 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTCA TCGAGATCAT CCGAGCCAAG CGCGACGGGG GAGAGCTGAG CCCCGGCCAG ATCGACTGGG TGATCGACGC CTACACCCGC GGCGAGGTGG CCGAGGAGCA GATGTCGGCG CTGGCCATGG CGATCTTCCT GCGCGGGATG GACCGGGCCG AGGTGAGCCG CTGGACCGAG GCGATGCTGG CCTCGGGGGA GCGCCTGGAC TTCTCCGACC TGGCGCGGCC CACCACCGAC AAGCACTCCA CGGGCGGGGT GGGCGACAAG ATCACCCTGC CGCTCACACC CACCGTCGCG GCCTGCGGGG CGGCCGTGCC GCAGCTGTCG GGGCGCGGGC TCGGGCACAC CGGCGGGACC CTGGACAAGC TGGAGTCGAT CCCCGGGTGG CGGGCCTCGC TGTCCACCGG CGAGATGCGG GAGGTGCTGG ACTCCACGGG CGGGGTGATC TGCGCGGCCG GGTCCGGGCT GGCCCCGGCC GACCGCAAGC TCTACGCCCT GCGCGACGTG ACCGGCACGG TCGAGTCGAT CCCGCTGATC GCCGCGTCCA TCATGAGCAA GAAGCTCGCC GAGGGCACCG GCGCGCTGGT CCTGGACGTC AAGGTGGGTT CGGGGGCGTT CATGAAGGAC GCCGACTCCG CGCGCGAGCT GGCCCGCACG ATGGTCGACA TCGGAAACGA CCACGGCGTG CGCACGGTCG CGCTGCTCAC CGACATGTCG GTCCCGCTGG GCAGGCAGGT GGGCAACGCC CTGGAGGTGG CCGAGTCCGT GGAGGTGCTG TCGGGCGGCG GGCCCGCGGA CGTGGTGGAG CTGACCGTGG CCCTGGCCCG GGAGATGCTG GCCGCGGCCG GGCTCGTCCC CGGTGAGGGC GGGGTCAAGG ACCCGGCCGA GGCGCTGCGG GACGGCAGCG CGCTGGAGTC GTGGAAGCGG CTGGTCCGGG CACAGGGCGG GGACCCGGAC GCGCCGCTGC CGGTGGCCGC CGAGCGCCGG GTGGTGCTCG CTCCGGCCTC GGGGACGGTG ACCCGGCTGG ACGCCTACCA GGTGGGGCTG GCCGCGTGGC GGCTGGGCGC GGGCCGGGCG CGCAAGGAGG ACGCGGTGTC GTTCGGGGCG GGGGTGACCC TGCACGCCAA GCCGGGGGAG TCCGTGCAGG CCGGGGAGCC GCTGTTCACG CTGCACGCGG ACGAGGCGGA GCGGTTCGAG CGGGCGGCCG AGGCGCTGGA GGGCGCCTTC GACATCGAGC CGGAGGGCGG GGCGGGCTAC GAGGCCCGGC CGCTGGTGAT CGACCGGATC GCCTGA
|
Protein sequence | MDVIEIIRAK RDGGELSPGQ IDWVIDAYTR GEVAEEQMSA LAMAIFLRGM DRAEVSRWTE AMLASGERLD FSDLARPTTD KHSTGGVGDK ITLPLTPTVA ACGAAVPQLS GRGLGHTGGT LDKLESIPGW RASLSTGEMR EVLDSTGGVI CAAGSGLAPA DRKLYALRDV TGTVESIPLI AASIMSKKLA EGTGALVLDV KVGSGAFMKD ADSARELART MVDIGNDHGV RTVALLTDMS VPLGRQVGNA LEVAESVEVL SGGGPADVVE LTVALAREML AAAGLVPGEG GVKDPAEALR DGSALESWKR LVRAQGGDPD APLPVAAERR VVLAPASGTV TRLDAYQVGL AAWRLGAGRA RKEDAVSFGA GVTLHAKPGE SVQAGEPLFT LHADEAERFE RAAEALEGAF DIEPEGGAGY EARPLVIDRI A
|
| |