Gene Information |
Locus tag | Ndas_5234 |
Symbol | |
ID | 9249127 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 386907 |
End bp | 387836 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | Inosine/uridine-preferring nucleoside hydrolase |
Protein accession | YP_003683120 |
Protein GI | 297564147 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
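The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 386907
end_bp = 387836

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# which is not counted in the protein length.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 930 309
```

Under these assumptions the results agree with the listed gene length (930 bp) and protein length (309 aa).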
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.604929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000866911 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTGTGT TCGTCGACTG CGACCCGGGG ATCGACGACG CCGTCGCGCT CGCCTACCTG GCCGCTCGGC CCGAGGTGGA GATCGTCGGC GTCGGGGCGG TCTTCGGCAA CAACAGCGTG GACGTCACGG CCGACAACGC GCTGCGGCTG CTGGAGCTGT ACGGCCGCCC GGACGTCCCG GTGGCGGTGG GCGCCGCGCG TCCGCTGGTG CAGCCGCCGA AGCTGGCCGC GCACGTGCAC GGAGGCAACG GACTGGGGGA CGTGGAGCTG CCCGAGCCCG CGGGTCGGCC GGTGTCAGAG ACGGCGGCCG GGCTGCTGGT GCGCCTGGTC CGGGAGAACC CGGGCGGGAT CGACGTGCTG GCCGTGGGGC CGCTGACGAA CCTGGCGATC GCGCTGGCCC TGGAGCCGGA GCTGCCGAGG CTGGTGCGGC GCCTGGTGGT GATGGGCGGT GCGGTGCGCG TGGCGGGCAA CGTGTCCTCA CACGCCGAGG CCAACATCAG CAACGACCCC GAGGCGGCGG AGGCGGTGTT CGCGGCCGGG TTCGACCTGG ACCTGGTGGC GCTGGACATC ACCATGAAGA CGGTGGCCAC CACCGAGTGG CTGGCGGAGC TGGCGACGGT CGCGGGCGAG CGCGCCGAGC GCACGTCGGC GTTCCTGGAC TTCTACGCCG ACTTCTACTC GGGGATCTTC GGGGTCCGCC AGTGCGCGAT GCACGACCCG CTGGCCGCGG CGGTGCTGGT GGACCCGCAC CTGGTGACGG AGTCCTTCGA GGCCCCGGTG CAGGTGGAGC TGACGGGAAC GCTGACGCGG GGGATGACCG TGGCGGACCT GCGCCCGCGC CCGCGGGACG ACGAGCGCCG CCCCGCGCGG GTGGTCACGG GCGTGGCCGA GGCCGAGTTC CTGGGGCGGA TGCTCGACTC GCTGCGCTGA
|
Protein sequence | MRVFVDCDPG IDDAVALAYL AARPEVEIVG VGAVFGNNSV DVTADNALRL LELYGRPDVP VAVGAARPLV QPPKLAAHVH GGNGLGDVEL PEPAGRPVSE TAAGLLVRLV RENPGGIDVL AVGPLTNLAI ALALEPELPR LVRRLVVMGG AVRVAGNVSS HAEANISNDP EAAEAVFAAG FDLDLVALDI TMKTVATTEW LAELATVAGE RAERTSAFLD FYADFYSGIF GVRQCAMHDP LAAAVLVDPH LVTESFEAPV QVELTGTLTR GMTVADLRPR PRDDERRPAR VVTGVAEAEF LGRMLDSLR
|
| |
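The sequence fields are displayed in space-separated blocks of 10 characters. As a rough sketch of how such a field might be processed, the Python below strips the spacing, computes GC content, and translates the coding strand. The function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares for all non-start positions; the sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs only in
# which codons may serve as start codons (not handled in this sketch).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = clean_sequence("ATGCGTGTGT TCGTCGACTG CGACCCGGGG")
print(translate(prefix))  # MRVFVDCDPG
```

The translated prefix matches the first block of the protein sequence in the record. Applying the same functions to the full gene sequence field should allow the listed GC content and protein sequence to be checked in the same way.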