Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5198 |
Symbol | |
ID | 9249091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 342023 |
End bp | 343303 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003683084 |
Protein GI | 297564111 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.603506 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACGT CCTTCGCCTC CCCCGCGGTG GCGGCACCGC CCCGGCCCGC CGTCGGCGCG ACTCGCCGCA CCCGGGTCCT GATCGGTACC GACACCTATC CCCCCGACGT GAACGGCGCC GCGTACTTCA CCGCCCGCCT CGCCCGCGGT CTGGCCGCGC GCGGAGCGCG GGTGCACGTG GTGTGCCCCT CCCCCGAGGG CGCCCCGTAC ACGGCGGAAC GCGGCGGGGT GGTCGAGCAC CGGCTGCGCT CGGTGTCCTC CCTGGTCCAC GACAGCGTGC GGCTCGCGGT CCCGCTGGGC GTGCGCGGCC ACCTGGACCG GCTCCTGGAC CGGGTGCGGC CGGACGCCGT CCACATCCAG AACCACTTCC TCGTCGGCCG GATGCTGGCC GCCGCCGCGC ACGCCCGAGG CGTGCCCGTG GTCGCCACCA ACCACTTCAT GCCGGAGAAC CTCTTCGACT ACGTGCACGT GCCCGCGCCG CTGCGCCCGC ACGCGGCCCG GCTGGCCTGG TGGGACCTGG GCGCGGTGCT GTCCCGGGCC GAGCACGTGA CCACGCCCAC CCCGGCGGCG GCGCGGCTGC TGGTCGACCA GGGGTTCACC CGGCCGGTCG AACCGGTCTC GTGCGGGATC GACCTGGACC GGTTCAGCCC GCTGGACGGC GGCGCGGCCA CCCGGCGGCG GCTGCGCGCC CGGCTGGGCG TGCCGGACCA CAGGACGGTG CTGTTCGTGG GGCGGCTGGA CGAGGAGAAG CGCGTGGACG AACTCGTGCG CGCGGTGGCC CTGACCGACG GGGTGCAGCT CGTGCTGGCC GGGCACGGCG CGCACCGGGC GCGGCTGGAG GAGCTGGCGG CGGAGGTCGG CGCGGCCGAC CGGGTGGTGT TCCTGGGCTT CGTGCCGCAC GCCGACCTGC CCGACGTGTA CCGGTGCGCT GACGTGTGGG CCATCGCCGG GACCGCCGAA CTCCAGAGCA TCGCCACCCT GGAGGCGATG GCGAGCGGCC TGCCGGTGGT GGCGGCGGAC GCCATGGCGC TGCCGCACCT GGTGGAGGAG GGCGGCAACG GGTACCTGTA CCCGCCCGGC AGCCCGGGGG CGCTGGCCGC ACGCGTGGAG TCGGTGGTCG CCGACGAGGG CCGACGGCTC GGGATGGGCG CGCGCAGCCG CGACATGGCG GAGCTGCACC GGCTGGAGGA CTCGCTGGAG CGGTTCGAGC GGATCTACCG CGAGGCGTCC GCCGGTGCGG GGGCCGGCGC CGGAGCCGGT GCGACGCGCA GCGGGCGGTG A
|
Protein sequence | MSTSFASPAV AAPPRPAVGA TRRTRVLIGT DTYPPDVNGA AYFTARLARG LAARGARVHV VCPSPEGAPY TAERGGVVEH RLRSVSSLVH DSVRLAVPLG VRGHLDRLLD RVRPDAVHIQ NHFLVGRMLA AAAHARGVPV VATNHFMPEN LFDYVHVPAP LRPHAARLAW WDLGAVLSRA EHVTTPTPAA ARLLVDQGFT RPVEPVSCGI DLDRFSPLDG GAATRRRLRA RLGVPDHRTV LFVGRLDEEK RVDELVRAVA LTDGVQLVLA GHGAHRARLE ELAAEVGAAD RVVFLGFVPH ADLPDVYRCA DVWAIAGTAE LQSIATLEAM ASGLPVVAAD AMALPHLVEE GGNGYLYPPG SPGALAARVE SVVADEGRRL GMGARSRDMA ELHRLEDSLE RFERIYREAS AGAGAGAGAG ATRSGR
|
| |