Gene Information |
Locus tag | Ndas_3973 |
Symbol | |
ID | 9247844 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4752067 |
End bp | 4753644 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003681876 |
Protein GI | 297562902 |
COG category | |
COG ID | |
TIGRFAM ID | |
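The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 4752067
end_bp = 4753644

# Inclusive range, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which is not translated into an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 1578 bp and protein length of 525 aa.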
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0772213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGACCGAAC AGACCACGCG TCCCGTCCCG CGCCCCGACG ACAGCGGCCA CCCCGACGTG GTCGTCAGCC TCGGCACCGA CCACCACTCC TTCGACCGGC TGGTCCGCTG GATCGACGAC TACGCCCGGC GCCACCGGAC CCTGCGCTTC CTGGTCCAGC ACGGGCACAG CGCCGCCCCC GAGGTGGCCG CGGGCACCCC GTTCCTGCCC GGCGAGGAGC TCGGCGAGCA CATGCGCCGG GCCCGGGTGG TCGTCGCCCA CGGTGGACCG GGCACCATCG TCCAGGCCCG CCGCGCCGGA CGCCTGCCCA TCGTCGTCGC CCGCGACCCC GAACTGGACG AGCACGTCGA CGAGCACCAG CTCCTGTTCG TACGGCGTCT GGAGGAGGCG GGCCGGGTGC GTTCCTGCGC CACCCCCCAG CAGCTCTGCG CGCTCCTGGA CAGGGCGCTC GCCTCGCCCG CGGACTTCCG GGTGGACCCC GGCGACGGCG AGGGCACCGA GCGGGCCGCG CTGCGCGCCG GGGAGCTCAT CGACCTGCTC ACGCGGGGCC GGGGCGCGAC GGCCGAACCC GTCGCCACGG CCGCCGCCCC CTCAGTCCCC CGGGCCGCCG ACCACATCTT CGGCCGCACC GCCGCGCGGA CCGCCGCCTC CGGAGCCGAC GACACCGGTC CCCTGCCCGG CGTGACCGTG GTGGTGCCCA CCCGGGACCG GCCGGAACTG CTGCGCCGCA CCCTGCGGGC GATCAACGAG CAGGACTACT CCGGCCGCAT CACCACGATC GTCGTCTTCG ACAACGACCA GCCCGACCCC TCACTGGCCC GCTCCGACGG CGACCGCCCC GTGCGGGTGG TCACCAACAC CCTCACCCCC GGCCTGGCCG GGGCCCGCAA CACCGGTGTG CTCGCCGCCG ACACCGACCT GGTGGCCTTC TGCGACGACG ACGACACGTG GCTGCCCGGG AAGCTCCGGG CACAGGTCGG CGTCATGCTC GACGAGCCCG GCACGGAGAT GGTGTGCTGC GGCATCCGGG TGGTCTACGA CAGGGTCGAG GCGGTCCGCA GCCTGGACCG CACCAGCGTG ACCTTCGGCG ACCTGCTGGG GTCGCGCCTG ACCGAGCTGC ACCCGTCCAC GTTCCTCATC CGGCGCCGCG CCATGATCGA CGGCTGCGGA ACCGTCAGCG AGGAGATCCC CGGCAGCTAC GCCGAGGACT ACGAACTGCT GCTGCGCCTG GCCCGGCGCG GCCCCATCCG CAACATCCCC GAACCGGGCG TGCGGGTGCT GTGGCACCGC AGGTCGCACT TCTCCGGGCG CTGGCGGACC ATCTCCACCG CCCTGCGCTG GCTGCTGGAC CGCTACCCCG AGTTCGCTCT GGTGCCGCGC GGCCACGCGC GCGTGGCCGG GCAGATCGCC TTCGCCGAGG CCGCCTCCGG CCGCCGACGC GCGGCGCTGC GCTGGATCGG CACCACCGTC CGCAGCCGCC CGGCCGAGGC CCGCGCCTAC CTGGCGCTGG CGGTGGTGCT CGGGGTGCCC GCCGGGTGGG TCACGCGCGC GCTGCATCTG CGCGGCAGGG GCCTGTAA
Protein sequence | MTEQTTRPVP RPDDSGHPDV VVSLGTDHHS FDRLVRWIDD YARRHRTLRF LVQHGHSAAP EVAAGTPFLP GEELGEHMRR ARVVVAHGGP GTIVQARRAG RLPIVVARDP ELDEHVDEHQ LLFVRRLEEA GRVRSCATPQ QLCALLDRAL ASPADFRVDP GDGEGTERAA LRAGELIDLL TRGRGATAEP VATAAAPSVP RAADHIFGRT AARTAASGAD DTGPLPGVTV VVPTRDRPEL LRRTLRAINE QDYSGRITTI VVFDNDQPDP SLARSDGDRP VRVVTNTLTP GLAGARNTGV LAADTDLVAF CDDDDTWLPG KLRAQVGVML DEPGTEMVCC GIRVVYDRVE AVRSLDRTSV TFGDLLGSRL TELHPSTFLI RRRAMIDGCG TVSEEIPGSY AEDYELLLRL ARRGPIRNIP EPGVRVLWHR RSHFSGRWRT ISTALRWLLD RYPEFALVPR GHARVAGQIA FAEAASGRRR AALRWIGTTV RSRPAEARAY LALAVVLGVP AGWVTRALHL RGRGL
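The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 18 nucleotides of the gene sequence. The function name `translate_bacterial` and the constants are hypothetical, and the start-codon set is an assumption based on the NCBI definition of table 11, not something stated in this record.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); the amino acid
# assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons permitted by NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_bacterial(dna):
    """Translate a coding sequence, reading an alternative start codon as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is translated as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 18 nucleotides of the gene sequence in this record.
prefix = translate_bacterial("TTGACCGAAC AGACCACG")
print(prefix)
```

The result for this prefix is MTEQTT, matching the first six residues of the listed protein sequence.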