Gene Information |
Locus tag | Ndas_3764 |
Symbol | |
ID | 9247633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4519811 |
End bp | 4523224 |
Gene Length | 3414 bp |
Protein Length | 1137 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | YP_003681668 |
Protein GI | 297562694 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
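The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function name `check_cds_record` is hypothetical and not part of the database.

```python
def check_cds_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return simple consistency checks for one CDS record.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    span = end_bp - start_bp + 1  # inclusive span on the replicon
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": span % 3 == 0,
        # One codon is the stop, so it does not contribute a residue.
        "protein_matches": span // 3 - 1 == protein_length_aa,
    }


# Values copied from the Ndas_3764 record.
checks = check_cds_record(4519811, 4523224, 3414, 1137)
print(checks)
```

For this record the span is 3414 bp, which is 1138 codons, and 1138 minus the stop codon gives the listed 1137 aa.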
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCTGACC AGGATTTCGC CCGACACGTC GTCACCGCGG TCATCGTCTC CCACGACGGC TCCCGCTGGC TGCCCGAGAC GATGCAGGCG CTGCGCGGCC AGTCCTGGCC CGTGCAGCGG GCCGTCGCCG CCGACACCGG CAGCACCGAC GACAGCGCCG AGGTCCTGGC CCGGTTCCTG CCCGCCGACG CCGTCGTGGA GCTGCCCGCC GACACCGGGT ACGGCGACGC CGTCCGCGCC GCCCTGGAAC TGCCGCGCTC CACCAGCGCC GTACGCGGCT TCGATGAGGA CGCCACCGAG TGGATCTGGC TCATCCACGA CGACCTCACC CCGGACCGGG ACGCCCTCGC CCACCTGCTG GACGCCGCCG ACCAGGACCC CCGCGCCGCC GTCCTGGGCC CCAAGCTGCG CGATTGGTTC GACCGCCGCC TCCTGGTGGA GGCGGGCGTG ACCATCGACG GCGCGGGCCG CCGCGAGACC GGCCTGGAGC AGCGCGAGTT CGACCACGGC CAGCACGACG GCACACGCCA GGTCCTGGCG GTCTCCAGCG CGGGCATGCT CGTGCGCCGC GACGTGTGGG ACCGCCTCGG CGGCTTCGAC CCCTCCCTGC CGCTGTTCCG CGACGACATC GACTTCTGCT GGCGGGTGGG CGGCGCCGGA CTGCGCGCCG TCCTCGTCAC CGACGCGGTC GCCTACCACG CCGAGGCCTC CGCCCGGCGC CGCCGGCCCA TCTCCGCCAC CCGCAACCAC CCGCGCCGGG TGGACCGCCG CCACGCCGTC TTCGTGCTGC TGGCCAACCT GCCCCTGGGC GGCATGGTGG CCGCGCTGCT GCGCAACTCC GTGGCCTCGC TGCTGCGGGT GGTGGGGTAC CTGCTCATCA AGCAGCCCGC CAACGCCCTG GACGAGGCCG CCGCGATCAC CCTGGTCTAC CTGCGGCCGC TCCGGCTGAT GCGCGCCCGC TTCCGGCGCC GCCGCGACCG CCGCCGCACC TACAGCGCCA TCCGGCCCTT CCTCGCGCGC GGCGTGGCCA TGCGCCAGGT CCACGACGTG CTCACCGGGA TCCTGTCCGG GGAGCCCGTG CGCAACACCC CCGGCCGTCA CCAGGCCGTC ACCGCGCCGC CCGCCGCCGA GGACGAGGAC GAGATCCCCA CGGACACCGC CGGCTTCGTG CGCAGCGTGC TCCTGCGGCC CGGGGTGCTG CTGGTCCTGG CCCTGGCCGC GGTGGCCCTC GTCGCCGAGC GCTCCCTGCT CCTGGGCGAC CTCCTGGCGG GCGGCGCCCT GCCCCCGGTC GCGGGCGGCG CGGGCGACCT GTGGAACCTG TACCTGTCCG GCAGCCCCGA CAGCGGGCTC GGCGCGGCCG ACCCCGTCCC GCCCTACGTC GGACTGCTCG CCCTGCTGTC CACGCTCACC CTGGGCAAGC CCTGGCTGGC CGTCACGATC GTCCTGCTGG GCTGCGTGCC CCTGTCCGGG CTGACCGCCT ACCTGCTCGC CCGCGAGGTG CTGGGCTTTC GCCCCGCACG CCTGTGGACG GCCGCGGCCT ACGCGCTGCT GCCGGCGGCC ACCGGCGCGG TCGCCCAGGG CCGCCTGGGC ACCGCCCTGG TCCACGTCCT GCTGCCGGTG CTGGGACTGC TGCTGGTGCG GCTGGTGTCG ATGCCGCCCA AGCCCTCGCG CCGAGCGGCC TGGGGAGTGG GCCTGGTCCT GACGGTCGCC ACCGCGTTCG TGCCGATGGT GTGGCTCCTG TCGCTGGTCA CCGGCGTGCT GGTGGCCGTC GCCTTCGGCC ACCTCGGCCG CCGGATCTAC 
GTCAGCGTCG CGCTGGGCCT GGCCGTCCCG CTCGTGCTGC TGATGCCGTG GACGCTCGAA CTGCTCCTGC ACCCCAGCCT GTGGCTGCTG GAGGCGGGAC TGCACCGCCC CGAGCTGTCG GCGCCGGGCC CGACCCCGCA GGAACTGCTC ATGCTCTCGC CGGGCGGCCC CGGGACACCG CCGTTCTGGG TCACGGCCGG GTTCGCCGTG GCCGCGCTGT GCTCCCTGCT GCTGCTCCGC AACCGGATGC TGGCCGCGGC GGGGTGGTCG CTCGCCCTGT TCGGGATCCT CGTGGCCGTC CTGACCAGCC GGGTGGTCGT CGAGCCCTAC TACGGCGGCC CCCCGGCGCC GGTCTGGCCC GGGGTCGCCC TGACCTTCGC CGCCACGGCG GTGCTGCTGT CGGCGGCGAC CGCCGCGCGC TCCTTCGGCG ACATGGTGCG CCTGGGCGGG CCCAGGCGGG TCTTCGCGCT CGCCGTGGGA CTGCTCGCCC TGGCGACCCC CGTCTGCGCC GCGGGCGTGT GGATGTGGGA GGGCGTGCGG GGGCCGGTCA CCGCGCACGC CGAACCCGTC GTCCCCGGCG CGCTGACCGG GGCCGGGACC GTCGAGGGCG GCGGGTCCGA CGGGGCGGGG ACCCGGCCGC GCACCCTCGT GGTCACCGCC GACGGTGAGG GCGGCGTGGA CTACCTGGTC GTGCGCGGAC GCGAACCCCG TCTGGGCGAG GAGCACCTCG TGCCCGAGAG CGGGATCCGC GGGGCGATGG ACCGGGCCGT GGCCGAACTC ACCGTCGGCC AGGGCGGCGA CCAGATGTAC ACCCTGGCCG ACTTCGGCAT CCAGTACGTG CTCTACCCGC GCCCGCGGAT CAGCGGGCCC GCCGACGTCA CCATGGTCGA CACCCTGGAC GGCACACCGG GGCTGGAGCG GCAGTCGCTG TCGCGCCACT ACGCGCTCTG GCGGCTGGCC GCCCCCACCG GCGCGCTGCG GGTGGTCTCC GAGGACGGCG TCGAAGCCGA GGTCCTCGCG GTGCGCGGGG ACGCGGACGA GGTGAGCGCC CCGGTCCCCG AGGGCGGCAC CGGGCGGCGG CTCGCCCTGG CCGAGGCCGC AGACAGCGGC TGGCGCGCCA GCCTGGACGG CGTCGAACTG GACCCCGTCC CCACCCAGAA CGGAACCCAG GCCTGGGCGC TGCCCGTGGA GGGCGGCGAC CTGCGCGTCT GGCACACCGA CTACGTCCAC GCGGCCTGGC TGCTCACGCA GGGGGTCCTG CTCACGGTGG TCGCGGTCCT CGCCGCGCCG GGTGTGCGGA CCGAGGAGGA GGCGCGGCTG ATCGAGGCCA CGCCCACACC CCGCCCGCGG CGGCCGGAGC GGCTGCGGCG CTCGGGGAGG TCCCGGGCGT CCTCGCGCCG GGGCTCACGC GCCAGGCCCG GCCGGTCGCG GCCCGGCGGT GAGGACCCCG GCGTGCGCGC GGACGGGGAC GCCGGAGCGG ACGCGCCGGA GGACGGCGTC CCGCCCGGTT CCGCGTCCGA GGAGGACACC GGCACGCTGC CCGCCGTGCG GGGCGGGGGC CGCCGCCGGG GCACCCGCGG CGTGCGCAGG GGGGAGCGGC GCCGTGGCCG GTGA
|
Protein sequence | MPDQDFARHV VTAVIVSHDG SRWLPETMQA LRGQSWPVQR AVAADTGSTD DSAEVLARFL PADAVVELPA DTGYGDAVRA ALELPRSTSA VRGFDEDATE WIWLIHDDLT PDRDALAHLL DAADQDPRAA VLGPKLRDWF DRRLLVEAGV TIDGAGRRET GLEQREFDHG QHDGTRQVLA VSSAGMLVRR DVWDRLGGFD PSLPLFRDDI DFCWRVGGAG LRAVLVTDAV AYHAEASARR RRPISATRNH PRRVDRRHAV FVLLANLPLG GMVAALLRNS VASLLRVVGY LLIKQPANAL DEAAAITLVY LRPLRLMRAR FRRRRDRRRT YSAIRPFLAR GVAMRQVHDV LTGILSGEPV RNTPGRHQAV TAPPAAEDED EIPTDTAGFV RSVLLRPGVL LVLALAAVAL VAERSLLLGD LLAGGALPPV AGGAGDLWNL YLSGSPDSGL GAADPVPPYV GLLALLSTLT LGKPWLAVTI VLLGCVPLSG LTAYLLAREV LGFRPARLWT AAAYALLPAA TGAVAQGRLG TALVHVLLPV LGLLLVRLVS MPPKPSRRAA WGVGLVLTVA TAFVPMVWLL SLVTGVLVAV AFGHLGRRIY VSVALGLAVP LVLLMPWTLE LLLHPSLWLL EAGLHRPELS APGPTPQELL MLSPGGPGTP PFWVTAGFAV AALCSLLLLR NRMLAAAGWS LALFGILVAV LTSRVVVEPY YGGPPAPVWP GVALTFAATA VLLSAATAAR SFGDMVRLGG PRRVFALAVG LLALATPVCA AGVWMWEGVR GPVTAHAEPV VPGALTGAGT VEGGGSDGAG TRPRTLVVTA DGEGGVDYLV VRGREPRLGE EHLVPESGIR GAMDRAVAEL TVGQGGDQMY TLADFGIQYV LYPRPRISGP ADVTMVDTLD GTPGLERQSL SRHYALWRLA APTGALRVVS EDGVEAEVLA VRGDADEVSA PVPEGGTGRR LALAEAADSG WRASLDGVEL DPVPTQNGTQ AWALPVEGGD LRVWHTDYVH AAWLLTQGVL LTVVAVLAAP GVRTEEEARL IEATPTPRPR RPERLRRSGR SRASSRRGSR ARPGRSRPGG EDPGVRADGD AGADAPEDGV PPGSASEEDT GTLPAVRGGG RRRGTRGVRR GERRRGR
|
| |
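The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below strips that grouping, computes a GC fraction, and translates with the codon assignments of translation table 11, using only the first three blocks of the gene sequence as input. The helper names are hypothetical, and translating the first codon as M is a simplifying assumption (GTG is an accepted start codon in table 11); this is an illustration, not the database's own pipeline.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base);
# translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_prefix(seq):
    """Translate whole codons from the start of a CDS.

    Assumption: the first codon is a valid start and is read as M.
    """
    usable = len(seq) - len(seq) % 3
    residues = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0:
            residues.append("M")
            continue
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residues.append(AMINO_ACIDS[index])
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("GTGCCTGACC AGGATTTCGC CCGACACGTC")
print(gc_fraction(prefix))
print(translate_prefix(prefix))
```

The translated prefix can be compared with the first block of the listed protein sequence (`MPDQDFARHV`). Running the same functions on the full gene sequence would require pasting both sequence lines into `clean_sequence`.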