Gene Information |
Locus tag | Ndas_0388 |
Symbol | |
ID | 9244226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 475627 |
End bp | 478980 |
Gene Length | 3354 bp |
Protein Length | 1117 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | glycosyl transferase family 51 |
Protein accession | YP_003678342 |
Protein GI | 297559368 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
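The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. This is a minimal sketch. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in Protein Length. The variable names are illustrative and do not come from the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 475627
end_bp = 478980

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with one stop codon,
# which is not counted in the protein length.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values agree with the listed 3354 bp and 1117 aa.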
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.62292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGGACG TGATTTGCCA TGGGGGCGGG GAACTGAGCG AGAGCACACA CGGCGAAAGC GGGGCGGACG GACAGCGTCC CGGGAAGGAC ACGCCGGACA CCAGGAACAC CTCTGCCGAA GAGCTGTCCA ACAAGCACAG TCCCGAGAGT GGAGTACCCA ACATCGAGGG GGACCAATCC ACTCATGACA CTCCTGGAGG AGAAGAAAGC GTAGGAGAAC AGCCCGAACC GGAGTTCATC AACCCGGTCA CCTTCAAGAC CAGCGGTACC TTCCTCCGTG ACAAGGTCGC CCAGAACCTC GCGGAACAGG GCTTCGGTTC CGACGACACC GGCGACGACG CCGCGGAGGA CTCCGGCGTT CCCGCCGACG CGCACGGCAC CGATGCGGTC CACACCTCTC CCGCGGCCGG CTCCCCCGCC TCAGCCGGCA GCACGGGCAC GGAGGAGTCC GACACGCGCG AGGACGGGGG TCGGGGGGAC GGACGGTCCT CCCAGGCCGC CCCGCCCTTC GCGGCCGACG ACACGTCCGT CGACCAGGAC GGTACGGCCG GCGCCGACGA GCCCCCATCG CCGGTGTCCG ACCGGACCGA GCCCATCTCG CTCGACTCCG TCCGCGCCGC CGCCTCCGCG GAGGCGGCGG AGACGTCCGA CGACGCCTCC GGCAACGCCG GTGCCGAAGA GGGAGGGACA GACCTCTGGA GCCCGCGCGG GCAGGAGCGC ACGGGAGTTC CCAAGAGCCC CGTGGCCGGA ACCGTGACCG ACGTCCAGGA CTTCGAGGAC GGGTCCGCCG AGGCCGCCAC GTCCGCTGTG CCCGCCGCAC CGGACGCGGT CGCTCCGGAC CGTGACGGAG AGCAGAGGAT CGACGAGTTC TCCGCGGCCG AGACGGCGTC CCTGTGGCCC GAGCCGCGTC GCCTCTCCTC CTACGTGGCC GATTCGCCCT CCCGGGAGCA GGACCGGGGC GCCCGCCCCG GTTTCGCCGC GGCCGCGGCC GGAGCGGCGG GTGCCGCGGG AGCGGCCGCC GCCCACTCCG GGGCACACGG CGACCCCAGC GGTCCCGGTG ACACCACCAG TTCCAGTAAC TCCGGTGGAC CCAGCGGTCC CACCGGCCCT GGCGGCCCCA AGGGTCCGGC TGGGCCTGGC GGTACCGGGC CCGGCGGGCC CAAGGGCCCG GGTGACCCCA AGGCACGGGG TAAGAAGAAG GGCGCCGGTC AGAAGGCCAA GAAGCCCATG TGGTGGAGGA TCCTGCGGGT CTTCCTCATC GTCACCGGCG TGTTCTTCAT CATCGGGTGC GGTGTCTTCG CCTTCTTCTA CAGCACCGTG GAGGTTCCGG ACGCGGCCAA GGCCGACGTC CTGGAGGAGG GTTCCACCTT CTACTTCGCC GACGGCGAGA CCGAGTTCGC CTACCGGGGC ACCCACCGGG AGATCCTGTC CTACGACGAG ATGACCGCGG GCGGCGACCA CGTCGTCGAG GCGGTCATCT CCGCGGAGGA CCGCGGTTTC TGGACCGAGC CCGGGGTGTC CGTGAGCGGC ACCGCCCGCG CGGTCTGGTC CACCGTCACC GGCCAGCAGG TCCAGGGCGG CTCCACCATC ACCCAGCAGA TGGTCCGCAA CTACTACGAG GGCATCTCCC GGGACGTGTC CATCACCCGC AAGGTGCGGG AGATCATCAT CGCCCTCAAG GTCGACCGGA GCGAGTCGAA GGAGTGGGTG ATGGAGCAGT ACCTCAACAC CATCTACTTC GGCCGCAACG CCTACGGAGT CCAGGCCGCC TCGCAGGCGT ACTACCACAA GGACGTCCAG 
GACCTGGAGC CCGCCGAGGC CGCGTTCCTG GCCGCCGCCA TCCAGCAGCC CACCCCCTTC GGTGAGGCCG ACGTGGAGAC CACCCCCTCC ATGGAGCGGC GCTGGGAGTA CGTCGTCAAC GGCATGGTCA CCACGGAGGC CATCACCCAG GCCGAGGCCG ACGCGATGGA GTTCCCGGCG CCCGAGCCGG AGCGGCCCAG CGAGGGCACC GACCTGAGCG GCTACAAGGG CTACATGCTC CAGGAGGCCA TGAACGAGCT GGAGCGGCTC GGCTACACCG AGGACCAGAT CAACCGCCAG GGCTACAGGA TCGTCACCAC GTTCGACCAG GACATGATGG ACGCCGCCTA CGCCGCGGTG GAGGAGATGG TCCCGCTCGA GAACCGGCCC GAGGGCGTCA ACGTCGGTCT GACCACCGTC GACCCGGCCA CGGGCGAGGT CCTGGCCTTC TACGGCGGCC ACGACTACTG GGAGAACCAG TACGACAGCT CCTTCCTCGG CGCCGCGCAG ACCGGTTCGG CGTTCAAGCC CTACGTGCTC GCCACCGCCC TGGAGCAGGG CTACAGCCTC AACTCCACGG TGGACGGGCG CGGGCCGCGC ACCATCGCGG GCTCGCGCAT CCAGAACGCG GGCAACTCGC CGGGCGGCAT CATGACGCTC ACGCAGGCCA CCCAGGTCTC CAACAACCTC GGTTTCATCG AGCTGGCCCA GGAGGTCGGT TTCGAGAACG TCCGCGACAC CGTCTACGCC GCCGGGTGGC CCGAGGGGAG TGTGCCCGAC AGCCAGCTGG TTCCGGTCAT GCCGCTGGGC GCCTCCAGCG CCCGCACGGT CGACCAGGCC AGCGGGTACG CCACCTTCGC CAACGGCGGC GTCCACGTCG AGACGCACGT GGTGCGGGAG ATCATCGACT CCGAGGGCGA GAACGTGCGC CCGGAGGTGG AGAGCAACAG GGCTCTGGAG GAGGAGACCG CGGCCGACGT CACCCACGCC CTCCAGCAGG TGGTCAACGG CGGTACCGGT ACCGGGGCGC GGCTGGCCAA CCACCCCACC GCGGGCAAGA CCGGTACCAC CGACGGCAGC GTCGCCGCCT GGTTCGTCGG TTACACCCCG CAGCTGTCCA CGGCGGTGGG GATCTACAGC GGCAACAACG AGGGCTTCAG TATCCCCGGA TACGGCTCTC TGTCGGGCGG TACGCTCCCG GCGTCGCTGT GGAACCGGTA CATGAGCACC GCGATGGAGG GTTACGAGCC CGGTTCGTTC CCGAGCCCGG CCTTCAGCGG CACCACCGAG AACTGGGCTC CGGACGTGTC CACCGAGCAG CCCCAGCAGC CGCAGCAGCC GGAGGCGCCC GCGGAGCCGG AGACGCCGGT GGAGCCGGAG GTGCCCACCA CGCCGGAGAC GCCGGTGGAG CCGGAGGTGC CCACCACGCC CGAGTGGCCG GAGATCCCCG GTCCCGGGAC CGATCCGGGT CCCGGTGAGG GCGGCGGCGG AGAGGAAGGC GGTGGCGGTG GCGGCGGTGA GAGCCCGCCG GCCAGACGCG ACGACCTCTG GTGA
|
Protein sequence | MPDVICHGGG ELSESTHGES GADGQRPGKD TPDTRNTSAE ELSNKHSPES GVPNIEGDQS THDTPGGEES VGEQPEPEFI NPVTFKTSGT FLRDKVAQNL AEQGFGSDDT GDDAAEDSGV PADAHGTDAV HTSPAAGSPA SAGSTGTEES DTREDGGRGD GRSSQAAPPF AADDTSVDQD GTAGADEPPS PVSDRTEPIS LDSVRAAASA EAAETSDDAS GNAGAEEGGT DLWSPRGQER TGVPKSPVAG TVTDVQDFED GSAEAATSAV PAAPDAVAPD RDGEQRIDEF SAAETASLWP EPRRLSSYVA DSPSREQDRG ARPGFAAAAA GAAGAAGAAA AHSGAHGDPS GPGDTTSSSN SGGPSGPTGP GGPKGPAGPG GTGPGGPKGP GDPKARGKKK GAGQKAKKPM WWRILRVFLI VTGVFFIIGC GVFAFFYSTV EVPDAAKADV LEEGSTFYFA DGETEFAYRG THREILSYDE MTAGGDHVVE AVISAEDRGF WTEPGVSVSG TARAVWSTVT GQQVQGGSTI TQQMVRNYYE GISRDVSITR KVREIIIALK VDRSESKEWV MEQYLNTIYF GRNAYGVQAA SQAYYHKDVQ DLEPAEAAFL AAAIQQPTPF GEADVETTPS MERRWEYVVN GMVTTEAITQ AEADAMEFPA PEPERPSEGT DLSGYKGYML QEAMNELERL GYTEDQINRQ GYRIVTTFDQ DMMDAAYAAV EEMVPLENRP EGVNVGLTTV DPATGEVLAF YGGHDYWENQ YDSSFLGAAQ TGSAFKPYVL ATALEQGYSL NSTVDGRGPR TIAGSRIQNA GNSPGGIMTL TQATQVSNNL GFIELAQEVG FENVRDTVYA AGWPEGSVPD SQLVPVMPLG ASSARTVDQA SGYATFANGG VHVETHVVRE IIDSEGENVR PEVESNRALE EETAADVTHA LQQVVNGGTG TGARLANHPT AGKTGTTDGS VAAWFVGYTP QLSTAVGIYS GNNEGFSIPG YGSLSGGTLP ASLWNRYMST AMEGYEPGSF PSPAFSGTTE NWAPDVSTEQ PQQPQQPEAP AEPETPVEPE VPTTPETPVE PEVPTTPEWP EIPGPGTDPG PGEGGGGEEG GGGGGGESPP ARRDDLW
|
| |
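The gene and protein sequences above are related through translation table 11, where an alternative start codon such as GTG is read as M at the first position. This is a rough sketch of that relationship and of a GC-fraction calculation, not the database's own pipeline. It assumes the sequence is given 5' to 3' on the coding strand, in frame from its first base. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Codons that table 11 permits as translation starts (read as M when first).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE.get(codon, "X")  # X marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "GTGCCGGACG TGATTTGCCA TGGGGGCGGG"
print(translate_cds(prefix))  # prints MPDVICHGGG
```

The printed residues match the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.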