Gene Information |
Locus tag | Ndas_4845 |
Symbol | |
ID | 9248731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5738917 |
End bp | 5741853 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | glycosyl transferase family 51 |
Protein accession | YP_003682734 |
Protein GI | 297563760 |
COG category | |
COG ID | |
TIGRFAM ID | |
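
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5738917, 5741853)
print(length)                  # 2937
print(protein_length(length))  # 978
```

Both results match the Gene Length and Protein Length rows. The strand value does not affect the length calculation; it only determines which strand of the replicon is read.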
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.398226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTACCG AACGACGACG GAACGCCGAT GGCGGCCGTC GCCGGGCCGA CGGCCCGGCC GACGGCTCGC GCGGCGGCGG CGGACGACGC CGGGCCGACG GCCCGCCCCG GGACGAACCC TCCGGAGGCC GCCGCCGCCC GGACGCCGGT GACGGCTTCT GGGGGGACGA CACCGGGCCC GAGCGCCCGC GCCGCCGACG CCCCCCGGAG CAGGATGCCC GTTCCGAGGG CCGCCGCCGC TCCGAGGGCG ACCGCCCCCG CCGCCCGGAG GGACAGCGCC GCGCCCCCGC CGACGGTGAC CGCCCCCGCC GCCGCCCCGA GGGCGAGCGC CCCCGGCGGC CCCAGGACGG CGAGCGCTCC GGCCACCGTT CCGAGGGACA GCGCCGCCGC CCGGACGAGG ACCCCCGCGG CCGCCGCAGG GCGGCCGGTT CCGGACCGCG CGACCCCCGG GACCGCGACC CCCGCGACCC CCGCGACCGC GGCCGTGACC GTGAGAACGC CGCCCGCTCC GAGGGGCGGC GAGCCCCGGG CCGCGGCTCC CGCGCCGCCG CCGGCGGCGG CAGAGGAGGC CGGGGAGGCG GCCGGCGCCG CTCCCGCGAC GAGGAGCCGG ACGACCGCCC CTGGTTCAAG CGCTTCCTGT CCAAGGCGTG GAAACCCGCC CTGGCCGTCT TCGGCCTGAT GATCATCGGC GGCGTGGCCG CCTTCGCGAT CCTGTACGCC ATGGCCCCCA ACGTGAACGA GCTCGAGGCC AAGTCGGACG CCAACCTGTC CGCCACCCAG ATCATGTGGG CTCCGGAGGG CGACGCCGAG CCCGAGGTCG CGGTCACCAC GGGCGAGGTC AGGCGGGTCG AGGTCCAGAG CGAGGACATC CCGCAGACGG TCATCAACGG TGTGCTCGCC GCCGAGCAGC GCACCTTCTA CACCGACCCC GGCATCAGCA TCTCCGGGAT CGGGCGCGCC CTCCTCTCCG GCGGCGAGGC GGGCGGCGGT TCCACCATCA CCCAGCAGAT GGCGCGGAAC TACTACAGCG ACCTGGAGAT CGAGAACCAG TACATCCGCA AGATCCGCGA GATCTTCATC TCCATCAAGC TCAACCAGCA GATGGAGAAG GAGGAGATCC TCTCCACCTA CCTCAACACC ATCTACTTCG GCCGCAACGC CATGGGCATC CAGATGGCCG CCCAGTCCTA CTTCGGCAAG GACCTGGACG AGCTGACCGA CGCCGAGGGC GCCTTCCTGG GCATCATCAT CCAGATGCCC AGCAACTTCC AGGGCGAGAT GGGCGACTGG ACCAGGACCT ACCTCAACGA GGAGCGCTGG CCCTACGTGC AGAACCAGCT GGCGCTCATG CACGAGGAGG ACCCCGACCT CGGCCTGCCC AGGGCCGAGG CCGAGGCCCT GGAGATCCCC GAGCTCGTCC CGTACGGCAC CGAGGAGGGC GAGGAGGAGC AGGAGTACGA CCCCAAGTTC GACTACGTCC GCCAGGCGGT CGTCCAGGAG ATCGAGGAGC GCTACTCCGG GATCACCGCC CAGGACATCG CCACCCAGGG CTTCGTCGTG CAGACCTCGC TGGACCCCGT GCTCATGGAC GCCGCGCGCA GCTCCTTCGA CGTCCTGCCG AACACCCCCG AGGACACGAT GATGGGCCTG ACCGCGGTCG ACCCCGCCAC GGGGCAGATC GTCGCCTTCC ACGGGGGCGA CAACGTGGTC GAGGACGTCA ACAACTCGCT GACCCACCGG ACGCAGGCGG GCTCGTCCTA CAAGCCCTAC GTGCTCGCCA CCGCGCTGGA GAACGGCATC 
AGCCTCAACA GCACGTTCGA CGGCGACTCC CCCCAGGAGT TCCCGGGGCT GCAGAGCGAG GTCATCAACG CGAGCGGCAG GAGCTGGGGT CCGGTCAACC TGATCCAGTC CACCGCCCAG TCGATCAACA CCCCGTTCGT GGAGCTGGCG GTGCAGATGA CCCCGGCCGC GGTGGACGAG CTCGCGGTCC AGGCCGGTGT GGCCGAGGAG CAGACCACGA CCTCCGCCCA GGGCCCGCTC ATCGCGCTGG GCACCCACCA GGTGAGCGCG CTGGACCAGG CCGAGGGCTA CTCCACCTTC GCCGCCGGGG GCGTGCACCG TCCGGCGCAC ATGGTCACCG AGCTGCGCAC CTCCGACGGC CAGGTCATCC CGCCCGACGA CGCCGACCTG CTGGAGAACG GTGAGCAGGT GGTCACCGCG GGCGTCGCGG CCGACGTCAC CTACGCCATG ACCCAGGTCG TCGAGCCCGG CGGCGGTGGC GACCAGGCGG CGCTGCCCGA CGGGCGCCCC GTCGCGGGCA AGACCGGTAC CTCCAGCGAC GCGGTCTCCG CGTGGTTCGT GGGATACACG CCGCAGCTCG TCGCCGCGGT GGGCCTGAGC CGCGCGGACG CCAACCAGGC CCTGGAGTTC GACGGCCAGA CCGCGGGGGA GATCTTCGGT GGCACCACCT CGGCCAACGT GTGGCGCGAG TTCATGACGA CCGCCATGGA GGGCGTCGAA CCGGCGCAGT TCCCGCCGCC CGCCTACGTC GGTACGGAGC AGAGCTTCGT GCCCACGCCC TCCCCGAGCG CCGAGGAGAG CGACGAGCCG AGCGACGAGC CCTCGGTCGA GGAGTCCCCG AGCGAGAGCC CGGCGGAGTG CGACCCGTCC ATGCCGGACT CCCCGGAGTG CCAGGACCAG CAGACCGATG AGCCCTGTGA GCCCGGGTGG GGCCGGGACT GCCCGGAGAG CCCCGGCTCG GGGGACCAGG AGGAGTGCGG TGGCTGGGGC CAGCCCGCCT GCGAGGACAC CGATCCCAGC GGTGAGGCCT CCGACCCCGA GCAGGACGGC GGGGAGCAGG GCGGCGGGAT CTTCGGGAGG ACGACCAACA CCGGCGGCGA GCCCGGAAGG TTGGTGATCC TCGGCCGGAA CGACTGA
|
Protein sequence | MSTERRRNAD GGRRRADGPA DGSRGGGGRR RADGPPRDEP SGGRRRPDAG DGFWGDDTGP ERPRRRRPPE QDARSEGRRR SEGDRPRRPE GQRRAPADGD RPRRRPEGER PRRPQDGERS GHRSEGQRRR PDEDPRGRRR AAGSGPRDPR DRDPRDPRDR GRDRENAARS EGRRAPGRGS RAAAGGGRGG RGGGRRRSRD EEPDDRPWFK RFLSKAWKPA LAVFGLMIIG GVAAFAILYA MAPNVNELEA KSDANLSATQ IMWAPEGDAE PEVAVTTGEV RRVEVQSEDI PQTVINGVLA AEQRTFYTDP GISISGIGRA LLSGGEAGGG STITQQMARN YYSDLEIENQ YIRKIREIFI SIKLNQQMEK EEILSTYLNT IYFGRNAMGI QMAAQSYFGK DLDELTDAEG AFLGIIIQMP SNFQGEMGDW TRTYLNEERW PYVQNQLALM HEEDPDLGLP RAEAEALEIP ELVPYGTEEG EEEQEYDPKF DYVRQAVVQE IEERYSGITA QDIATQGFVV QTSLDPVLMD AARSSFDVLP NTPEDTMMGL TAVDPATGQI VAFHGGDNVV EDVNNSLTHR TQAGSSYKPY VLATALENGI SLNSTFDGDS PQEFPGLQSE VINASGRSWG PVNLIQSTAQ SINTPFVELA VQMTPAAVDE LAVQAGVAEE QTTTSAQGPL IALGTHQVSA LDQAEGYSTF AAGGVHRPAH MVTELRTSDG QVIPPDDADL LENGEQVVTA GVAADVTYAM TQVVEPGGGG DQAALPDGRP VAGKTGTSSD AVSAWFVGYT PQLVAAVGLS RADANQALEF DGQTAGEIFG GTTSANVWRE FMTTAMEGVE PAQFPPPAYV GTEQSFVPTP SPSAEESDEP SDEPSVEESP SESPAECDPS MPDSPECQDQ QTDEPCEPGW GRDCPESPGS GDQEECGGWG QPACEDTDPS GEASDPEQDG GEQGGGIFGR TTNTGGEPGR LVILGRND
|
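
The gene and protein sequences above can be related with a minimal translation sketch. Translation table 11 assigns the same amino acids to codons as the standard code but allows alternative start codons, which is why the leading GTG appears as M in the protein. The start-codon set below is a deliberately small assumption (ATG, GTG, TTG), and all function names are hypothetical. The sequence is assumed to be given 5' to 3' on the coding strand, as in the Gene sequence field.

```python
from itertools import product

# Standard amino-acid assignment in NCBI codon order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: a reduced set of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing and line breaks, then upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGAGTACCG AACGACGACG GAACGCCGAT"))  # MSTERRRNAD
```

The printed residues match the first block of the Protein sequence field. Passing the full Gene sequence value to `gc_fraction` gives a figure that can be compared with the GC content row.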