Gene Information |
Locus tag | Ndas_0250 |
Symbol | |
ID | 9244084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 310786 |
End bp | 313464 |
Gene Length | 2679 bp |
Protein Length | 892 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | glycosyl transferase family 51 |
Protein accession | YP_003678205 |
Protein GI | 297559231 |
COG category | |
COG ID | |
TIGRFAM ID | |
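The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function names `feature_length` and `protein_length` are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


gene_len = feature_length(310786, 313464)
print(gene_len)                  # 2679, matches "Gene Length"
print(protein_length(gene_len))  # 892, matches "Protein Length"
```

Under these assumptions the values reproduce the listed Gene Length (2679 bp) and Protein Length (892 aa).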
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0340145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGAGT CCAACCAGCG CCCGCCGCGT GGGCGGCGGC ACGAGGCCCC TCGCCGGAAC TGGCGCGGGG CCCTCTCCCG GGCGGTACCG GCCGCCGCGG GTCCGCGTGT GCGGCGGTGG GCCGAGGCCC TGCGCCGGAA GGTGTCGGAG CCGACCCCGG CGGGGGACCG CCGGGAGACG GTCCAGCGCC TGGCCGGAAC CGGCGCGGTC GCCGGTCTGC TCACGGCGGC GCTGGTCATG CCCTGGGTCG GCGGCCTCGG CCTGGCGGCC AGGGACTCCG CGGCGGCCTT CATGGCCCTG CCCAGCGACC TGGCCGTGCC GCACCCCGCC GAGCGCGTGC TGCTGACCGA CGTCGACGGG GAACCGATCG CCGAGGTCGC CGAGCGCGAG CGCGACGTGG TGCCGCTGGA CGAGATCAGC CCCTGGGTGC CCGCCGCCCT CATGGCGATC GAGGACGACC GCTTCTACGA GCACGCCGGA CTGGACCTGC GCGGCACGCT GCGCGCCGCC GTCCGCACCG TCCTGGGCAA CACCCAGGGC GGGTCCACCA TCACCCAGCA GTACGTGAAG AACCTCCTCA TGGAACAGGC CGACACCGAG GAGGAGCTGG CGAGCGCCAA CGCGCGCACC CTGACCCGCA AGGTGCTGGA GCTGCGCTAC GCCATCGAGC TGGAGGAGAA GCTCACCAAG GACGAGATCA TGGAGGGCTA CCTCAACCTC GCCTACTTCG GCCAGAACGC GTACGGCATC GAGGTCGCCG CCGAGCGCTA CTTCTCCGTC CCGGCCTCCG AGCTCGACCC CGCGCAGGCC GCCACGATCG TGGCGCTGGT GCGCGCGCCC TCGTACTACG ACCCGCTCAC CAACCCCGAG GCCTCCGTCG AGCGCCGCAA CCTGGTGCTG GACCGGATGG TCGCCACCGG ACACCTGGAG AGCGCGCAGG CGCAGGAGTA CAAGAGCCGG GGCCTGGAGG TGGACGAGAC CCCGCGCGCG GGCAGCTGCT TCAGCAGCGA GCAGCCCTTC TTCTGCGACT ACGTCATGCG GTGGCTGGGC GGCTCCGACG CGCTCGCCGG GACCCAGGAG GAGCGCGACC GGATACTGGA GCGGGGCGGC ATCACCGTGC GCACCACCCT GGACCTGGAC ATGCAGGAGG CCGCCGAGCA GGCGATCGAG CGCTACGTCC CCGCGGGCGA CTCCCACAAG TTCGCCGCCG AGGTCCTCGT GGAGCCCGGT ACCGGCCGGG TGCGGGTGAT GGCCCAGAAC ATGCGCTACG GCTTCGACGA CGAGCCGGGC ACCACCTCGA TCAACCTGTC CGTGGACCAC GAGGACGGCG GGTCGCTGGG CTACCAGGCG GGTTCGACGT TCAAGCCGTT CACCCTGGCC GCCGCCCTGG ACGCCGGGCT CAAGTACGAC ACCAGCTTCT CCTCGCCCGA GTCCACGACG GTGAGCGGCC TGGAGAACTG CGAGGGCGGC AGGATGGCGC CCTGGGACGT GCGCAACGCC GGGGAGAGCG ACGGCGGCAG GCACAACATG ATCAGCGGGA CGAAGGGCTC GGTGAACACC TACTTCGCCC AGCTCCAGGA GCGCGTCGGC CTGTGCGAGA CGGCGGAGAT GGCCCAGAGC CTGGGCATCC ACCGCGCGGA CGGCGAGGAC CTGCAGGTGT GGAGCTCCTT CACCCTGGGC GACCAGGAGG TCTCCCCGCT CACCATGGCC AGCGCCTACG CCGTCTTCGC CTCCCGGGGC ACCTACTGCG AGCCGGTCCC CGTGGCCTCG GTCCTCTTCG AGGGCGAGGA CGGCGAGGAG 
GTCGAGATGG GCACCGAGTG CGAGGAGGGC GCGCTGGACA CCGAGGTCGC CGACGGCGTC AACCACCTGC TCCAGCAGAC CTTCGAGGGC GGTACCGCCA ACGGCCTGGA GATCGGACGC CCTGTGGCGG GCAAGACCGG CACCACCGAC AGCGCGGCCT ACGCGTGGTT CGCGGGCTAC ACCCCGAACC TGGCCGGGAC CGTGGTGGTC GGCGACATCC GGGGCGGGGA GCAGCACACG CTCCAGGGCG TGACCATCGG CGACCGCTAC TACGGCATCG TCTACGGGGC CACGCTGCCC GGTCCGATCT GGCAGGCCAC CATGCGCGAG GCCGTGGCGG ACCTGCCGGA GGAGGAGTTC GCCCCCTCGC CGAAGGTCTA CGGCAAGGCC TCGGACAAGC CATCGGGCGG CGGTGGCGAC AACGGTGACA GTGACGACAG TGACGCCGGT GGAGGCGATG GAGGCACGGC TGGCGGTGAC GGCGGCGTCG CGGCCGGTGG TGGTGGCGGA GGCACTGGCG GTGGCGGCGG TGCCGGGGGT GACGATGGCT CGACCGGCGG CGGTGGCACC GGTGACGGTG GTGGGGGAGC CACCGGCGGT GGTGGCGGGG GTTCGACCGG GGGTGGTGGC GGTACCGGAG GCGGCGGAGG ATCGACCGAC GGCGGTGGAG GCACCGGTGA CGGCGACGGC GGGGGTTCAA CCGGTGGAGG CGGCGGTACC GGTGGCGGCG GCGGTGGTGG TGGAGACGGC AGCGGTACCG GCGGAGGCGG CGGCGGTACC GGTGGCGGCG AGTCCCCGGG CGGGGACGGC GGCGCGCCCT GGGGCGGCGC GACCCCGGGA CCGCAGGCGC CCGAGGGCGG CGCCAGTCCC GGGGGCTGA
|
Protein sequence | MTESNQRPPR GRRHEAPRRN WRGALSRAVP AAAGPRVRRW AEALRRKVSE PTPAGDRRET VQRLAGTGAV AGLLTAALVM PWVGGLGLAA RDSAAAFMAL PSDLAVPHPA ERVLLTDVDG EPIAEVAERE RDVVPLDEIS PWVPAALMAI EDDRFYEHAG LDLRGTLRAA VRTVLGNTQG GSTITQQYVK NLLMEQADTE EELASANART LTRKVLELRY AIELEEKLTK DEIMEGYLNL AYFGQNAYGI EVAAERYFSV PASELDPAQA ATIVALVRAP SYYDPLTNPE ASVERRNLVL DRMVATGHLE SAQAQEYKSR GLEVDETPRA GSCFSSEQPF FCDYVMRWLG GSDALAGTQE ERDRILERGG ITVRTTLDLD MQEAAEQAIE RYVPAGDSHK FAAEVLVEPG TGRVRVMAQN MRYGFDDEPG TTSINLSVDH EDGGSLGYQA GSTFKPFTLA AALDAGLKYD TSFSSPESTT VSGLENCEGG RMAPWDVRNA GESDGGRHNM ISGTKGSVNT YFAQLQERVG LCETAEMAQS LGIHRADGED LQVWSSFTLG DQEVSPLTMA SAYAVFASRG TYCEPVPVAS VLFEGEDGEE VEMGTECEEG ALDTEVADGV NHLLQQTFEG GTANGLEIGR PVAGKTGTTD SAAYAWFAGY TPNLAGTVVV GDIRGGEQHT LQGVTIGDRY YGIVYGATLP GPIWQATMRE AVADLPEEEF APSPKVYGKA SDKPSGGGGD NGDSDDSDAG GGDGGTAGGD GGVAAGGGGG GTGGGGGAGG DDGSTGGGGT GDGGGGATGG GGGGSTGGGG GTGGGGGSTD GGGGTGDGDG GGSTGGGGGT GGGGGGGGDG SGTGGGGGGT GGGESPGGDG GAPWGGATPG PQAPEGGASP GG
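The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch for turning such a display string back into a contiguous sequence and estimating GC content follows. It assumes GC content is the share of G and C bases over the whole gene sequence; the record does not say how its 73% figure was derived. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display; return upper case."""
    return "".join(grouped.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two display blocks of the gene sequence, used only as a small example.
example = "GTGACGGAGT CCAACCAGCG"
print(len(clean_sequence(example)))  # 20
print(gc_percent(example))           # 65.0
```

Applied to the full gene sequence, the same functions could be used to compare the result against the listed GC content and to confirm that the cleaned length equals the listed Gene Length.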