Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1268 |
Symbol | |
ID | 8602580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 1440147 |
End bp | 1444067 |
Gene Length | 3921 bp |
Protein Length | 1306 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | glycosyl transferase family 51 |
Protein accession | YP_003298889 |
Protein GI | 269125519 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.796636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTCGG ATGGCACGAC GCCCATCGAG CCGGACGACG GCGGTGCAGC GGCCTCGCCG GTCACCGGCA CCCCACCCCC CTCCGCCGGC CTTCCGGCCG CCCCCCGTCC CGGCGGATCT TCCGGCACCC CTCCGGGAAC CGACGCCGCC CCGGGCGCGT CCTCCGCCCC CGCCGCGGAA GCCGCCCCCG GCGACGCCAC GGAAATCGAC GCCGACCCCG ACGACTTCTC TGACGACACC ATGGAGACCC ACGCCGACCG CGGCACCCCC TCCGACCAGA CCGCGGCGAC CCGCAGTGAA CCCGACGAGC CCGCCGGCGG CATCGCCCCC GAAGCCGACG CCGGCCCCGA CGGCTCTTCC GACGCCGCGG ACAACGAATC CGGCTCCGGC ACAGCGGCCA ACGCCGAACC CACCGGCACC ACCACCGCAG CCGAAGCCGG TTCCGGCGGC TCTTCCAACG CCGCGGACAA CGAACCCGGC ACCGGCACGG AGACCAACGC CGAACCCACC GGCACCACCG CCACAGCCGA GACCGGTTCC GGCGGCTCTT CCGGTGAGGC CGCAGGCGAC GAACCCTGCA CCGGCACACC CACCGGGGAC GGCACGGAGG CTCACCCCGA GCACACCGCC CCCGACCCGG ACAAGCCCTC CGAGGCGGCC GAGGAAGAGC ACCCGGTCTC GAGCACGCCC ACCGAGCACG ACGCCGAATC CGGCGACCTC CCCGGCGACG CCGCGGACAA CGAACCCGGC ACCGGCACAG CGGCCAATGC CGAACCCACC GGCACCACCA CCGCAGCCGA GGGCGGTTCC GGCGGCTCTT CCGGCGAGGT GGCAAGCGAC GACCCCGGCT CCGGCACACC CACCGGGGAC GGCACAGAGG CTCACCCCGA GCACACCGCC CCCGACCCGG ACAAGCCCTC CGGCGCGGCC GAGGAAGAGC ACCCGGCCTC GAGCACGCCC ACCGAGCACG ACGCCGAATC CGGCGACGCC GCAAGCGGCT CCCGCACGGC GGCCGATGTC TTGGACGGCC GCGACGGGAC CGTTCCGAGC GCCGTCGCGG ACACCGGCGA TGCCGTGGAA AGCCGCGCCG TCTCCGCCGG GGCCGCCGGC GCCGCCGCGG AACCCGAGGC AGCCGCAGCA GACCCCGCCG GGGGCGGGCA CGGGCGCACC GATGCGTCCG CCGGAGACGG CGCGGAAGGC CGCTCCGTCT CCGGCGGGGA CGCCTCGCCG GGCGCCGCCG CGGAGAGCGA GGCCGACCCC GGCGGCGACT CCACCCGGAC CCGGCTCGAC CTGCCGCAGC TGCCCGACGA CGCCGACAGC ACGCGGGTGG ACCTGCCGAA GCTCTCCGAT GAGAGCGAGC AGACGCGGGT GGACCTGCCG GCCCTCTCCG ACGCCCCGGC CGCGTCCGCT TCCCGGAAGG AGGGGGAAGC TGCGGCCGTC GGTGACGCCA TGGCCGAGGC GGGACGGACG AGCACTTCTG AGACTGCAGG GGCCGATGCG GCCCTAGTGC AATCCTCCGG CGCAGCGGGT ACTGCGATGG TGGCGGGAGC CGCCGCAGGC GGAGGCGGCA CGGCGCAAGC CCCGGCGGGG GAGACCGGGG CGGCGTCCGC GGCGGCCTCC CTGGGAAGCG TCGCGGTGCC GGACAAGGGG GAGGACGCGG CTGTGGGGGC GGCTGCGGAG CGGGAGAAGG CCGGCCGGAG CGGCAGACGG GGCGGCCGCC GGCGCAAGCA GCGCAGCCGG CTGGTGCGCT ACACGCGCCG GGCCGCCATC GCGATGCTGC TCTTCCTGGG CCTGGCCATC GCCGCGTTCG CCGTGGCCTA CATGCTCACG CCCGTCCCCT CCCCGCAGGA GGACGCCACC GCGCAGGGCC CGGAGTTCTA CTACGACGAC GGCAAGACCC TCATCGCCAA GATCGGCGTG AACCGCCACA AGGTCGACCT GGAGGTCGTC CCCGAGCACG TGCGCAACGC GGTGATCGCG GCGGAGAACC GCAGCTTCTA TGAGGACCCG GGGGTGTCGA TCCGCGGCAC GATCCGGGCG TTCTGGTCGA CGGTCTCCGG TGAGCAGCTG CAGGGCGGCT CGACCATCAC CCAGCAGATG GTGCGCAACT ACTACCAGGG GCTGGGGCAG GAACGCACGA TCAACCGCAA GTTCAAGGAG ATCATGGTCT CCTTGAAGGT CGGCAGCCAG CGGGACAAGG ACTGGATCCT CGAGCAGTAC CTGAACACCA TCTACTTCGG CCGCGACGCC TACGGCATCC AGGCCGCCGC CCAGGCCTAC TACGACAAGG ACGTCGGGCA GCTCACCAAG GCGGAGGCCG CCTACCTGGC CGCGGCGATC CAGCAGCCCA CCCCGTTCGG CAACCCCAGC GGCGACAACC GCGCCATGGC CGAGAACCGG TGGCGCTCGG TGGTGAACGC GATGGTGGAG ACCGGGGCGC TCACCCCCGC CGAGGCGGCG GCGATGAAGT TCCCCATGCC GGTCGAGCAG AAGGTCACCG ACATCCTCAA AGGCCAGATC GGCTACATGG TCCGCATCGC GCAAAAGGAG CTGCGCGAGC GCCGCGGCTA CACCGAGGAC CAGATCAACC GCGGCGGGCT GAAGATCGTC ACGACCTTCG ACAAGGACCT GATGCGGGCC GCCGAGAAGG CGGTGAAGGC CAACCTGCCC CCCAACGTCG GGGAGCGGAC CCTCACCGGG CTGGTGTCGG TGGATCCGGC CACCGGGCAG GTCGTCGCCT TCTACGGGGG GCGCGGCTAC CTGGAGGAGC AGCTGAGCAC CGCCTTCGGC CACTGGGCGC AGGCCGGATC GGGGTTCAAG CCGATCGTGC TGGCCACCGC GCTGGCCAAC GGCAAGACGC TGGGCAGCGT GGTCAACGGC AGCTCCCCGC AGTACTACAA CGGCACCCCG GTGCGCAACA GCGGCGGCGC CAGCTACGGG ATGATCAACC TGGTCACCGC CACCCAGAAC TCGGTGAACA CCGGCTATGT CAACCTCGGC CTGGAGGTCG GGCTGGACAA GGTCACCGAG ATGGCCGAGA AGATGGGCAT CCCCCGCGAG CAGCTGACCG CCAACGGCGC CAACAAGGCC CCGACCTTCT CGCTGGGCGT GGTCTCGGTG CACCCGGTGC AGCAGGCGGG GGTGTACGCC ACGTTCGCCG CCGAGGGGAT CTTCCGCACC CCGTATGTGG TCAAGTCCGT CACCGAGCTG GACGGCGACA AGCACGTCTA CACCGAAAAG GGCAGGCGGG TCTTCAGCCC GCAGGTCGCC CGGGACGCCA CCTACGCGAT GACCAAGGTG GTCGAAAGCG GAACGGGCAC CAACGCCCGG CTGTACGACG GGCGGGACGT GGCCGGCAAG ACGGGCACCA CCGACAACGG CAACGCCCTG TGGTTCAACG GGTTCATCCC CCAGCTGGCC ACCTCGGTGG CGATCTTCCG CGCCGACAGC CCCACCAAGC AGGTGGAGAT CGGCGGCTAC AGCGCGTTCG GCGGTGTGCT GCCCGCTCAG ATCTGGCGGA CGTACATGAC CGATGTGATC GCCATCAAGG GCCTGGAGCC CAAGAGCTTC GGCCCGCCCT CCAGCTACGT CTCCGGGGGC GGCGGGTACA GCCGGTCGAC CGGGCAGCCG AGCGCCCCGG CCCCGCAGGG GCCCGACAGC CCGCGGCCCA GGCCGACCAG GCCCGCGCCG CAGCCCTCCG TCCCCCGTCC GCCCGAGCCG GGCGGCCCGG AGGACCCCGG TCCCGAGCAG CCGAACCCGC CGCAGCCGCC GAACCCGCCC GACGGCGGCG GTGACGGCGG TGGCGGCGGC GACGGCGGTG GTGGCGGCGG TGAGGGCCGC CGGCAGAACC CCGCGCAGCA GCAGGGCATG GCGCCGCAGC AGGGCGAGGC GCCCCACGGC CCGGCCGGCC GCCGCGAGTG A
|
Protein sequence | MGSDGTTPIE PDDGGAAASP VTGTPPPSAG LPAAPRPGGS SGTPPGTDAA PGASSAPAAE AAPGDATEID ADPDDFSDDT METHADRGTP SDQTAATRSE PDEPAGGIAP EADAGPDGSS DAADNESGSG TAANAEPTGT TTAAEAGSGG SSNAADNEPG TGTETNAEPT GTTATAETGS GGSSGEAAGD EPCTGTPTGD GTEAHPEHTA PDPDKPSEAA EEEHPVSSTP TEHDAESGDL PGDAADNEPG TGTAANAEPT GTTTAAEGGS GGSSGEVASD DPGSGTPTGD GTEAHPEHTA PDPDKPSGAA EEEHPASSTP TEHDAESGDA ASGSRTAADV LDGRDGTVPS AVADTGDAVE SRAVSAGAAG AAAEPEAAAA DPAGGGHGRT DASAGDGAEG RSVSGGDASP GAAAESEADP GGDSTRTRLD LPQLPDDADS TRVDLPKLSD ESEQTRVDLP ALSDAPAASA SRKEGEAAAV GDAMAEAGRT STSETAGADA ALVQSSGAAG TAMVAGAAAG GGGTAQAPAG ETGAASAAAS LGSVAVPDKG EDAAVGAAAE REKAGRSGRR GGRRRKQRSR LVRYTRRAAI AMLLFLGLAI AAFAVAYMLT PVPSPQEDAT AQGPEFYYDD GKTLIAKIGV NRHKVDLEVV PEHVRNAVIA AENRSFYEDP GVSIRGTIRA FWSTVSGEQL QGGSTITQQM VRNYYQGLGQ ERTINRKFKE IMVSLKVGSQ RDKDWILEQY LNTIYFGRDA YGIQAAAQAY YDKDVGQLTK AEAAYLAAAI QQPTPFGNPS GDNRAMAENR WRSVVNAMVE TGALTPAEAA AMKFPMPVEQ KVTDILKGQI GYMVRIAQKE LRERRGYTED QINRGGLKIV TTFDKDLMRA AEKAVKANLP PNVGERTLTG LVSVDPATGQ VVAFYGGRGY LEEQLSTAFG HWAQAGSGFK PIVLATALAN GKTLGSVVNG SSPQYYNGTP VRNSGGASYG MINLVTATQN SVNTGYVNLG LEVGLDKVTE MAEKMGIPRE QLTANGANKA PTFSLGVVSV HPVQQAGVYA TFAAEGIFRT PYVVKSVTEL DGDKHVYTEK GRRVFSPQVA RDATYAMTKV VESGTGTNAR LYDGRDVAGK TGTTDNGNAL WFNGFIPQLA TSVAIFRADS PTKQVEIGGY SAFGGVLPAQ IWRTYMTDVI AIKGLEPKSF GPPSSYVSGG GGYSRSTGQP SAPAPQGPDS PRPRPTRPAP QPSVPRPPEP GGPEDPGPEQ PNPPQPPNPP DGGGDGGGGG DGGGGGGEGR RQNPAQQQGM APQQGEAPHG PAGRRE
|
| |