Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0060 |
Symbol | |
ID | 9154194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 60674 |
End bp | 62644 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003645053 |
Protein GI | 296137810 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.19162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCCGCG ACGCCGAACG CGCCTACCTC CCGGGTGGCC TCCAGTCCGG CCCACCGCCG GCCGCCCCGA CGCCCCGCAC CCCGGCGGTG TTGATCGTGG CCTACCGCAA CCCCGCCGAT CTGCGGGCAT GCCTGCGGTC CGTCGGCGAG CACCTGCCGG ACCTTCCGGT GCTGGTGTGG GACAACTCCG TTCCCGCCGA ACCCGAGATG GCGAGTCTCG CCGCGGAGTT CCCGGACGTC CGCTGGTACT CGACCGGCGA GAACCTGGGA TTCGCGGCGG GCGTCAACCG GTTGGCCGAA GCGGCACCCG ATCACGATCT CCTGCTTCTC AATCCGGACG CCGTGCTCCG CGACGACCTC GCCCGGACCC GGGCCGCGCT CGGCACGGCC GGCGTGGCGG CCGCCGCGCC CGGTGTCCGC GACCCGGACG ACCACGACGG TCTGCGCCGG CCGTGGGACG TGGCGCACCG GCCACGCGGC CTGGTTCGGA GCCTGGTCTC CCATGCCGGT TACGCCGAGC GACTGCGCGG CACACCGCTC TCGGATCTGT ACTCGACACG ACCCGATTCG GTATCCGGCT ATCTGACCGG TGCGTGCCTG GCGATCTCCC GCGACGCCTG GAACGCGCTC GGACCGTTCG ACGAGGAGTA CTTCCTCTAC GGCGAGGAGT CCGACTGGCA GACGCGGGCC ACCGATGCAG GCTGGCGCCT GGTACTGACC GAGGATCCGG GAGTCGATCA CACCGGCCAC GGCACCGTCC GCAACGACCG TTCGGCAAGC AGGCGCTCCG CAGACCTGTT GCGCGCCAAT GTCGCTCTGA ACCTCGAACA CGCAGCCGGG CCGCGCGTGG CCGGCGCCTT CCTGGTGGGG CATTCCGCGC TCGACCGGGT GCAACGATCG AAACGGCGCC CACGCACGCG GGCGACGACA CCGTCGATCG TGATCACCAC CAACCGCCTG GTGTTCGGCG GTGCCGAGCG TCAGCACGTG GTCCTCGCCT CCGAACTGGC CCGCCGCGGG CACGACGTGA CGATCGCGTG TATGCAGCGA TTCGGGCCAC TGGTGCGCGA GATACCGCAC GGAGTGCGGG TGGTGCGCCA ACCGTGGTGG GCACCCGCGC TCGAACTGCC CGCGGGGCCA CAGATCCTGA TCAGCGGCGA CACCAACACG GAGACCGGTT TCGCGACCCT GTGGCGAGCC GCCGGACGGA ACCGGAAGTG GTTGGTGGCC GCGCATATTC CACCCGCGCT CGACGGACCC ACCTACTCCA CCGGACTGGC GGCCGCGATG CGCCGGGCGG ATGGTTTCAT CGCCCTCTCC GATCGCCATT GGACCGAGGC GACCGCCCAT CAGGACTTGG GGTCGCGCCA CTTCACCGCA CCTAACGGTG TGGCCTCGGC GGCCTCGCTC GGCGGCGTCC CACCACGCCC CGCGGTCGGC GATCCGCCGC GCCTGGCGAT GTTGTCCCGG ATCGTCGAAC ACAAGAACCC GCACCTTCTG GTGGAGGCGC TCGCCGGCCT GCGCGAGATG CCCTGGGAGT TGTCGATCTA CGGCGACGGT CCCGACCGCG CGCGCCTGGA AGAACTCACC CCGGACGATC TGCGTGACCG GGTGCACTGG CGCGGCTGGT CGCCCGGCCC CGATCACGCG CTCGCCGAGG CGGATCTGTT GTGCGTCCCG AGCCGGTCCG AGGCCTTCCC GCTGGTGATC CTCGAAGCGA TGGCGCGCAA GGTGCCGGTG GTCGCATCGT CGGTGTGCGC AGTGCCGGAC ATGCTCGACC ACGGCCGGGC CGGGGCGCTG GTCGACGACG TCTCGGTTAC CGGGTGGCGC GATGCGCTCG CGGCCCTGCT CGAACGCCCT GAGGAATGGG CGGCACTCGG GGAGAACGGA TTCGACCGGA TGCGAGAGCG GTACACGATC GAGGCGATGG CCGACGCGTA CGAGGCCGCT TTCGCCGGGG TCGCACCGTG A
|
Protein sequence | MTRDAERAYL PGGLQSGPPP AAPTPRTPAV LIVAYRNPAD LRACLRSVGE HLPDLPVLVW DNSVPAEPEM ASLAAEFPDV RWYSTGENLG FAAGVNRLAE AAPDHDLLLL NPDAVLRDDL ARTRAALGTA GVAAAAPGVR DPDDHDGLRR PWDVAHRPRG LVRSLVSHAG YAERLRGTPL SDLYSTRPDS VSGYLTGACL AISRDAWNAL GPFDEEYFLY GEESDWQTRA TDAGWRLVLT EDPGVDHTGH GTVRNDRSAS RRSADLLRAN VALNLEHAAG PRVAGAFLVG HSALDRVQRS KRRPRTRATT PSIVITTNRL VFGGAERQHV VLASELARRG HDVTIACMQR FGPLVREIPH GVRVVRQPWW APALELPAGP QILISGDTNT ETGFATLWRA AGRNRKWLVA AHIPPALDGP TYSTGLAAAM RRADGFIALS DRHWTEATAH QDLGSRHFTA PNGVASAASL GGVPPRPAVG DPPRLAMLSR IVEHKNPHLL VEALAGLREM PWELSIYGDG PDRARLEELT PDDLRDRVHW RGWSPGPDHA LAEADLLCVP SRSEAFPLVI LEAMARKVPV VASSVCAVPD MLDHGRAGAL VDDVSVTGWR DALAALLERP EEWAALGENG FDRMRERYTI EAMADAYEAA FAGVAP
|
| |