Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2522 |
Symbol | |
ID | 9156683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2613278 |
End bp | 2615377 |
Gene Length | 2100 bp |
Protein Length | 699 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | transketolase |
Protein accession | YP_003647465 |
Protein GI | 296140222 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCAGCA CCGCCGAGAT CCAAGCCCTC ACCACCCCGC ATCACCCCGC CGACTGGACC GATCTCGACA CACGTGCGGT GGACACCGCC CGAGTGCTGG CGGCCGACGC GGTGCAGAAG GTCGGCAACG GCCACCCGGG TACCGCCATG AGCCTCGCTC CCCTGGCCTA CACCCTCTTC CAGAAGGTGA TGGTGCACGA CCCGTCGGAC ACGCACTGGA TCGGCCGGGA CCGGTTCGTG CTCTCCTGCG GCCACTCCAG TCTCACGCTG TACCTCCAGC TCTACTTGGG CGGCTTCGGC CTGGAACTGA GCGACATCGA GGCATTGCGC ACCGAAGGGT CCCTGACCCC GGGCCATCCC GAGTACGGGC ACACCAAGGG CGTCGAGATC ACCACCGGTC CGCTCGGCCA GGGCCTGGCC TCCGCCGTCG GCATGGCGAT GGCCTCGCGG TACGAGCGCG GCCTGTTCGA CCCGGAGACC GCGCCGGGAC AGAGTCCGTT CGACCACTTC ATCTACGTGA TCGCCTCCGA CGGCGACATC GAAGAGGGCG TCACCTCCGA GGCCAGTTCC CTCGCCGGTA CCCAGCAGCT GGGCAACCTG ATCCTGTTCT ACGACGACAA CAAGATCTCC ATCGAACACA ACACCGACAT CGCGCTGTCG GAAGACGTCG CGAAGCGCTA CGAGGCGTAC GGCTGGCACG TGCAGTACGT CGAGGGCGGC GAGAACGTCA CCGCGATCGA GGAGGCCACC GCGGCCGCCA AGGCCGTCAC CGACAAGCCC TCGATCATCA TCGTGCGCAC CATCATCGGC TTCCCCGCCC CGAACAAGAT GAACACGGGC GGCGTGCACG GCTCGGCGCT CGGCGCCGAC GAAGTGGCCG CGGTGAAGGA GATCCTCGGC TTCGACCCGG CGAAGAGCTT CGACGTGGAC CCCGAGGTCA TCGCGCACTC CCGTGAGTTG GTCAAGCGCG GCCAGGCCGA GCACGAGGCG TGGAACGCGA AGTTCGACGC CTGGGCCTCC GCCAACCCGG AGCGCAAGGC GCTGCTCGAC CGGCTCGAGG CCGGCACGCT CCCCGAGGGC TGGGACGCCG AACTCCCCAC GTGGGGCGTC GACGACAAGG CCATCGCGAC CCGCGCCGCA TCGGGCGCCT TCCTCGCGGC AGCCGGCCAG ACCCTGCCCG AGCTGTGGGG CGGCTCCGCC GACCTCGCCG GGTCGAACAA CACCACGATC AAGGGCGCCG ACTCCTTCGG CCCGCCGTCG ATCTCGACCG ACGACTGGAA CGCGCAGCCC TACGGCCGCA CCCTGCACTT CGGCATCCGC GAGCACGCCA TGGGCTCGAT CCTCAACGGC ATCGTGTTGC ACGGACCCAC CCGTCCGTAC GGGGGCACGT TCTTGCAGTT CGCCGACTAC ATGCGTCCCG CGGTTCGCCT CGCCGCTCTC ATGGACATCG ATCCGATCTA CGTGTGGACG CACGACTCGG TCGGTCTCGG TGAGGACGGT CCGACGCACC AGCCGGTCGA ACACCTCGCT GCACTGCGCG CCATCCCGGG CTTGAACGTG GTGCGTCCGG CCGATGCGAA CGAGACCGTC GCCGCGTGGA AGGCCACTCT CGAGCGCACC GGCGGAAACG GCCCCACCGG CCTGATCCTC ACCCGCCAGG GCGTGCCGAT CCTGCCCGGC ACCTCCGCCG AGGGCGTCGC CCGCGGCGCT TACGTCCTCA AGGAAGCCGA TGGCGGCACG CCGGACGTGA TCATCATCGG CACCGGGTCC GAGGTGCAGC TGGCCCTCCA GGGCGCGGAG CTGCTCGCCG CGAAGGGCAT CAAGGCGCGC GTCGTCTCGA TGCCCTCGGT GGAGTGGTTC CACGCCCAGG ACCAGGCCTA CCAGGACAGT GTGCTGCCTC CGTCGGTCAA GGCGCGGGTC GCGGTCGAGG CGGGCATTGC ACAGTCGTGG TGGCGCATCG TCGGCGGTTT CGGCGAGGTG CTCTCGCTGG AGCACTTCGG CGAGTCCGCC TCGGACAAGG TCCTGTTCGC TAAGTACGGA TTCACCGGCG AGAACGTGGC CCAGAAGGCC GAGCAGTCCC TCGCCAACCT GAAGGGATAG
|
Protein sequence | MTSTAEIQAL TTPHHPADWT DLDTRAVDTA RVLAADAVQK VGNGHPGTAM SLAPLAYTLF QKVMVHDPSD THWIGRDRFV LSCGHSSLTL YLQLYLGGFG LELSDIEALR TEGSLTPGHP EYGHTKGVEI TTGPLGQGLA SAVGMAMASR YERGLFDPET APGQSPFDHF IYVIASDGDI EEGVTSEASS LAGTQQLGNL ILFYDDNKIS IEHNTDIALS EDVAKRYEAY GWHVQYVEGG ENVTAIEEAT AAAKAVTDKP SIIIVRTIIG FPAPNKMNTG GVHGSALGAD EVAAVKEILG FDPAKSFDVD PEVIAHSREL VKRGQAEHEA WNAKFDAWAS ANPERKALLD RLEAGTLPEG WDAELPTWGV DDKAIATRAA SGAFLAAAGQ TLPELWGGSA DLAGSNNTTI KGADSFGPPS ISTDDWNAQP YGRTLHFGIR EHAMGSILNG IVLHGPTRPY GGTFLQFADY MRPAVRLAAL MDIDPIYVWT HDSVGLGEDG PTHQPVEHLA ALRAIPGLNV VRPADANETV AAWKATLERT GGNGPTGLIL TRQGVPILPG TSAEGVARGA YVLKEADGGT PDVIIIGTGS EVQLALQGAE LLAAKGIKAR VVSMPSVEWF HAQDQAYQDS VLPPSVKARV AVEAGIAQSW WRIVGGFGEV LSLEHFGESA SDKVLFAKYG FTGENVAQKA EQSLANLKG
|
| |