Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2136 |
Symbol | |
ID | 9156292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2226228 |
End bp | 2229239 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | glycoside hydrolase family 38 |
Protein accession | YP_003647086 |
Protein GI | 296139843 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCACGACG AGACCTCACT GACCATCGGC CGCGTCAATC GGGTACTCGG TGAGCGCATC CGGCCGGCGA TCCACTCCGC CCGGATCCCG CTCGACGTCG CCGCCAACCA TCTCCCCGGA GAGCCGATCC CACCCGCCGC GGGCCTCCGG CTCGACTACT CCCCGTTCGC CGATGACCGG CGCTGGGGTC CGGCGTGGGG CACCACCTGG TTCCGGCTCC GCGGCGATGT TCCCGCGGAG TGGGCGGGCA AGCAGGTCGA GGCACTGATC GACCTCGGTT TCGACGTGAA CCAGACCGGA TTCCAGTGCG AGGGACTCGC CTACCGGTGC GGGCTCGACG GTGCGCCGGT GCCCGTGAAG GGCATCAACC CGCGAAACCA GGCGGTGGCG GTGGCCGACC GGGCCACCGG CGGCGAGCAG GTGGAGCTGT TCGTCGAGGC GGCGGCGAAC CCCGTGATCC TCGATTACCA CCCGTTTCTG CCCACCCGGC AGGGAGACGT GCTCACCTCC TCCCCGGAGC CCCTGTACCG GGTCCGCGCC ATGGATCTGG CCGTGTTCGA GCCCGAGGTG TTCGCGCTCG CTCTCGACGT CGAGGTGCTG TTGGAACTCC AGGCCGAACT CCCCGAGACG CAGCCGCGGC GGATGCGCAT CCTCCAGGCG CTCGACGACG CCCTCGACGT GCTCGACCTT CAGCGGATCG TCGAGACCGC GCCCGCGGCC CGCGCCGCAC TGGTGCAGGT TCTGGCGGCG CCCGCGGATG CGAGCGCGCA TCGGATCGCC GCGATCGGAC ACTCTCACAT CGACTCCGCG TGGCTGTGGC CATTGCGGGA GACCATACGC AAGGTCGCCC GCACCACCGC CTCGATGACC ACATTGCTCG ATGAGCAGCC ACAGTACAGC TACGGCATGT CGAGTGCGCA GCAGTACGCG TGGGTGAAAG AACACCGCCC CGAAGTGTGG GAGCGGATCC GCGCGGCGGT GGCGGACGGG CGCTTCCTTC CGCTCGGCGG AATGTGGGTG GAGGCGGATA CGGTGATGCC CACCGGTGAA TCGCTGGCGC GCCAGTTCTC GTACGGACAG CGGTTCTTCG AGCGCGAGTT CGGGATCCGC AGCCGTGGAG TGTGGCTGCC CGACAGCTTC GGATACTCCC CCGCGCTGCC GCAACTCATG CGCCGCGCCG GCTTCGACTG GTTCTTCACT CAGAAGATCT CCTGGAACCA GCGAAACGTG TTTCCGCACC ATACGTTCGA TTGGGAGGGG ATCGACGGCA CACGGGTCTT CACCCATTTC CCGCCGATGG ATACCTATTG CTCGTCGCTC TCCGGCGCGG AGGTGGCGCG GGCCGCGCGG CAGTTCAAGG AGAGTCGGGT CGCGACCCGG TCGATCGCCC CGGTCGGTTA CGGCGACGGT GGTGGCGGTA CCACCCGCGA AATGATCGGC AAGGCCGAAC GTCTCGCGAA TCTGGAGGGC AGTGCGCGAG TCCACTGGGT GCACCCGGAC GAGTTCTTCG ACGCAGCGAA GGCCGAACTG CCCGATCCCG CGGTCTGGGT CGGCGAGCTG TACCTTGAGC TGCACCGGGG CACCCTCACC AGTCAGCACG CCACCAAGGC CACGCACCGG CGGTGCGAGC AGGCATTGCT GGAGGCCGAG TTGTGGGCCG CGACCGCGGC GGTACAGCAG GGCCTCGCGT ACCCGTACGG CGACCTCGAC GCCCTGTGGG AGCAGGTACT CCTGCACCAG TTCCACGACA TCCTGCCGGG GACTTCGATC GCGTGGGTGC ACCGGGAGGC GGTGGCCGTG CTCACCGACG TTCTCGCCCG GGCCCGCGGC CTCGCCGCCG ACGCGCGCCG CGCGCTCGCG GGTGACGGTG GAGTCGACCT CGTCTTCCGA CCGGTACACA CCCCGGTCGA CGCGGCCGGC GCGCTCGGCG CCGCACCCGC CGCACCACCG TCCGGCGCGG TGACCCTGAC CCCGCGGGCC GGCGGATACC GCCTGACCAA CGAACTGATC TCGGTCACCG TCTCGAAGAA CGGGACCATC ACCTCGGCGA TCGACCTCGC CACCGGCCGG GAAGCGATCC CGGACGGGCG GCCCGCCAAC CTCTTCCAAC TGCATCAGGA CTTCCCGAAC GCCTGGGATG CCTGGGACAT CGATCGGTAC TACCGCAACC GGGTCGACGA TCTCACCGCC GCTACGTCCA TCACCGGAGA TCTCACCGAT GGTGTCGCGG AGGTCGTCGT CACGCGGACG TTCTCGGAAT CATCACTACA GCAGACGATT ACACTCGCCC CTGGGTCGCG CACCGTGATG CTGCGCAATC GGATCGATTG GCACGAGACG GAGAAGCTGC TCAAGCTCGC CTTCCCGCTC GATGTCTTCG CCCGGGAGAC CGCCGCCGAG ACGCAGTTCG GATTCCAGCG CCGGGCCACG CATGTCAACA CCAGTTGGGA GGCGGCGAAG TTCGAAACCT CGATGCATCG ATTCGTCCTC GCCGAGGAGG ACGGCTTCGG TGTCGCCGTG GTCAACGATT CGGTGTACGG CTACGACACC GCCCGCGATG ACGCGCAAGG GGCGATCACG ACCACCGTCC GGGTCTCCCT GCTGCGCGCG CCGCGGTTCC CCGACCCGGA TACCGACCAC GGGGTGCACG AGATCACCGT CGGACTGGTG GTCGGCGCAG ATCCCGCGAT CGCCACGACC GAGGGGCAGC GGCTCAATGC GCCTGAGACG GTGGTCCGCG GCGCGGGGCC GGTGGCGCCG CTGGTCACCC TCGACGGAGA AGGCATCGTC ATCTCCGCGA TCAAACTCGC CGACGACCGC TCCGGCGATG TGATCGTGCG CCTGTACGAG GCTCTAGGCC GGCGCGCCAG AGGCTCGCTG TCCGTAGGGT TCTCGCACGG AGGGATCCGG GAGGTGAGCC TGATCGAAGA CGAGATCGAC GATCCGCGGG TCGGTGGCGA CCTGGACCTG CGGCCCTTCG AGGTGCGAAC CCTGCGGATC AGTCGACGCT GA
|
Protein sequence | MHDETSLTIG RVNRVLGERI RPAIHSARIP LDVAANHLPG EPIPPAAGLR LDYSPFADDR RWGPAWGTTW FRLRGDVPAE WAGKQVEALI DLGFDVNQTG FQCEGLAYRC GLDGAPVPVK GINPRNQAVA VADRATGGEQ VELFVEAAAN PVILDYHPFL PTRQGDVLTS SPEPLYRVRA MDLAVFEPEV FALALDVEVL LELQAELPET QPRRMRILQA LDDALDVLDL QRIVETAPAA RAALVQVLAA PADASAHRIA AIGHSHIDSA WLWPLRETIR KVARTTASMT TLLDEQPQYS YGMSSAQQYA WVKEHRPEVW ERIRAAVADG RFLPLGGMWV EADTVMPTGE SLARQFSYGQ RFFEREFGIR SRGVWLPDSF GYSPALPQLM RRAGFDWFFT QKISWNQRNV FPHHTFDWEG IDGTRVFTHF PPMDTYCSSL SGAEVARAAR QFKESRVATR SIAPVGYGDG GGGTTREMIG KAERLANLEG SARVHWVHPD EFFDAAKAEL PDPAVWVGEL YLELHRGTLT SQHATKATHR RCEQALLEAE LWAATAAVQQ GLAYPYGDLD ALWEQVLLHQ FHDILPGTSI AWVHREAVAV LTDVLARARG LAADARRALA GDGGVDLVFR PVHTPVDAAG ALGAAPAAPP SGAVTLTPRA GGYRLTNELI SVTVSKNGTI TSAIDLATGR EAIPDGRPAN LFQLHQDFPN AWDAWDIDRY YRNRVDDLTA ATSITGDLTD GVAEVVVTRT FSESSLQQTI TLAPGSRTVM LRNRIDWHET EKLLKLAFPL DVFARETAAE TQFGFQRRAT HVNTSWEAAK FETSMHRFVL AEEDGFGVAV VNDSVYGYDT ARDDAQGAIT TTVRVSLLRA PRFPDPDTDH GVHEITVGLV VGADPAIATT EGQRLNAPET VVRGAGPVAP LVTLDGEGIV ISAIKLADDR SGDVIVRLYE ALGRRARGSL SVGFSHGGIR EVSLIEDEID DPRVGGDLDL RPFEVRTLRI SRR
|
| |