Gene Tpau_2136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_2136 
Symbol 
ID9156292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp2226228 
End bp2229239 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content69% 
IMG OID 
Productglycoside hydrolase family 38 
Protein accessionYP_003647086 
Protein GI296139843 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCACGACG AGACCTCACT GACCATCGGC CGCGTCAATC GGGTACTCGG TGAGCGCATC 
CGGCCGGCGA TCCACTCCGC CCGGATCCCG CTCGACGTCG CCGCCAACCA TCTCCCCGGA
GAGCCGATCC CACCCGCCGC GGGCCTCCGG CTCGACTACT CCCCGTTCGC CGATGACCGG
CGCTGGGGTC CGGCGTGGGG CACCACCTGG TTCCGGCTCC GCGGCGATGT TCCCGCGGAG
TGGGCGGGCA AGCAGGTCGA GGCACTGATC GACCTCGGTT TCGACGTGAA CCAGACCGGA
TTCCAGTGCG AGGGACTCGC CTACCGGTGC GGGCTCGACG GTGCGCCGGT GCCCGTGAAG
GGCATCAACC CGCGAAACCA GGCGGTGGCG GTGGCCGACC GGGCCACCGG CGGCGAGCAG
GTGGAGCTGT TCGTCGAGGC GGCGGCGAAC CCCGTGATCC TCGATTACCA CCCGTTTCTG
CCCACCCGGC AGGGAGACGT GCTCACCTCC TCCCCGGAGC CCCTGTACCG GGTCCGCGCC
ATGGATCTGG CCGTGTTCGA GCCCGAGGTG TTCGCGCTCG CTCTCGACGT CGAGGTGCTG
TTGGAACTCC AGGCCGAACT CCCCGAGACG CAGCCGCGGC GGATGCGCAT CCTCCAGGCG
CTCGACGACG CCCTCGACGT GCTCGACCTT CAGCGGATCG TCGAGACCGC GCCCGCGGCC
CGCGCCGCAC TGGTGCAGGT TCTGGCGGCG CCCGCGGATG CGAGCGCGCA TCGGATCGCC
GCGATCGGAC ACTCTCACAT CGACTCCGCG TGGCTGTGGC CATTGCGGGA GACCATACGC
AAGGTCGCCC GCACCACCGC CTCGATGACC ACATTGCTCG ATGAGCAGCC ACAGTACAGC
TACGGCATGT CGAGTGCGCA GCAGTACGCG TGGGTGAAAG AACACCGCCC CGAAGTGTGG
GAGCGGATCC GCGCGGCGGT GGCGGACGGG CGCTTCCTTC CGCTCGGCGG AATGTGGGTG
GAGGCGGATA CGGTGATGCC CACCGGTGAA TCGCTGGCGC GCCAGTTCTC GTACGGACAG
CGGTTCTTCG AGCGCGAGTT CGGGATCCGC AGCCGTGGAG TGTGGCTGCC CGACAGCTTC
GGATACTCCC CCGCGCTGCC GCAACTCATG CGCCGCGCCG GCTTCGACTG GTTCTTCACT
CAGAAGATCT CCTGGAACCA GCGAAACGTG TTTCCGCACC ATACGTTCGA TTGGGAGGGG
ATCGACGGCA CACGGGTCTT CACCCATTTC CCGCCGATGG ATACCTATTG CTCGTCGCTC
TCCGGCGCGG AGGTGGCGCG GGCCGCGCGG CAGTTCAAGG AGAGTCGGGT CGCGACCCGG
TCGATCGCCC CGGTCGGTTA CGGCGACGGT GGTGGCGGTA CCACCCGCGA AATGATCGGC
AAGGCCGAAC GTCTCGCGAA TCTGGAGGGC AGTGCGCGAG TCCACTGGGT GCACCCGGAC
GAGTTCTTCG ACGCAGCGAA GGCCGAACTG CCCGATCCCG CGGTCTGGGT CGGCGAGCTG
TACCTTGAGC TGCACCGGGG CACCCTCACC AGTCAGCACG CCACCAAGGC CACGCACCGG
CGGTGCGAGC AGGCATTGCT GGAGGCCGAG TTGTGGGCCG CGACCGCGGC GGTACAGCAG
GGCCTCGCGT ACCCGTACGG CGACCTCGAC GCCCTGTGGG AGCAGGTACT CCTGCACCAG
TTCCACGACA TCCTGCCGGG GACTTCGATC GCGTGGGTGC ACCGGGAGGC GGTGGCCGTG
CTCACCGACG TTCTCGCCCG GGCCCGCGGC CTCGCCGCCG ACGCGCGCCG CGCGCTCGCG
GGTGACGGTG GAGTCGACCT CGTCTTCCGA CCGGTACACA CCCCGGTCGA CGCGGCCGGC
GCGCTCGGCG CCGCACCCGC CGCACCACCG TCCGGCGCGG TGACCCTGAC CCCGCGGGCC
GGCGGATACC GCCTGACCAA CGAACTGATC TCGGTCACCG TCTCGAAGAA CGGGACCATC
ACCTCGGCGA TCGACCTCGC CACCGGCCGG GAAGCGATCC CGGACGGGCG GCCCGCCAAC
CTCTTCCAAC TGCATCAGGA CTTCCCGAAC GCCTGGGATG CCTGGGACAT CGATCGGTAC
TACCGCAACC GGGTCGACGA TCTCACCGCC GCTACGTCCA TCACCGGAGA TCTCACCGAT
GGTGTCGCGG AGGTCGTCGT CACGCGGACG TTCTCGGAAT CATCACTACA GCAGACGATT
ACACTCGCCC CTGGGTCGCG CACCGTGATG CTGCGCAATC GGATCGATTG GCACGAGACG
GAGAAGCTGC TCAAGCTCGC CTTCCCGCTC GATGTCTTCG CCCGGGAGAC CGCCGCCGAG
ACGCAGTTCG GATTCCAGCG CCGGGCCACG CATGTCAACA CCAGTTGGGA GGCGGCGAAG
TTCGAAACCT CGATGCATCG ATTCGTCCTC GCCGAGGAGG ACGGCTTCGG TGTCGCCGTG
GTCAACGATT CGGTGTACGG CTACGACACC GCCCGCGATG ACGCGCAAGG GGCGATCACG
ACCACCGTCC GGGTCTCCCT GCTGCGCGCG CCGCGGTTCC CCGACCCGGA TACCGACCAC
GGGGTGCACG AGATCACCGT CGGACTGGTG GTCGGCGCAG ATCCCGCGAT CGCCACGACC
GAGGGGCAGC GGCTCAATGC GCCTGAGACG GTGGTCCGCG GCGCGGGGCC GGTGGCGCCG
CTGGTCACCC TCGACGGAGA AGGCATCGTC ATCTCCGCGA TCAAACTCGC CGACGACCGC
TCCGGCGATG TGATCGTGCG CCTGTACGAG GCTCTAGGCC GGCGCGCCAG AGGCTCGCTG
TCCGTAGGGT TCTCGCACGG AGGGATCCGG GAGGTGAGCC TGATCGAAGA CGAGATCGAC
GATCCGCGGG TCGGTGGCGA CCTGGACCTG CGGCCCTTCG AGGTGCGAAC CCTGCGGATC
AGTCGACGCT GA
 
Protein sequence
MHDETSLTIG RVNRVLGERI RPAIHSARIP LDVAANHLPG EPIPPAAGLR LDYSPFADDR 
RWGPAWGTTW FRLRGDVPAE WAGKQVEALI DLGFDVNQTG FQCEGLAYRC GLDGAPVPVK
GINPRNQAVA VADRATGGEQ VELFVEAAAN PVILDYHPFL PTRQGDVLTS SPEPLYRVRA
MDLAVFEPEV FALALDVEVL LELQAELPET QPRRMRILQA LDDALDVLDL QRIVETAPAA
RAALVQVLAA PADASAHRIA AIGHSHIDSA WLWPLRETIR KVARTTASMT TLLDEQPQYS
YGMSSAQQYA WVKEHRPEVW ERIRAAVADG RFLPLGGMWV EADTVMPTGE SLARQFSYGQ
RFFEREFGIR SRGVWLPDSF GYSPALPQLM RRAGFDWFFT QKISWNQRNV FPHHTFDWEG
IDGTRVFTHF PPMDTYCSSL SGAEVARAAR QFKESRVATR SIAPVGYGDG GGGTTREMIG
KAERLANLEG SARVHWVHPD EFFDAAKAEL PDPAVWVGEL YLELHRGTLT SQHATKATHR
RCEQALLEAE LWAATAAVQQ GLAYPYGDLD ALWEQVLLHQ FHDILPGTSI AWVHREAVAV
LTDVLARARG LAADARRALA GDGGVDLVFR PVHTPVDAAG ALGAAPAAPP SGAVTLTPRA
GGYRLTNELI SVTVSKNGTI TSAIDLATGR EAIPDGRPAN LFQLHQDFPN AWDAWDIDRY
YRNRVDDLTA ATSITGDLTD GVAEVVVTRT FSESSLQQTI TLAPGSRTVM LRNRIDWHET
EKLLKLAFPL DVFARETAAE TQFGFQRRAT HVNTSWEAAK FETSMHRFVL AEEDGFGVAV
VNDSVYGYDT ARDDAQGAIT TTVRVSLLRA PRFPDPDTDH GVHEITVGLV VGADPAIATT
EGQRLNAPET VVRGAGPVAP LVTLDGEGIV ISAIKLADDR SGDVIVRLYE ALGRRARGSL
SVGFSHGGIR EVSLIEDEID DPRVGGDLDL RPFEVRTLRI SRR