Gene Information |
Locus tag | Tpau_2783 |
Symbol | |
ID | 9156948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2879630 |
End bp | 2881618 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | glycoside hydrolase 15-related protein |
Protein accession | YP_003647720 |
Protein GI | 296140477 |
COG category | |
COG ID | |
TIGRFAM ID | |
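The coordinate and length fields above can be cross-checked against each other. For a non-spliced CDS, the gene length should equal End bp minus Start bp plus one, and the protein length should equal the codon count minus the stop codon. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this convention); `check_lengths` is a hypothetical helper name, not part of any database tool.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return (span_ok, protein_ok) for a non-spliced CDS record.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    # Length implied by the coordinates.
    span = end_bp - start_bp + 1
    span_ok = span == gene_length_bp

    # A complete CDS is a whole number of codons; the last one is the stop.
    codons, remainder = divmod(gene_length_bp, 3)
    protein_ok = remainder == 0 and codons - 1 == protein_length_aa
    return span_ok, protein_ok


# Values taken from the Tpau_2783 record.
print(check_lengths(2879630, 2881618, 1989, 662))  # (True, True)
```

With the values in this record, 2881618 - 2879630 + 1 = 1989 bp, and 1989 / 3 = 663 codons, one of which is the stop codon, giving 662 aa.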
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.818274 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCCCCGA AGCAGAACGC AGTAGCAGAC AGCGCCGATG CCGTCGTCAC GGCAGCGGCA GCCGAGGTCG CGGCGGTGGT CGTCGGCGAC AACGCCTTCC CGCCGATCCA CGACTACGCC TTCCTCTCGG ACTGCGAGAA CACCTGCCTG ATCGCGCGGG ACGGCAGCGT CGAGTGGATG TGCATCCCTC GCCCGGACTC CCCCAGCGTG TTCGGCGCCA TCCTGGATCG CGGTGCGGGA TCGTTCCGGC TCGCCCCGTA CGGGGTGCGA GTCCCGGTCG ACCGCCGCTA CCTGCCCGGC AGCCTGATGT TGGAGACCAC GTGGCAGACC TCCACAGGCT GGCTCATCGT GCGGGACGCC CTGGTCATGG GGCCGTGGCA CGACACCCAG ACACGGTCCC GCACGCACCG TCGCACTCCC ACCGACTGGG ATGCCGAGCA CATCCTGCTG CGCACCGTTC GGTGCGTTTC GGGCACGGTG GAGTTGCAGC TCAACTGCGA GCCGGCCTTC GATTACAACC GGGCGGGCGC GAAATGGGAG TACTCCTCGG ACGGCTACGG CGAGGCGATC GCCCGCGCGC GATCGGAGAC GGACCTGCAT CCCGAGCTGA AGCTCACCAC CAACATGCGG CTCGGCCTGG AGGGCAGGGA GGCCCGCGCG CACACCAGGA TGCAGGAGGG CGACGAGGCA TTCGTCGCAC TGTCGTGGAG CAAGCATCCG GTGCCGCAGA CCTTCGCGGA GGCTGCGAAG AAGATGTGGT CGACGGCGGA GTCGTGGCGG CAGTGGATCA ACCGGGGCCA GTTCCCCGAC CACCCGTGGC GCTCGTATCT GCAGCGTTCG GCGCTCACAC TCAAGGGGCT CACCTACTCG CCGACGGGTG CCTTGCTCGC GGCGTCGACC ACGTCGCTGC CCGAGACCCC GCACGGGGAG CGGAACTGGG ACTACCGCTA CGCCTGGGTG CGCGACTCGA CGTTCGCACT GTGGGGCCTG TACACACTGG GCTTGGACCG GGAGGCCGAG GACTTCTTCG CGTTCATCGC CGACGTCTCG GGGGCGAACA ACGGTGAGCG ACATCCGTTG CAGGTGATGT ACGGCGTGGG CGGCGAACGC ACGCTGATCG AGGAGGAGCT CCCGCACCTG TCGGGGTACG ACGGTGCGCA GCCCGTCCGG ATCGGCAACG GCGCGTACAA CCAACGCCAG CACGACATCT GGGGCACGAT GCTGGACTCG GTGTACCTGA ACACCAAGTC CAGTGAGCGC ATCCCGGAGA CCCTGTGGCC AGTGCTCAAG GAGCAGGTCG AGGAGACGCT CAAGCATTGG CGTGAGCCCG ACCGTGGCAT CTGGGAGGTG CGGGGCGAAC CGCAGCACTT CACCTCGTCG AAGGTGATGT GCTGGGTGGC GCTCGATCGC GGCGCCAAGC TGGCGGCGAT GCACGGTGAG GACGATTACG CCCGCAAGTG GGCCGAGCAG GCCGACGTCA TCAAGGAAGA CATCCTCGAA CACGGCGTGG ACGAGCGCGG GGTGTTCACC CAGCGGTACG GCAACGACGC GCTCGATGCC TCGCTGCTAC TGGTGCCGCT GCTGCGCTTC CTGCCCTCCG ACGATCCGCG GGTGCGGGCG ACGGTGATCG CCATCGCCGA CGAGCTCACC GAGGACGGCC TGGTGCTGCG GTACCGCGTG GAGGAGACCG ACGACGGACT CTCCGGCGAG GAGGGAACCT TCACGATCTG CTCGTTCTGG CTGGTGTCGG CGTTGGTGGA GATCGGCGAG GTGGCCCGCG GCCGGGCGCT GTGCGAGCGG 
CTGCTGGCCT ACTCGTCACC GCTGAAGCTG TACGCCGAGG AGATCGACCC GAAGTCCGGC AAACACCTGG GCAACTTCCC GCAGGCGTTC ACGCATCTGG CACTGATCAA CGCCGTGGTG CACGTGATCC GCGCCGAGGA CACCGCGCAC GACGCGTCGG CGTTCCTCCC GGCGCATCAC ACGCCGTAG
Protein sequence | MAPKQNAVAD SADAVVTAAA AEVAAVVVGD NAFPPIHDYA FLSDCENTCL IARDGSVEWM CIPRPDSPSV FGAILDRGAG SFRLAPYGVR VPVDRRYLPG SLMLETTWQT STGWLIVRDA LVMGPWHDTQ TRSRTHRRTP TDWDAEHILL RTVRCVSGTV ELQLNCEPAF DYNRAGAKWE YSSDGYGEAI ARARSETDLH PELKLTTNMR LGLEGREARA HTRMQEGDEA FVALSWSKHP VPQTFAEAAK KMWSTAESWR QWINRGQFPD HPWRSYLQRS ALTLKGLTYS PTGALLAAST TSLPETPHGE RNWDYRYAWV RDSTFALWGL YTLGLDREAE DFFAFIADVS GANNGERHPL QVMYGVGGER TLIEEELPHL SGYDGAQPVR IGNGAYNQRQ HDIWGTMLDS VYLNTKSSER IPETLWPVLK EQVEETLKHW REPDRGIWEV RGEPQHFTSS KVMCWVALDR GAKLAAMHGE DDYARKWAEQ ADVIKEDILE HGVDERGVFT QRYGNDALDA SLLLVPLLRF LPSDDPRVRA TVIAIADELT EDGLVLRYRV EETDDGLSGE EGTFTICSFW LVSALVEIGE VARGRALCER LLAYSSPLKL YAEEIDPKSG KHLGNFPQAF THLALINAVV HVIRAEDTAH DASAFLPAHH TP
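The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is illustrative only: it builds the codon table by hand rather than using a library, and `translate` is a hypothetical helper. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. Because this gene begins with ATG, the sketch does not handle alternative start codons.

```python
# Codon table in the conventional T, C, A, G ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated 10-base blocks used in the
    record can be pasted in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGCCCCGA AGCAGAACGC AGTAGCAGAC"))  # MAPKQNAVAD
print(translate("ACGCCGTAG"))  # TP
```

The first call reproduces the opening ten residues of the protein sequence, and the second reproduces its final two residues followed by the TAG stop codon.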