Gene Information |
Locus tag | Tpau_0079 |
Symbol | |
ID | 9154213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 85384 |
End bp | 86559 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | glycoside hydrolase family 5 |
Protein accession | YP_003645072 |
Protein GI | 296137829 |
COG category | |
COG ID | |
TIGRFAM ID | |
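The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 85384
end_bp = 86559

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1176
print(protein_length)   # 391
```

Under these assumptions the results match the reported Gene Length (1176 bp) and Protein Length (391 aa).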
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.265461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTTTTCG TTGTTGCCGT GCTGGTCGCC GCCATGTCCG CGATTGCCAT CGGGCCTGCG CGAGCGGCGG TCCCCGTCGG CGATGGGCAA CGCGGGATCA ACGTCACCAC GATGGAGATT CAGGCGGTGA CCATATTTGA CCACCCCACC ATCACCAACA CCGGCGAACC TCAATCGTCG TACAACTATC TCGCGAGCCT CGGCCACAAG CTGATCCGCC TGCCCATCAG TTGGAACTTT CTCCAGCCCA ATCTCGACAC CGGCAACCGA ACATTTCACG CCGCGTACTG GACCGCGATC AAGGCAGAAG TTGCCAAGAT CAAGGCAGCA GGCCTGAAAA CCGTGCTCGA CCTCCATAAC GGCTGCGAAT GGACGAAGCC AAAGACGACA GCACCCGTGA AAGTCTGCGG CGCGGGACTC ACGCTCGCCG ACACGAACGA CGTATGGCTC CAATTGTCCA ATCAATTCAA GAATGAACCC AGCGTGGTGG CGTACGACCT CTTCAACGAA CCGGTGCGAT TCAATCACCC GACTCGCAGC GAACTCCAGA ATGCCGCCGA CCAGCCGTAC ACGTCGTACA AGGCGGCCGT GAACTCGATC GTCGCCGCTC TGCGTGCCAA CAACGACAGC AAGAAGATCT GGGTCGAATC GCTGTGCTGC CATAGAGAAC ACGACCTCGC CAGCACCGAT CCCAACGGCG GGTGGGTCAT CGATCCGCTG AACAAGATCG TGTACTCCCA GCACATGTAC CCCGTGAGCG ATAGCTCCAA GGGCGAAGTG TTCGATATGG CCAAGATCGA TCCCAACTAC GAACAACCTC AAGGCGACTT CTGGGCGGAC TGGGGCTACG TCCGAGGTTT TCTCTCGCGG CTCGACAGGT TCGCCGGGTG GTGCGATCGC AGCAACGTCC AGTGCTCCAT CGGCGAGGTC GGCTGGTACG GCGACGGGCA GTCGCCAGAC AGCGCCGCGT TGTGGAACCA GCTCGGCGAT GAGTGGTACA ACAAGGCCAA TTACCACGGA TTCGCGGTGA CGTACTACGG AGCGAGTTCG ACATCGCCGG GCAGCCTATG GGCCTACGAC TCCGCCACCC CCGCATGGCA TCCTGCACCC GGATTCAGCC GCAAGCAACC GCAGGCGCTG ATCCTGGAGA AGCCGTCACA TCTCTCACGG CCCTAG
|
Protein sequence | MVFVVAVLVA AMSAIAIGPA RAAVPVGDGQ RGINVTTMEI QAVTIFDHPT ITNTGEPQSS YNYLASLGHK LIRLPISWNF LQPNLDTGNR TFHAAYWTAI KAEVAKIKAA GLKTVLDLHN GCEWTKPKTT APVKVCGAGL TLADTNDVWL QLSNQFKNEP SVVAYDLFNE PVRFNHPTRS ELQNAADQPY TSYKAAVNSI VAALRANNDS KKIWVESLCC HREHDLASTD PNGGWVIDPL NKIVYSQHMY PVSDSSKGEV FDMAKIDPNY EQPQGDFWAD WGYVRGFLSR LDRFAGWCDR SNVQCSIGEV GWYGDGQSPD SAALWNQLGD EWYNKANYHG FAVTYYGASS TSPGSLWAYD SATPAWHPAP GFSRKQPQAL ILEKPSHLSR P
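The relationship between the two sequences can be illustrated by translating the first and last codons of the gene sequence. This is a rough sketch, not the tool that produced the record: it assumes the gene sequence is already given in coding orientation (it begins with GTG and ends with TAG, even though the gene lies on the minus strand), builds the codon table from the usual TCAG-ordered amino acid string, and treats an initial ATG, GTG, or TTG as methionine, as is common for translation table 11. The names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical helpers.

```python
# Codon table in NCBI "TCAG" order; table 11 uses the same
# codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An alternative start codon is still read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 6 bases of the gene sequence above.
print(translate("GTGGTTTTCG TTGTTGCCGT GCTGGTCGCC"))  # MVFVVAVLVA
print(translate("CCCTAG"))                            # P
```

The first output matches the first ten residues of the protein sequence, and the second matches its final residue followed by the TAG stop codon.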
|
| |