Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_1915 |
Symbol | |
ID | 9156070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1998827 |
End bp | 2000071 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Sterol 3-beta-glucosyltransferase |
Protein accession | YP_003646867 |
Protein GI | 296139624 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCACCG TTCTCATCGC CGCCTACGGT TCCCGCGGCG ACATCATGCC GCTCACTGAC ATTGGCTGCC GACTCCGCGA CGCCGGCCAC CGCGTAGTGC TCACCTCCAA CGGCGAACTG GATGACGAGG TCCGCGCGAC CGGGCTCGAA ACGCGCGGAA TCTCCTTCGA TGTCGACCGC GACCTGGAAA CCGGCGAGGA GGACGCCCTC AAGGTGGCGC TCCAGGTGGT GAAGCCGGCG GGAATCCGCA GGCTGGGCAA CAGTTTCCTC GATGTCGTCG CGGATCTGGA GCCAGATCTG GTGATGCTCA CTCCGTTCAC CGAGCTCCCC GGGCACGCCC TCGCTGAGGC GCACGGCATC CCAACGCTCG GCCTCCGGTT CCAGCCGATG TCCGCGACCC GTGCCTACCC GCCCAGTCTG CTCGGAGCCC GATCGCTGGG CGGGCCGGGC AACCGTGCGG TCGGGAACCT CGCGGTCGCC GCGGTCGACC GTGTATACGG CGGTGCCGTC GCCGATTTCC GTCGGCGCCT CGGCCTCCCG GTGCAGTCCG CACGGGCACT CCGCCGAACC CGCACCGCGC AGGAGTGGCC GATCCTGTAC GGCTATTCCC CGTCGGTACT CCCGCGCCCC GCAGACTGGC GGACGGGTAT CAATGTCACC GGCTATTGGT GGTCGCGAGG ACTCGAAAGC TGGACGGCAC CTGTGGACCT CGAAGAATTC CTGGCCGCCG GTCCGCCGCC GGTATTCGTC GGGTTCGGCA GCCTTCCGGT GACCGATGCC GAGCGCGACC GCCTCGCCCA TACGGTGCGG GCGGCGGCGC TCGGCTCGGG ACAGAGGTTC CTGGTGCAGG CCGGGGGAGC AGGGTTGACG GTGGAGAACG ACGAGCACAC CCTCTCCATC GGCACCGTCC CCTACGACTG GCTGTTCAGT CGCGTCGCGG CGGTGGTGCA CTCCTGTGGC GCGGGCACCA CCGCTTCCGG TCTTCGGTCG GGAGTTCCGA CGGTCGGCGT GCCCTCGCCC GGCGGTGATC AGCAGTTCTG GGCGGAGCAA CTCCGCCGCC TCGGAGTGAG CCCCGCGACG CTGCCACGAC CGGCATTGCG CGCCGAGCGC CTCACCGACG CAGTGACGGC TGCGATCACC GACCCGTCGT ACCGGGAGGC CGCGGCGCGG ATCGCCGAAC GCATCCGGCA CGAAGACGGT GCGGGCCGTG TCGTCACTGA GGTCGAGCGG CTTCTCGGGC GATGA
|
Protein sequence | MATVLIAAYG SRGDIMPLTD IGCRLRDAGH RVVLTSNGEL DDEVRATGLE TRGISFDVDR DLETGEEDAL KVALQVVKPA GIRRLGNSFL DVVADLEPDL VMLTPFTELP GHALAEAHGI PTLGLRFQPM SATRAYPPSL LGARSLGGPG NRAVGNLAVA AVDRVYGGAV ADFRRRLGLP VQSARALRRT RTAQEWPILY GYSPSVLPRP ADWRTGINVT GYWWSRGLES WTAPVDLEEF LAAGPPPVFV GFGSLPVTDA ERDRLAHTVR AAALGSGQRF LVQAGGAGLT VENDEHTLSI GTVPYDWLFS RVAAVVHSCG AGTTASGLRS GVPTVGVPSP GGDQQFWAEQ LRRLGVSPAT LPRPALRAER LTDAVTAAIT DPSYREAAAR IAERIRHEDG AGRVVTEVER LLGR
|
| |