Gene Information |
Locus tag | Tcur_1738 |
Symbol | |
ID | 8603065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 2032869 |
End bp | 2034128 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | glycoside hydrolase family 6 |
Protein accession | YP_003299350 |
Protein GI | 269125980 |
COG category | |
COG ID | |
TIGRFAM ID | |
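
The Gene Length and Protein Length fields above can be cross-checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 2032869
end_bp = 2034128

# With inclusive coordinates both endpoints count as bases.
gene_length = end_bp - start_bp + 1

# A non-spliced CDS is read in codons of three bases. The last codon
# is assumed to be the stop codon, which encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1260 0 419
```

The remainder of 0 confirms the gene length is a whole number of codons, and the derived 419 aa agrees with the Protein Length field.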
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0304195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCCC CTCGCGGCGC CCCGGCCCCC GTGCGCGCCC GGCTGCGCGC CTGGTCGGCC CGCTGCGCCG CGCTCGCGAC CGCGGCGGCC CTCACCGCCG CAGCGGCGCC GCCCGCCCAG GCGCATGCCG CCGGCAACCC CCTGCAGGGC CCGAGGGCGG CGAAGGTCCG CTTCTTCGTC GAACCCGACA CCAACGCCGG ACGGCAGGCC CGGGTGTGGG CCGCGCAGGG ACGCTTCCAC GACGCCGCCC TCATGCGGGC GCTGTCGAAG ATCTCCCAGG CGGTCTGGTT CACCGAGGGA ACTCCCCAGC AGGTGCAGCG GCGGGTCCGC GAGACCATGC GCCAGGCCCG CCGCCAAGGC GCCGTCCCCG TGCTGGTGGC CTACTACGTG CCGGGACGGG ACTGCTCGCA GTACTCGGCC GGCGGCGCCC CCAGCGAGCG GGCCTACCGG GAGTGGATCA ACGCCTTCGC CCGCGGCATC GGCGGCGGCC GGGCCGTGGT GATCGTCGAA CCCGACGGGC TGGCCCTGCT GTCCAGCGAG CCGTGGTGCA ACGAAGGCGG CGGCGGCTCC ACCGGCCGGC CGGAGGACAT GTCGCTGGTC GAGCAGCGCT TCCGGGAGAT CGACCACGCC ATCACCACCT TCGCCAAGCT CCCCAACACC GGCGTGTACG TGGACGCCGG CCACTCGGCC TGGCAGCCGC TCAACGACTA CGACGCCGGC TACGGCGAGC CGCGCGCCCA GCTCGGCATC GTCAGCCGCC TGCTGCGCGG CGGCGTCGCC AAGGCCGACG GGTTCGTGCT GAACGTCTCC AACTACCGGG CCGACGCCGA GCTGATCGAC TACGGCGTCC GGGTCTCCAA GTGCCTGTGG CTGCGCCGCA GCACCGGCGC GCGCGAGTGC ACCGACGCCG ACCTGGCCGC CGTGCCCGAC GGCCGGCGCG ACCTGACCCC CTTCGTCCTG GACACCAGCC GCAACGGCCG GGGGCCGTGG ACGGCGCCGG AGGGCGCGTA CCCCGACCCC CAGGAGTGGT GCAACCCGCC CGGCCGCGGC CTGGGCGTCC GCCCCACCAC CCGCACCGGC CACCGCCTGG TGGACGCCTT CCTGTGGGTC AAACGGCCCG GCGAGTCCGA CGGCCAGTGC ACCCGCGGCA CCGCCGGACC GCAGGACCCC GAGTACGGCA TCGTCGACCC GCCCGCGGGC CAGTGGTGGC CCGAGTACGC CCTGGGCCTG GCCCGGCGGG CCGTGCCACC CCTCAAGTGA
|
Protein sequence | MAPPRGAPAP VRARLRAWSA RCAALATAAA LTAAAAPPAQ AHAAGNPLQG PRAAKVRFFV EPDTNAGRQA RVWAAQGRFH DAALMRALSK ISQAVWFTEG TPQQVQRRVR ETMRQARRQG AVPVLVAYYV PGRDCSQYSA GGAPSERAYR EWINAFARGI GGGRAVVIVE PDGLALLSSE PWCNEGGGGS TGRPEDMSLV EQRFREIDHA ITTFAKLPNT GVYVDAGHSA WQPLNDYDAG YGEPRAQLGI VSRLLRGGVA KADGFVLNVS NYRADAELID YGVRVSKCLW LRRSTGAREC TDADLAAVPD GRRDLTPFVL DTSRNGRGPW TAPEGAYPDP QEWCNPPGRG LGVRPTTRTG HRLVDAFLWV KRPGESDGQC TRGTAGPQDP EYGIVDPPAG QWWPEYALGL ARRAVPPLK
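
The relationship between the two sequences above, and the GC content field, can be reproduced with a short script. This is a minimal sketch using only the Python standard library, not the tool that generated this record. The function names (clean, gc_fraction, translate) are hypothetical. The codon table is built from the amino-acid string that NCBI lists for translation table 11, with bases ordered T, C, A, G. Table 11 also permits alternative start codons, which this sketch does not handle; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codon order: first base varies slowest, third base fastest, bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and last two blocks of the gene sequence above.
print(translate("ATGGCACCCC CTCGCGGCGC"))  # prints: MAPPRG
print(translate("CCCCTCAAGTGA"))           # prints: PLK
```

Passing the full Gene sequence field to translate should give the Protein sequence field with its spaces removed, and gc_fraction on the same input can be compared with the GC content field.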