Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_4206 |
Symbol | |
ID | 8605562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 4800501 |
End bp | 4801631 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | galactokinase |
Protein accession | YP_003301771 |
Protein GI | 269128401 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATCCGC TGCTGGCGGC CTTCGAGGAG GCCTACGGCC GGCCCCCGCA GGGGGTGTGG CACGCCCCGG GCCGGGTCAA CCTGATCGGG GAGCACACCG ACTACAACGA CGGGCTGGTG CTGCCGTTCG CGCTGGCCCA GGGGGTGTCG GTGGCCGCCG CCCGCCGCGA CGACGGGGTG CTGGAGCTGC GTTCCCTGCA GGCCGCCGCC GACGGCCGCA CCGTGCGGGT GGAGGAGCTG ACGCCCGGCG CGGTGGACGG CTGGGCCGCC TACCCCGCGG GGGTCGCCGC CGTGCTGCGC GAGCACGGGG TGGGCGGGGC GTCCCTGCTG ATCGACTCCG ATCTGCCGCA GGGCGCCGGG CTGGCCTCGT CGGCGGCGCT GGAGTGCGCC GTCGCGCTGG CCCTGTGCCA GCTGCACGGC GTGGAAATCG AACGCGCCGA GCTGGCCCGC CTGGCGCAGC GGGCCGAGCG GGAGTTCACC GGCACGCCCT GCGGGATCAT GGACCAGTCG GCCGCGCTGC TGTGCACCGC GGGGCACGCC CTGCTGCTGG ACTGCCGCAG CGGCCTGTCC TCCCAGGTGC CGCTGCCGCT GGGTGAGGCG CTGTCGCTGC TGGTGGTCGA CACCCGCGCC CCGCACGCCC TGGCCGACGG GGACTATGCG GCGCGCCGGG CCGAGTGCGA GCGGGCCGCG TCCCTGCTGG GGGTGGACTC GCTGCGGGAC GTCAAGGACC TGGCCGGGGC GCTGGCGAGC CTGCCGGAGC CGGTGCTGCG CCGCCGCACC CAGCACGTGG TCACCGAGAA CCACCGGGTG GAGGCGGCCG TGGGGCTGCT GCGCGCCGGG GCGCTCGCCG AACTGGGCGC CCTGCTGACC GCCTCGCACC TGTCGCTGCG GGACCAGTTC GAAGTCTCCT GGCCGCGGGC GGACGCGGCG GTGGAGGCCG CGCTGCGGGC CGGGGCGCGG GGCGGCCGCA TGGTGGGCGG CGGCTTCGGC GGCTCGGTCA TCGTGCTGGC CGCCGCCGAC CGGCTCGCCG ACGTCCGGGA GGCCATCGAC GCCGCCTACG CCGAACGCGG CTGGCCCGCC CCCGCCTACC TGGAGGCCGT TCCCTCCGCA GGGGCCCGCC GCCTGCTGTG A
|
Protein sequence | MDPLLAAFEE AYGRPPQGVW HAPGRVNLIG EHTDYNDGLV LPFALAQGVS VAAARRDDGV LELRSLQAAA DGRTVRVEEL TPGAVDGWAA YPAGVAAVLR EHGVGGASLL IDSDLPQGAG LASSAALECA VALALCQLHG VEIERAELAR LAQRAEREFT GTPCGIMDQS AALLCTAGHA LLLDCRSGLS SQVPLPLGEA LSLLVVDTRA PHALADGDYA ARRAECERAA SLLGVDSLRD VKDLAGALAS LPEPVLRRRT QHVVTENHRV EAAVGLLRAG ALAELGALLT ASHLSLRDQF EVSWPRADAA VEAALRAGAR GGRMVGGGFG GSVIVLAAAD RLADVREAID AAYAERGWPA PAYLEAVPSA GARRLL
|
| |