Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_1891 |
Symbol | |
ID | 8603218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 2234209 |
End bp | 2235456 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003299499 |
Protein GI | 269126129 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0120755 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTCGT CCCTGAACCT AGCGCTTGTC TCATTGGCCG ATCTGACGGG CGATTCGGCA AGCGCCGGCG TCCACTCCAT CCACGTGCTG GAGTTGGCCC GCGCATTGGG CGGGCAGGGC CATCGAGTGA CCGTCTACAC CCGGCGCACC ACCGAGAGCG GACGCCCCCG GACCAGGCTC GGCGCGGGCG CCACGCTCGA ACAGATCACC GCGGGCCCCG CCCGGCCGCT GCCGGAGGCC CATCTGCTGC CGCACATCCG GGCGTTCACC GACGAGCTGA GCCGCCGGCT GGCGCAGCAA CGCAACCGGC CCGACCTGGT GCACGCGCAC GGCTGGCTGG GCGGGCTGGC CGCCTACGCC GCGCTGCAGG ACACCGGGCT GCCGCTGGTG CAGTCCTTCC ACGGCCTGGG CGTGGTGGAA AGCCGCCGGC CCGGCGGGCG GCGCACCGTG CACCCGGCCC GCATCCGGAT CGAACGGGCG CTGGGCCGCG GCGCCGACAC GGTGCTGGCC GGCTGTGAAC ACGAGGCCGA CGAACTGGTG CGCATGGGCG TGGCGCGGCC CCGGATCTGC GTGGTGCCCT ACGGCGTGGA CTGCGAGCGG TTCCGCCAGA CGGGCCCGAC GATGCCGCGG GGGCGGCGCC CCCGGCTGGT GCTGGTCGGC AACGACCTGG ACAACGCCGG CGCGGCGGTG GCCGTGCGGG CACTGGCGCA CGTCCCCGAG GCCGAGCTGG TGGTGGCCGG CGGCCCGGCC CGCGAGGACC TGGAAAGCGA CCCCACCGTG CACCGGCTGC GCACCCTGGC CAAGGAGCTG GACGTGGCCG ACCGCACGCT GTTTTTGGGC CGGCTGCCCC GCAAGGACGT CCCCAAGCTG CTGCGCACCG CCCGGCTGGC GCTGTGCCTG GCCCCGCACC AGCCCTCGGG GATGGTGCCG CTGGAGGCCA TGGCCTGCGG CGTGCCGGTG GTCGCGGTGC CGACCGGCTC CGGCGCCGAC AGCGTGCTGG ACGGCGTCAC CGGGCTGCAC GTGCCGCCCG GCCAGCCGGT GACCCTGGGA CGGGCGCTGC GCCGGCTGCT GGCCGAGGAG ACCACGCTGT CGGCGTGGGC CATCGCCGCC GCCGACCGCG CCCACTCCCG CTACGCCTGG GAGCGGATCG CCGCCGAGAC CGTGCGCTGC TACAGCGCGC TGCTGCCCGA GCCGGAGCCC GAGCCCGCCG AGCGGCACGC CGAGGAACCG GTGGGCGCCG GCGTCTGA
|
Protein sequence | MPSSLNLALV SLADLTGDSA SAGVHSIHVL ELARALGGQG HRVTVYTRRT TESGRPRTRL GAGATLEQIT AGPARPLPEA HLLPHIRAFT DELSRRLAQQ RNRPDLVHAH GWLGGLAAYA ALQDTGLPLV QSFHGLGVVE SRRPGGRRTV HPARIRIERA LGRGADTVLA GCEHEADELV RMGVARPRIC VVPYGVDCER FRQTGPTMPR GRRPRLVLVG NDLDNAGAAV AVRALAHVPE AELVVAGGPA REDLESDPTV HRLRTLAKEL DVADRTLFLG RLPRKDVPKL LRTARLALCL APHQPSGMVP LEAMACGVPV VAVPTGSGAD SVLDGVTGLH VPPGQPVTLG RALRRLLAEE TTLSAWAIAA ADRAHSRYAW ERIAAETVRC YSALLPEPEP EPAERHAEEP VGAGV
|
| |