Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_0799 |
Symbol | |
ID | 8602101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 887509 |
End bp | 888534 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | sulfotransferase |
Protein accession | YP_003298428 |
Protein GI | 269125058 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0670086 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGCA TGGCTTCGCA ATCTGATCGG CCCATCTTCG TCATCGGATG CCCACGGTCG GGGACGACTC TGCTGCAGCT GATGCTGCAC TCCCACGAGC GGATCGCCAT CCCGGCCGAG ACCCGGTTCC TGCTGCAGGC CTACGCCTCC CGGCACCGCT TCGGGGACCT GCACGTGCCG GACAATCGCC GCGCGTTGGC GGAGTGGATC GTCAGGCGCC GGGAGACCAA GTTCCACGAC CTGGGCCTGG ACCCCGATGA GGTCATCGAG GAGATCGTCG CCGGCCCCCC GACCCTGGGC TCGGCGCTGG GCATCGTGTT CCGCGCCTAC GCCCGCCGCT TCGGCAAGCC CCGCTGGGGC GACAAGCGGC CCAGCTACTT CAAGCACGTG GACGTGCTGC GCCGCATGTG GCCGGACGCC CAGTTCATCC ACCTGATCCG CGACGGCCGG GACTGCGTGG CCTCCCTCAA GGAGATGCCC TGGTACAACC TCGGCTCCTA CCACGCCATC TGCGCCTGGC GGGAGGCGAT CGACTACGGC CGCCGCTACG CCCGCAAGCT CGGCCCCGAC ACCTACTACG AGCTGCAGTA CGAGCACCTG GTCGCCGACC CCGCCGGCGA GCTGGCCAAG CTGTGCAAGT TCCTCGGCGA GGACTTCGAC CCCCGCATGA CCCGCCCCCA GGAGATCGCC AAGCTGACCG TGCCGCCCAA CAAAAGGTGG CATGAACGCA CCCAGAGCGA CATCACCACC GGCCGGGTCG GCTCCTGGGC GCACCGCCTG GAGCCCTGGG AGATCGCCCT GGCCGAAAAG GCCCTGGGCT CGCGGCTGCG CGCCTACGGC TATGAGGTGA GCGGCAGTGA ACGCCCGTCC ATGACCCACA TGCTGCGCCT GGCCCGCGTG GCCGTCCGCC GCAAGACCAC CCAGCGCCGG CGCGCCCTGC GCGACCGCCT GGCGCGCCGC AACGAGCCCG GCCCGGTCGA ATGCCGCCTC AACCGCTCCG GCGACAGCGA ACGCGCCACC GCCTGA
|
Protein sequence | MKRMASQSDR PIFVIGCPRS GTTLLQLMLH SHERIAIPAE TRFLLQAYAS RHRFGDLHVP DNRRALAEWI VRRRETKFHD LGLDPDEVIE EIVAGPPTLG SALGIVFRAY ARRFGKPRWG DKRPSYFKHV DVLRRMWPDA QFIHLIRDGR DCVASLKEMP WYNLGSYHAI CAWREAIDYG RRYARKLGPD TYYELQYEHL VADPAGELAK LCKFLGEDFD PRMTRPQEIA KLTVPPNKRW HERTQSDITT GRVGSWAHRL EPWEIALAEK ALGSRLRAYG YEVSGSERPS MTHMLRLARV AVRRKTTQRR RALRDRLARR NEPGPVECRL NRSGDSERAT A
|
| |