Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3333 |
Symbol | |
ID | 4075232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 342686 |
End bp | 343723 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 638004841 |
Product | thiosulphate-binding protein |
Protein accession | YP_611567 |
Protein GI | 99078309 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4150] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.885845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.45738 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCTCT CCTCCAAACC GTTTCTCCGG TCCTCTCGGG TTCTGGTTCG CAGCCTCGGT GCTGCCGCGA TCGCGCTTGC TGGTTTTGCG AGTTCGGCCG AAGCCGAAGA GCAGGAAATC CTCAATGTTT CCTACGATAT TGCCCGTGAA CTCTACGCGG CGCTGAACCC TGTTTTTGCC GAAAACTGGC AGGCAGAGAC GGGCGAGACA CTGACCATCA AACAAAGCCA TGCTGGCTCT TCCAAACAGG CTCGAGCGAT CCTGCAAGGT CTGCAGGCGG ATCTCGTGAC CTTCAATCAG GTGCTGGATG TGCAGATCCT CGCAGACAAG GGTTTTGTGG CGCAGGACTG GCAGCAAAAG CTGCCCAATA ACGCATCGCC TTACTACTCG CTCCCGGCTT TCCTGGTGCG GGGTGGCAAC CCCAAGGGTA TTGAAGACTG GGACGATCTG ACCCGTGATG ACGTGGAACT CGTGTTCCCG AACCCAAAAA CCAGCGGCAA TGCGCGCTAC ACCTATCTCG CGGCCTACGC CTATGCGCTT GACAAGTTTG GAGGCGATGA GGCCGCGGCG CAGGAATTTG TCGGTAAGAT CCTCTCCAAT GTCGTGGTGT TCGACACCGG TGGGCGTGGT GCGACAACGA GCTTTGTCGA GCGTGAGCTT GGCGATGTGC TGATTACCTT CGAGGCCGAG GTCGAGAACA TCCGCGCCAG TGAGGATGAA GGTGCCTTTG ATCGTGTGGT GCCTGCAATC TCCCTCTTGG CAGAGTTCCC TGTGGCGCTG GTTGACAAGG TGGCAGATGC ACGGGGCAGC CGAGCCGTTG GCGAGGCCTA TCTCGACTTT CTCTACTCCA AGGACGCGCA GGAAGTCATT GCCGGTTTCA ACAACCGTGT GCATCACCCC GAGGTGGTGG CTGCAACAGC CGACAAGTTC CCTGATGTGC GTCTGATCAC GGTCGAAGAA GTCTTTGGCA GCTGGGCCGA AGCGCAGGAG ACCCACTTTG GCGAGGGTGG TACGCTCGAC CGGGTCTTCA CCAACTAA
|
Protein sequence | MPLSSKPFLR SSRVLVRSLG AAAIALAGFA SSAEAEEQEI LNVSYDIARE LYAALNPVFA ENWQAETGET LTIKQSHAGS SKQARAILQG LQADLVTFNQ VLDVQILADK GFVAQDWQQK LPNNASPYYS LPAFLVRGGN PKGIEDWDDL TRDDVELVFP NPKTSGNARY TYLAAYAYAL DKFGGDEAAA QEFVGKILSN VVVFDTGGRG ATTSFVEREL GDVLITFEAE VENIRASEDE GAFDRVVPAI SLLAEFPVAL VDKVADARGS RAVGEAYLDF LYSKDAQEVI AGFNNRVHHP EVVAATADKF PDVRLITVEE VFGSWAEAQE THFGEGGTLD RVFTN
|
| |