Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rfer_1762 |
Symbol | |
ID | 3960853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodoferax ferrireducens T118 |
Kingdom | Bacteria |
Replicon accession | NC_007908 |
Strand | - |
Start bp | 1894748 |
End bp | 1895746 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637916585 |
Product | thiosulphate-binding protein |
Protein accession | YP_523022 |
Protein GI | 89900551 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.478994 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTA TGCAAAAAGT GGCTCTAGCC ATTGCCGGCA TTGCGCTTAC AGCTACACAA TTTGCAGCAG CTCAGACCCT GCTCAATGCG TCTTACGACG TGTCGCGCGA GTTCTACAAA GACTACAACG CAGCTTTTGT GGCCCATTAC AAAAAGACCA CGGGCAAGGA TGTCAAGGTG GATCAATCGC ACGGCGGCTC CAGCGCGCAA GCCCGGGCGG TGAACGATGG GCTGAACGCC GATGTGGTCA CCATGAACAC CACCACCGAT GTCGAGTTTC TGGCCAAAAG CGGCATCGTC GCGGCTGACT GGGCCAAACG TTTCCCCAAC AATGCCTCAC CCACCACGTC GACCATGCTG TTTCTGGTGC GCAACGGCAA CCCCAAAGGC ATCAAGGACT GGAACGACCT GATTCGCCCC GATGTGAAGG TGATCGTGGT CAACCCCAAA ACCGGTGGCA ATGGCCGCAT GGCCTACCTG GCCGCCTGGG GCTATGTGCG CAAAACGGGC GGCACCGACG CGCAAGCGGC TGAGTTTGTG AGCAAGCTGT TCAAGAACGT GCCCGTGCTG GCCAAGGGCG GGCGTGATGC CACCACCATC TTTTTACAGC GCAACATTGG CGACGTGCTG ATCACGTTTG AGTCTGAAGT GGTGTCGGTG GACCGCGAAT TTGGCGCCGG CAAGGTGGAT GCCATCCACC CGTCGGTCAG CATCATTGCC GAAAACCCGG TGGCAGTCGT TGAGCGTACC GTGGCCAAGA AAGGTACCGG CGATTTGGCC AAGGCTTACC TGAACTACCT GTACTCCGAC GAAGCGCAAG AAATCGCCGC CAAGCACGGC ATTCGCCCAA GCAACCAAAA AGTGTTGACC AAATACGCCA GCACCTTCAA ACCGCTGCAA TTGTTCCCGG TGAGCGAGTA CTTTGGCTCG CTCTCTGAAG CGCAAAAGGT TCACTTCAAC GACGGCGGCC AGTTCGACAA GATCTATACC GTCAAGTAA
|
Protein sequence | MKFMQKVALA IAGIALTATQ FAAAQTLLNA SYDVSREFYK DYNAAFVAHY KKTTGKDVKV DQSHGGSSAQ ARAVNDGLNA DVVTMNTTTD VEFLAKSGIV AADWAKRFPN NASPTTSTML FLVRNGNPKG IKDWNDLIRP DVKVIVVNPK TGGNGRMAYL AAWGYVRKTG GTDAQAAEFV SKLFKNVPVL AKGGRDATTI FLQRNIGDVL ITFESEVVSV DREFGAGKVD AIHPSVSIIA ENPVAVVERT VAKKGTGDLA KAYLNYLYSD EAQEIAAKHG IRPSNQKVLT KYASTFKPLQ LFPVSEYFGS LSEAQKVHFN DGGQFDKIYT VK
|
| |