Gene Information |
Locus tag | Cthe_3000 |
Symbol | |
ID | 4811148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3522402 |
End bp | 3523454 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108421 |
Product | phosphate transporter |
Protein accession | YP_001039389 |
Protein GI | 125975479 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0306] Phosphate/sulphate permeases |
TIGRFAM ID | |
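The coordinate, gene length, and protein length fields above are related by simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions that happen to agree with the values in this record. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(3522402, 3523454)
print(length_bp)                  # 1053
print(protein_length(length_bp))  # 350
```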
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000358421 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTAT CACTTACTGA TTTTTTAAGT CATTTGATAT CGAGTCCTCC ATTAATTATA ACTACTCTTC TTACACTGGG AGTTATAATG GTTAATGGAT GGACGGATGC TCCTAACGCG ATTGCCACTT GTGTTTCCAC ACGGTCGATA AGTCCGCGTG CGGCTATAGT GATGGCAGCG TTTTTTGACT TTCTTGGTGT TTTTGTTATG ACCAAGATCA ATGCGAGGGT GGCAGAGACC ATTTATAAAA TGGTTGATTT CGGAGGAGAC CCTAACAAAG CCCTGGTAGC CCTTTGCGCT GCTCTTTTTG CAATTGTGCT TTGGGCGACG GCCGCATGGT GGTTTGGAAT TCCAACCAGT GAAAGCCATG CACTTATTGC CGGAATAAGC GGTGCTGCAA TTGCCCTGCA AAAAGGCCTT CACGGTATTA ATTTTAATGA GTGGGTTAAT GTCCTTTACG GACTGGCACT GTCCCTGGTA TTGGGTTTCG CAACGGGATG GACGGTTGTA AAGCTGGTTG AGAAGATTTT TAGAAGAGTC AACAGGGCAA AAACTTTTGG GTTTTTCAAA AATGCCCAGG TTTTGGCCAG TGCCGGCATG GCATTTATGC ATGGCGCGCA GGACGGACAG AAATTTATGG GTGTGTTTAT GCTGGGTGTG TTTTTAACGC AGGGACAAGC ACAAGTGACA GAATTTATCA TACCGAACTG GATGCTTATT CTTTGCTCGG CAGTCATGGC CACAGGTACT TCCATTGGCG GATATCGAAT TATTAAGGCC GTAGGAATGG ACATGGTCAA ACTTGAAAAA TATCAAGGTT TTTCTGCCGA CCTTGCCGGC GTGATATGTT TGCTTACAGC ATCAGTTTTC GGCTTGCCGG TCAGTACCAC TCATACCAAG ACAACGGCAA TTATGGGCGT TGGAGCGGCA AAACGTATAT CTTCAGTTAA CTGGGGAGTG GTAAAAGAGA TGGTGTCCGC ATGGGTTCTG ACTTTCCCGG GATGCGGATT GATAGGCTTT TTGATGGCAT TGCTCTTTAT GAACATATTT TAA
|
Protein sequence | MTVSLTDFLS HLISSPPLII TTLLTLGVIM VNGWTDAPNA IATCVSTRSI SPRAAIVMAA FFDFLGVFVM TKINARVAET IYKMVDFGGD PNKALVALCA ALFAIVLWAT AAWWFGIPTS ESHALIAGIS GAAIALQKGL HGINFNEWVN VLYGLALSLV LGFATGWTVV KLVEKIFRRV NRAKTFGFFK NAQVLASAGM AFMHGAQDGQ KFMGVFMLGV FLTQGQAQVT EFIIPNWMLI LCSAVMATGT SIGGYRIIKA VGMDMVKLEK YQGFSADLAG VICLLTASVF GLPVSTTHTK TTAIMGVGAA KRISSVNWGV VKEMVSAWVL TFPGCGLIGF LMALLFMNIF
|
| |
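The gene and protein sequences above can be cross-checked by translating the nucleotide sequence with translation table 11. The following is a minimal sketch rather than the database's own method. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is built from the amino acid assignments of NCBI table 11, listed in T, C, A, G base order. Alternative start codons are not treated specially, which is harmless here because this gene starts with ATG.

```python
from itertools import product

# Amino acid assignments of NCBI translation table 11, with codons
# enumerated as TTT, TTC, TTA, TTG, TCT, ... (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGACTGTAT CACTTACTGA TTTTTTAAGT"
print(translate(first_blocks))  # MTVSLTDFLS
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the protein sequence and the GC content reported in the record.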