Gene Information |
Locus tag | Cthe_2531 |
Symbol | |
ID | 4809287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3000447 |
End bp | 3001517 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107947 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_001038926 |
Protein GI | 125975016 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
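
The coordinate and length fields in this record agree with one another if the start and end positions are read as 1-based, inclusive coordinates and the CDS ends in a single stop codon. The following is a minimal sketch of that check under those two assumptions; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(3000447, 3001517)
print(length, protein_length_aa(length))  # 1071 356
```

Both results match the Gene Length and Protein Length fields reported above.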
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000783626 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAAAA ATAAGGGAAG AAAAAATATT CTCAAGTCAA GCCTGGCAGT ACTGCTGATT ATAGCCGGCC TTGTCTCCTT TGCAGGATGC GCAAAAGAAG CAGGCAATAA TCAAAGCGAT TCCGGTCAAA CAGAACCTGT TACGATTCTT AATGTATCCT ATGACCCTAC CCGTGAGCTT TATCAGGATT ACAATGAAGC ATTCAGAAAA TACTGGAAAG AAAAAACCGG AAAGGACATT GTCATCCAGC AGTCCCATGG AGGCTCAGGC AGCCAGGCAA GAGCTGTAAT TGACGGTCTT GAGGCTGATG TCGTAACCCT GGCTTTGGCG TATGATATTG ATTCAATAAA CAAAAACAAA GAGATTTTGA GCAAAGACTG GCAGAAACGC TTGCCGTACA ATTCAACTCC ATATACTTCG ACCATTGTAT TTCTGGTAAG AAAGGGTAAC CCCAAAAATA TTAAGGACTG GGATGACCTT GCCAGACCGG GGGTGGAAGT TATCACTCCA AACCCCAAGA CTTCAGGAGG TGCACGCTGG AATTATCTTG CGGCATGGGG ATATGCCCTG AAAAAATACG GCAATGACCC GGAAAAAGCC AAGGAATTTG TTAAAGCAAT ATATGCCAAC GTACCCGTTC TTGATTCCGG AGCAAGGGGC TCTACCACGA CCTTTGTGGA GCGGGGATTG GGAGATGTGC TTATAGCCTG GGAGAATGAA GCATTTCTTT CCTTGAATGA GCTGGGCAAA GACAAATTCG AAATTGTTGT ACCGTCTGTG AGCATACTGG CTGAGCCTCC TGTTGCTGTT GTTGATTCGG TGGTTGACAA GAAAGGAACC CGTGAGGTGG CGGAAGCTTA TCTTGAATAC CTGTACAGTG ACGAGGGGCA GGAAATAGCG GCTAAAAACT ATTACAGGCC CAGAAAAGAA GAAATCAAGC AAAAATATGC TTCGCAATTT GCCGAAGTTG AACTTTTTAC CATTGATGAA GTCTTTGGCG GATGGGATAA AGCGCAAAAG GAACATTTTG ATGACGGCGG TATCTTTGAC CAAATATATG AGAAGAAATA A
|
Protein sequence | MEKNKGRKNI LKSSLAVLLI IAGLVSFAGC AKEAGNNQSD SGQTEPVTIL NVSYDPTREL YQDYNEAFRK YWKEKTGKDI VIQQSHGGSG SQARAVIDGL EADVVTLALA YDIDSINKNK EILSKDWQKR LPYNSTPYTS TIVFLVRKGN PKNIKDWDDL ARPGVEVITP NPKTSGGARW NYLAAWGYAL KKYGNDPEKA KEFVKAIYAN VPVLDSGARG STTTFVERGL GDVLIAWENE AFLSLNELGK DKFEIVVPSV SILAEPPVAV VDSVVDKKGT REVAEAYLEY LYSDEGQEIA AKNYYRPRKE EIKQKYASQF AEVELFTIDE VFGGWDKAQK EHFDDGGIFD QIYEKK
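
The sequences in this record are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction from the result; the helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
def normalize(seq_blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(seq_blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown in this record.
sample = normalize("ATGGAGAAAA ATAAGGGAAG AAAAAATATT")
print(len(sample))  # 30
print(sample[:3])   # ATG
```

Applying `gc_fraction` to the full normalized gene sequence would give a value to compare against the GC content field; translating it with translation table 11 requires a codon table or an external library and is not shown here.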