Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1754 |
Symbol | |
ID | 4810184 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2074396 |
End bp | 2075352 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107167 |
Product | periplasmic binding protein |
Protein accession | YP_001038168 |
Protein GI | 125974258 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000128267 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAATA AAAGGAGTAT GATTTTAAAA AGAAAGATAC TGCCGCTGTT AACGGCGTTA ATTTTGATTT TTGCTTTTTC TTCATGCAAT AAAAACAATA AAAACGGGCC AACTGATGGG GCCGGTGATA ATAATAACGG ATACAGTGTA ACTTTGAAGG ATTCCTATGA CAGGGAAGTC AACCTGGACA AAGAACCTGA AAGGATAGTC TCAGTTGCCC CCAACATAAC GGAGATAATT TTTGCGTTGG GCAAGCAGGA CAAGCTGGTG GGACGGACGG ATTTTTGCGA TTATCCGGAA GAGGCGAAGA ACATCGAGTC AATAGGAAAT ATAGACCAGC CGAATGTGGA AAAGATAGTT GAACTTCAGC CGGATGTGGT TATAGCATCT TCCATCTTTA CGAAAGAGAT GCTGCAAAAG CTTGAGGAGG CCAATATCAA GGTGGCTATC TTTCAGGCCG AGAAGGACTT TGAAGGTGTC TACAACATGA TCGAAAAGAT TGGTCTTTTG CTGAACGCCC GGGAAGAGGC AAAGAATGTT GTGACGGAAA TGAAGGAAAA AATAGAGTTT GTAAAGAGCA AGGTCGACGG CCTTGAAAAG CCCAGTGTTT ATTATGTTCT TGGCTATGGC GAGTTTGGGG ATTATACCGC AGGAAGGGAC ACATTTATCA GCCGCATGAT TGGGATGGCT GGAGGAAAGA ACGCGGCGGA TGATGTGGAA GGCTGGAAAT ACAACATAGA AAGCCTCCTT GAAAAGGATC CTGACATACT TATATGCTCA AAATATTATG ATACAAAAGA AGGAATAAAA AATACCGACG GATACAAGGA ACTTTCCGCG GTAAAAAACG GAAAGCTTTT TGAGATAGAC AACAATATGC TGGACAGGCA GGGGCCGAGA ATTGCCGACG GGGTTTTGGA ACTTGCTAAA ATAATTCATC CTGAAGTTTT TAAATGA
|
Protein sequence | MRNKRSMILK RKILPLLTAL ILIFAFSSCN KNNKNGPTDG AGDNNNGYSV TLKDSYDREV NLDKEPERIV SVAPNITEII FALGKQDKLV GRTDFCDYPE EAKNIESIGN IDQPNVEKIV ELQPDVVIAS SIFTKEMLQK LEEANIKVAI FQAEKDFEGV YNMIEKIGLL LNAREEAKNV VTEMKEKIEF VKSKVDGLEK PSVYYVLGYG EFGDYTAGRD TFISRMIGMA GGKNAADDVE GWKYNIESLL EKDPDILICS KYYDTKEGIK NTDGYKELSA VKNGKLFEID NNMLDRQGPR IADGVLELAK IIHPEVFK
|
| |