Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1588 |
Symbol | |
ID | 4809579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1919023 |
End bp | 1920129 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107006 |
Product | extracellular solute-binding protein |
Protein accession | YP_001038007 |
Protein GI | 125974097 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATTA AAAAAAGTTG GTTGGCACTT ATATTAGCAG CCCTAATGGC AATTTCTGTG ACTGCATGTT CGGGTTCAAA GACAGAAACG ACTAAACAAG ACCCAACAAC TAAAAATCAA GAAACAAACA ACAACAATAA TAATAACAAC AATAATAACG AAGAGAATGT ATTAGTAGTA TATACGGCAA GAAGTGAAGA ATTAAATAAA GCTGTGATTT CAGAGTTTGA AAAAGAAACC GGAATCAAAG TTGAATTGGT AACAGCAGGT ACAGGAGAAT TGCTTAAGCG TGTAGAGTCA GAAAAAGACA ACCCATTGGG TGATATTTTA TGGGCAGCAG ACCGTACCAT GCTTGCTGCA TCTGAGGATT TGTTCATGGA ATATGTATCC AAGGAAGACG GAAATATGAT GGAAGGATTC CAGAACACAA CCGGATATTT TTCGCCGGCA TTTGCAGATC CAGCGGTATT TATTGTCAAT ACTGACTTAA AAGGTGATAT AAAAATCGAA GGGTTTGAAG ATTTATTGAA TCCTAAACTG AAAGGAAAGA TTGCTTTTGG AGATCCTGTC AACTCCAGTT CGGCATTCCA GTCTTTAGTT GCAATGCTGT ATGCTATGGG CGAAAACAAT GACCCAATGT CCGATAAAGC ATGGGAGTTT GTAGATCAAT TCTTAAAGAA CCTTGACGGC AAAATGGCTA ACAGCTCCAG TCAGGTATAT AAAGGTGTGG CAGAAGGTGA ATATATTGTA GGACTTACCT GGGAAGACCC GGCTGCTAAA TATGTAAAAG AGGGAGCTAA CGTTGAGGTT GTATTCCCTA AAGAAGGAAC CATTTTCCCG GGTGAATCTG TGCAGATTAT AAAGAATTGC AAGCATCCGG AAAATGCAAA GAAATTTGTA GATTTCATGC TGTCTGAAGA GGTTCAGAAC AGAGTGGGCT CCGAATTGAC AGTTCGTCCT CTCAGAAAAG GTGCAAAATT GGCAGATTAT ATGACCCCTC AGTCCGAAAT TAAGTTATTC AGTAACTATG ATGAAGGATG GGTTGCTGCA AACAAATTGG CAATTACAAA CAAGTTTAGC GAGCACCTGG AATCATCTAT GGACTAA
|
Protein sequence | MKIKKSWLAL ILAALMAISV TACSGSKTET TKQDPTTKNQ ETNNNNNNNN NNNEENVLVV YTARSEELNK AVISEFEKET GIKVELVTAG TGELLKRVES EKDNPLGDIL WAADRTMLAA SEDLFMEYVS KEDGNMMEGF QNTTGYFSPA FADPAVFIVN TDLKGDIKIE GFEDLLNPKL KGKIAFGDPV NSSSAFQSLV AMLYAMGENN DPMSDKAWEF VDQFLKNLDG KMANSSSQVY KGVAEGEYIV GLTWEDPAAK YVKEGANVEV VFPKEGTIFP GESVQIIKNC KHPENAKKFV DFMLSEEVQN RVGSELTVRP LRKGAKLADY MTPQSEIKLF SNYDEGWVAA NKLAITNKFS EHLESSMD
|
| |