Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3122 |
Symbol | |
ID | 4809685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3684732 |
End bp | 3686810 |
Gene Length | 2079 bp |
Protein Length | 692 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640108555 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_001039510 |
Protein GI | 125975600 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTT TTAGGGGGAG GCTAATATCT TTAGCGCTTG TACTTGCAAT TATAATGCAA TTATTTGCAA ACTTTGCTTT TGCAGAGCCG ACCGAAAAGA ACACTATATT TTTCAGAGAC ATATCAGGAC ACTGGGCGGA AGAATCAATC AAGTACCTGG CAGAACGTGG CTTGGTAAAA GGCTATTTGA CTGACAACGG ATACATAATT AAACCTGATA CGTACATAAC AAGAGCTGAA TATCTTACCG TTCTTTTAAA TACAAAACCG AATCTCAAGG TTGTAAGTGA TAAAGTAAAA TTATTTGTTG ACGTAAAAGA TAATGATTGG TACAAAGAAG TGGTTGATAA AGCATCAAGT AATGGAATTC TCGAAGGATA TCCCGATGGA AGTTTCAAGC CTAATAATCC GATTACCAGG GCAGAAATTT CTGCGTTAAT GGTAAAAATC AACAATTGGA ACGAAAAGGA TATTACTGAG GATGCTGATG TGTTTTCTGA TGTTCAAAAG AATTCATGGT ACTATGCAGC TGTTTTAACG TGTAAGTTAA AGAAGATAAT AAATGGATAT CCGGATGGCA CTTTCAAACC GGGAAACTAT GCATCAAGAG CGGAAGCTTT TGCTTTGCTT GCAAATTATG TCAGGAATTT TATTGACAAG GATTTGGATG AAGAGCCTGA AAAAACTCCC GGTACAACTC ATGACGTGGT TTCACCGACT CCAAACCCTG GCACACCGTC TTTACCATCT TCCGGTTCCG GTTCAACAAT TGAAAAAGGG CCGTTTAAAG GTAAAACAGT TGATGACTTT TTATACGTAT CCGGCGGAGA CTTTAAAATT TTGAATGAAG AACAGAAAAA GATCACCTTT GAAGGTTTGG TTGACAAGGA AAATGTAAAA AAGATGTACT ACAATGTGGA ATACTATGGA TTCGACACGG ATACGCCCAA AATGGAAAAA AACAATATTA CAGACGGATT AATTATAGCA GGGCCTGATG AAGAAAACAA AGGAACATTT AATGAAAATC AAATCCCGTA CAGTTGGGTA CTTAAAGATT TTGAAGTTAA TAAGAGCTAT TTTAAGATTT ACATTAAACT GGTGATAGAA GACGAATACG GAAATAAACT TACTAAAATT TTGGCAATTA TAAATGATAA GGATTCTGAC GGGGATGGAT TGTCAGACTA TGAAGAGGTG TACATATATA ATACAAATCC GTTGAATTAC GATACAGATG GGGATAAACT TTCAGACTAT GAAGAAATTA ATATTTATGG AACAGATCCG TTAAATTCAG ATACAGATGA AGATGGTTTG ACGGATTATG AGGAAATGAA AGCATGGGTT ATTTTAGAAT TAGGTACAAT AGTATCAATT TTTTATAATG CTCCCGAATA CGAAGATATG GGAACTGAAG GCGTGGAGCT GGCAGCTGAA AAATTCGGCT ATAAAGAAGA GCAGATAATT TTGGGATTGG ATCCATTAAA TCCTGATACG GACGGAGACG GACTTCCTGA TGGCTATGAA TTCAGGATAT TGGGAACTGA TCCGACACGC AAAGTAACTT ATGGTGCGAA TGTGCCGGAT GTTGATCTTG ACATGGATAA AGACGGGCTT TCAAACTGGG ATGAATTTTT ATATGGAACG GATCCATGGT TAAAAGATAC AGATGGGGAC GGCATTAGTG ATTACGATGA GATTCGTACT TATAAAACAG ATCCATTAAA TCCTGATACA GACGGAGACG GGCTTGAAGA TGGATTGGAA CTGGAGCAGG GTTTTGATCC TCTAAAGTCT GATACCGGTA ACAGTGGTGT TTTGGATTCA GAAAAATTTG TCAGCATGAA TTTGTCCGAA GAAACTCTAA GTGACGTTTT GACTCCTGAA AACAGAGCAA TTCCTTCAAT AAAAGTGTTC GGCGTACCTG ATTTTGATAT TAATACTACT GTGGAAAATG CGTCTGAGCA TGAGAGTGTT AAAAACATTG TTGGCGTGGT GGGATTTCCT ATAGACATAA AAACTGATGA GGATTTTGAA AGTGCGCAGA TAAGTTTTAA AATAAGTGAA GAAGTTTAA
|
Protein sequence | MSFFRGRLIS LALVLAIIMQ LFANFAFAEP TEKNTIFFRD ISGHWAEESI KYLAERGLVK GYLTDNGYII KPDTYITRAE YLTVLLNTKP NLKVVSDKVK LFVDVKDNDW YKEVVDKASS NGILEGYPDG SFKPNNPITR AEISALMVKI NNWNEKDITE DADVFSDVQK NSWYYAAVLT CKLKKIINGY PDGTFKPGNY ASRAEAFALL ANYVRNFIDK DLDEEPEKTP GTTHDVVSPT PNPGTPSLPS SGSGSTIEKG PFKGKTVDDF LYVSGGDFKI LNEEQKKITF EGLVDKENVK KMYYNVEYYG FDTDTPKMEK NNITDGLIIA GPDEENKGTF NENQIPYSWV LKDFEVNKSY FKIYIKLVIE DEYGNKLTKI LAIINDKDSD GDGLSDYEEV YIYNTNPLNY DTDGDKLSDY EEINIYGTDP LNSDTDEDGL TDYEEMKAWV ILELGTIVSI FYNAPEYEDM GTEGVELAAE KFGYKEEQII LGLDPLNPDT DGDGLPDGYE FRILGTDPTR KVTYGANVPD VDLDMDKDGL SNWDEFLYGT DPWLKDTDGD GISDYDEIRT YKTDPLNPDT DGDGLEDGLE LEQGFDPLKS DTGNSGVLDS EKFVSMNLSE ETLSDVLTPE NRAIPSIKVF GVPDFDINTT VENASEHESV KNIVGVVGFP IDIKTDEDFE SAQISFKISE EV
|
| |