Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2805 |
Symbol | |
ID | 4809642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3305884 |
End bp | 3307266 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108225 |
Product | carbohydrate-binding, CenC-like protein |
Protein accession | YP_001039197 |
Protein GI | 125975287 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0403261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGACTTTA CAGCCTCCAC AACCAGTGTT GATATTGCAG TGCGTCTGGG TTACACAAAC AGCAAAGTGA CCGGAACCGC ATGGTTTGAT AATATAACGG TGGAGCATTT GCCAAAAACC GTATTTACAG AAGATTTTGA AAACGGTCTT TCCTCCAATT GGGAAATAAG ATCGTCAAAA CATGGCAATT CTCTGGCAAC CGTAACTGTC GAATCAGGCA CAGGTGTAAA CAACAGTAAA TGTTTAAAAA TAAGCTCATT AGCACAGGAT GAAGATGTCG GATGCGTAAA AACCCTTCAA CTTGCTCCAA ACAGCTATTA TAAACTTAGT GCCCTTATGA AGTACGAAAA TGTAACTCCG GGTAAAAGTG ACGGAGCCAA TATTTGCCTC TATAACAATG AAGGTGAAGA TGCAATATGG ATTCGAACAG CCACAGCAAC CGGAACAAAT ACTTCATGGG AATTAGTAAA ACTGCTTTTC AAAACTCCGG ATTCCGGCAG TGTGAACATT GGCTTGCGTT TAGGCTTTCT TAATTGCGAA ACCAAAGGTA CTGTGTGGTT TGACAATGTC AAGGTAGAAG CTGTACCTTC CGATTATATT TACGAATCAG AGCATATTAT TGCTTGCTTT GAAACTGACG ATACCCAGTT TGCGACAAGG GAAGGTATTC TTAACTGGTT GTCGGAGTTG GATAAAGTAT ATGTTTTGAT GAAAGAATTC TCCGGAAACA GAGTGCCTTT TGACGGCAAA AAAATGGGAA TATTATCATC AGACAGCCAT TCAGGCGGTG TCGGCGCTTA TGCGGGAGAT CCCATAATGT GGTTTAAAAG CGCCAACGAC AATCCTGTAG CTTATCAGTT AAAAATGACT TGTGAACACG GAGATTTATC TTTTGCAATT TTGCATGAGA TAGGACATAA TTTCAATCTC GGAAATACCA GTTGGAACTG GAATGACGAA ATGTTTGCCA ATTTCAGGGC ATATTATGCA GTTGAACAGC TGGAAAACTT GCACTTGAAC AATATGGGCC CAATGCCTGT CGTTTACAAC TATCCTAATA TACGTCGGGA AGGTACTGAG CTTAAAGATT ACTATTCTGA GAGGTATAAC GAAACCATGG CACAAGGAGA ATTTCATCAT GACGGATTAA TGTACACACT CATTCTAGCT AAAGAAAAAA TAGGCTGGGA GCCCTTCCTT AAAACCATAC AATACTTGTC AAGCAACGAT TTCAGCAACA AAAACGACAT GGAAAAATTT GAACTCTTTT ATTCCAAACT GTCTGAATAT TCACAACAGC CAATAAGCGA ATTAATATCC GACGAGGACA TGTCACTCGT CAGGGCATAC TGGAATGAGC ACGGTTCGTT ATTTACGCGT TAA
|
Protein sequence | MDFTASTTSV DIAVRLGYTN SKVTGTAWFD NITVEHLPKT VFTEDFENGL SSNWEIRSSK HGNSLATVTV ESGTGVNNSK CLKISSLAQD EDVGCVKTLQ LAPNSYYKLS ALMKYENVTP GKSDGANICL YNNEGEDAIW IRTATATGTN TSWELVKLLF KTPDSGSVNI GLRLGFLNCE TKGTVWFDNV KVEAVPSDYI YESEHIIACF ETDDTQFATR EGILNWLSEL DKVYVLMKEF SGNRVPFDGK KMGILSSDSH SGGVGAYAGD PIMWFKSAND NPVAYQLKMT CEHGDLSFAI LHEIGHNFNL GNTSWNWNDE MFANFRAYYA VEQLENLHLN NMGPMPVVYN YPNIRREGTE LKDYYSERYN ETMAQGEFHH DGLMYTLILA KEKIGWEPFL KTIQYLSSND FSNKNDMEKF ELFYSKLSEY SQQPISELIS DEDMSLVRAY WNEHGSLFTR
|
| |