Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0052 |
Symbol | |
ID | 4808747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 67772 |
End bp | 69031 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105461 |
Product | hypothetical protein |
Protein accession | YP_001036486 |
Protein GI | 125972576 |
COG category | [S] Function unknown |
COG ID | [COG1306] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.115032 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGCCA AAAATGCACA AACAATTTTA AGCAACAAAT GTTTTTCAGT AATTTTAGTG GTATTGGTTT TAGTTTTTTC TTTAGCAGCC TGCAAAGATA CAGGTGTCCA GGTAAACAAT CCGGCACCTA CTGCACAAGC GACCGATAAT GTAGAAGCAA ACAATGGCGG TAATACGGAC GGCGAAGCTC CTGAAACTTC AGAACCGAGT GAAGAAAGTC CGGTACCGAA CAATTCCGGC GAGGATGCCA AACCTGTAAA GAAGGATATC AAGGTAAAAG CATTGTATCT TACAGGATGG ACTGTGGGAA GTGATGAAAG ACTGCAGCAT TATGTTGATC TTGCAAACAG GACGGAGATT AATGCCTATG TTGTTGACAT CAAGGATGAT GACGGATATG TCGGTTATGA GTCGAATATA CCGGCAGTGA GAGAAATAGG CGCATGGAAG AGTAAGTACA ATGTGGACAA AGTATTGAAA ACCTTCCATG ATAACAATAT TCATGTTATT GGAAGATTGG TGTGCTTTAA GGACCCTGTT TTATCTTCCA AAAAGCCGGA GTTGGCGGTT AAGAGCGTAA ATGGAGGTTC CTGGCGTGAC AATCATAATC TTACATGGCT GGACCCCTAT AACAAGGATT CATGGCCTTA TTTGATTGAG ATAGCCAAGG AAGCGGTTGA AAAAGGTTTT GATGAAATAC AGTTTGACTA TATCAGATTT CCCAATGACG GAAGCAAAAA GAGCATGAGC TTTAATACCG GCGGCAAGGA AAAGCACGAA ATAATAAATG AGTTTTTGGC TTATGCCAGA GAGCAGCTTC CGGGAGTTGT CCTGTCCGCG GATGTGTTTG GGATAATACT GGAGAGTCCG GCAGATACCG AAGATATCGG TCAATATCTT GAAAAGATAG TAAAAGATGT GGATTATATT TCGCCGATGG TCTATCCGTC CCACTATGCC GTTGGACAGA TAGTTAACGG TGTTCAGTTC ATGAAGCCGG ATCTTGATCC TTATGGAGTG GTGTATCAAA GTCTTGTTAA GTGCAATAAC AGACTGGCAC AGGTGGAAGG CTACAAAGCC GATGTAAGGC CATATATCCA GGACTTTACT GCATCGTGGC TTGGCAAGGG TTATTACCAG AGTTACGGAC CCGAGCAGCT AAGACAGCAA ATTCAGGCCG TTTATGATGC AGGCTATGAG GAATGGATCT GCTGGGATGC GAACAATACG TACTCGGAAG AAGCATTCTT GAAAGAGTAA
|
Protein sequence | MSAKNAQTIL SNKCFSVILV VLVLVFSLAA CKDTGVQVNN PAPTAQATDN VEANNGGNTD GEAPETSEPS EESPVPNNSG EDAKPVKKDI KVKALYLTGW TVGSDERLQH YVDLANRTEI NAYVVDIKDD DGYVGYESNI PAVREIGAWK SKYNVDKVLK TFHDNNIHVI GRLVCFKDPV LSSKKPELAV KSVNGGSWRD NHNLTWLDPY NKDSWPYLIE IAKEAVEKGF DEIQFDYIRF PNDGSKKSMS FNTGGKEKHE IINEFLAYAR EQLPGVVLSA DVFGIILESP ADTEDIGQYL EKIVKDVDYI SPMVYPSHYA VGQIVNGVQF MKPDLDPYGV VYQSLVKCNN RLAQVEGYKA DVRPYIQDFT ASWLGKGYYQ SYGPEQLRQQ IQAVYDAGYE EWICWDANNT YSEEAFLKE
|
| |