Gene Information |
Locus tag | Cthe_1643 |
Symbol | |
ID | 4809338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1968659 |
End bp | 1969657 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107058 |
Product | phage-associated protein-like protein |
Protein accession | YP_001038059 |
Protein GI | 125974149 |
COG category | [S] Function unknown |
COG ID | [COG3600] Uncharacterized phage-associated protein |
TIGRFAM ID | |
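The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it is the only convention under which the listed lengths agree), and that the protein length excludes the stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1968659
end_bp = 1969657
gene_length_bp = 999
protein_length_aa = 332


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks should hold if the record is internally consistent.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

With the values listed, 1969657 - 1968659 + 1 = 999 bp, and 999 / 3 = 333 codons, which is 332 residues plus one stop codon.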
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGGA ATAAGACATT TTGCGAGGAA TGCAGAAGAG ATGTCGAATA CATGGTAGAA ACAGCAACAA TTAAGGGTAA ACTTAAAGGC GAAGAATATG AGTATACTGG AAAGAAGGCT ATTTGTACGG AATGTGGGAG CGAAGTCTAT GTAGCGGATA TAGAGGACGA AAATCTAAAG GCTTTGTATG ACACGTACCG TCAAAAAAAC GGCATTATTT CGCTGGAGAA GATATTAGAA ATACCTCAGA AATACAATAT TGGCAAACGT CCGCTATCAT TGCTTTTAGG TTGGGGGGAA ATGACTTTTT CGAGATATTG TGAAGGTGAT ATGCCTACAA AACAGTATTC AGATATTCTT CAAAAGATTT ATGATGATCC AGCGTATTAT AAAGAATTAC TGGAGAAAAA TAAGGACAAT TTAAAATCTC TGCAGGCATA TGAAAAAAGT AAGCGGAAGG TACAGGAACT GCTTGGGGAA GAAAACAAAA CGGGTTCGAA GCTGGACTCA ATTATCCAAT ATCTGCTTTA TAAGTGCGAG GACATAACTC CTTTAGCTTT ACAAAAGGCA CTATATTATG TCCAGGGCTT TTATTACGCT TTTGAAGGAC GGTTTCTTTT TGAAGAAGAC TGTGAGGCAT GGGTTCATGG ACCGGTTTAC AGAGATGTAT ATAACAGGTA TTCATCTTAT CGGTTTGACC CAATTGAGAG CGTTGAAGCT TTCGATGAAT CAGTTTTTAC AACTGCTGAA AAAGCAATAT TGGATAGCGT TATTAAGAAC TTCTGCTGCT ATAGTGGAAA AATACTAGAA AAGTTTACGC ATCTGGAGAA ACCATGGCGG TATACCAGAG ACGGTTTGCC GGTGGATGCG CATTCTAATC GTGTAATACC CAAAGAATTG ATCGGGGAAT ATTTTGTCGC TGTGAAAGAA AAATTCAACA TGCTCACTCC TGGAGATATA GAAGTATACT CGAAAGCTAT CTTTGAACAA ATAAACTGA
|
Protein sequence | MNRNKTFCEE CRRDVEYMVE TATIKGKLKG EEYEYTGKKA ICTECGSEVY VADIEDENLK ALYDTYRQKN GIISLEKILE IPQKYNIGKR PLSLLLGWGE MTFSRYCEGD MPTKQYSDIL QKIYDDPAYY KELLEKNKDN LKSLQAYEKS KRKVQELLGE ENKTGSKLDS IIQYLLYKCE DITPLALQKA LYYVQGFYYA FEGRFLFEED CEAWVHGPVY RDVYNRYSSY RFDPIESVEA FDESVFTTAE KAILDSVIKN FCCYSGKILE KFTHLEKPWR YTRDGLPVDA HSNRVIPKEL IGEYFVAVKE KFNMLTPGDI EVYSKAIFEQ IN
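The gene sequence is listed in coding orientation (it begins with ATG and ends with the stop codon TGA) even though the gene lies on the minus strand, so it can be translated directly without reverse-complementing. The following is a minimal sketch of how the 10-base display groups might be joined and translated. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in which codons may act as starts, and this gene starts with ATG). The helper names `ungroup`, `translate`, and `gc_fraction` are hypothetical, and only the first three and the last display groups of the sequence are used here.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base, then second, then third, each over T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(seq):
    """Remove the whitespace between 10-base display groups."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base; '*' marks a stop codon.

    Only A, C, G and T are handled; any other symbol raises KeyError.
    """
    dna = ungroup(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = ungroup(dna)
    return sum(base in "GC" for base in dna) / len(dna)


head = "ATGAATAGGA ATAAGACATT TTGCGAGGAA"  # first three display groups
tail = "ATAAACTGA"                         # last display group

print(translate(head))  # MNRNKTFCEE
print(translate(tail))  # IN*
```

The translated head matches the first ten residues of the listed protein sequence, and the tail gives the final two residues followed by the stop codon. Calling `gc_fraction` on the full gene sequence would give a value to compare with the GC content field of the record.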