Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1707 |
Symbol | |
ID | 4808882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2029269 |
End bp | 2031173 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640107120 |
Product | phage minor structural protein |
Protein accession | YP_001038121 |
Protein GI | 125974211 |
COG category | [S] Function unknown |
COG ID | [COG4926] Phage-related protein |
TIGRFAM ID | [TIGR01665] phage minor structural protein, N-terminal region |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCTATG TATATGATAA GAAGACAACT AGAGGAAACT TTGACAATAA TGGGCTTGCT GTTTTAGATG AATGCCTTAT GGCTGAGATA AATGAAGAGT TAAATGGAGA TTATAGTTTA GAAATTGAAT ACCCTGCTCA ATCTAAAAAA GCACAGTATT TAGAGGAACT CAATATTATT AAAGCTGATG GACAGCTTTT TAGAATATAT AAAGTAGAGA GAACACAGGA TAAAATAAGT AAAGTTAAGG TATGGGCAAG ACATATATTC TATGATCTTG CCTTTTATTT TATAGAATCA GCAAAGGTGC TTAATGCAAA TATGAAAGAA GCTCTTGAAG CAAGCATACC GCCCGAACTT CAAGGGTTGT TTTTATTCAA GGCATTAGAA GAGAATTTAG CACCTTTTGC TGTTAAAGAG GTTAATGCAG TAGATGCTAT TTTTAGACTT ATTGAAATTT ATGGTGGAGA ACTTTTTAGA GATAACTTTA ATATAGAGAT AAAAGAGTCC ATTGGTGAAA ACAATGGAAT TTTAATAAAA TACGGTAAAA ACATAAATGG AATGAAGGTT ATTGAGGATA CCAGTGAACT TGCTACAAGG ATATATGCAG TAGGAGCAAA TAATTTATTG TTGCCAGAAA GATATATAGA AGTAGAGGGA GAAAGAGCAA AATTACTTCC CTATCCCATA ACCAAAAGGG TTGAGTTTAA GGAATGCAAG GATGTAGAAA GCCTTAGAGC AAGGGCAGAA GAATATGCTG AAAAGGCTGC AAGTCCAAAG GTATTTATAA CAATAGATTT TATGGAGCTA AGTAAGACAG AAGAGTATAA AAATTATAGT CATCTTACTA AAGTAAGTGT TGGTGATTTT GTAAAGGTAA GAAATGAAAA GATAGCTGTA ACTACTGATT TAAGAGTTAT AAAGAAAAAG ACAGACCTTA TAAATCCTAT AAATACAAAG ATAGAGCTTG GTGATCCTTT AAACACAATT ATTGAAAAGC TGGATACCAG TAAGCTTTTA GAGGAAATAA ACAGTGCAAT AAGCGGTACT TTAAGCAGTG TAATAATCAA GAAAAACAGC GATACTATAA CAATCAGCAC TAGCAGCTAT CCAGCAATGA TTATAGGAAT AACAACAAAA GCAGATACAA ACTTAAATTG TAATATCACC ATGACAGGGA AAGCTAGTGC AGACTGCACA TTAACAATTC TGTTTTCCTT AGATGGAAAA TACTACGATT TTAAGCCAAT TCAAAAGTTA GCATCAGGAG ATAATGTTAT AGGATTGCCC CTTCCAATGC CACAGGTAAC GGCAGGAGAC CATACTTTTA TGGTAGAGAT GAAGGTTTCA AATGGAACAT TTACAATTGA AAAGAATAAT CTGCAGGTAA GTATTGAGGG AAGAGACTTA GAGGGTGGAC TAAGTGCAAG TATACCAAGG GCAGAGATTG TTTATACTTT CCTTTATAAT TTGTTCAATA TCAAGATTGG AACATACAGT TTTAATACAG GACACAGTTT TAAGTATTAT TTAGACAACA ATGTGAGTGG AGTAGATAGC TATAGTCTTG AGAATTTCAA TACAAGATTT AATATTGCTG CTATATCTCA TGGAAACCCA AAGTTAACCT TAAATGTAAT GGGAATTATA GAAGAGTTTA ATAACAGTAA ATCAAGCAGC TATGCTTTTG ACGCCGATTG GGTTGAGTTT AGTTCTGACT ACGATAAAAA CAAAGATGGA ACCTATGACT TTTATAATAG GGTAACAATA AAGGAGCCCA TTTTAAAAAG AGAAGGCGGA TATATTGAGG ACTTAGGAAA TGGCGTAATT TATGCTGCAA ATGTGCGGGA TACAACACTT TATAAGGATT TGATTGCAAT AAATGCATAT CTTAAGCAAG GTTAG
|
Protein sequence | MIYVYDKKTT RGNFDNNGLA VLDECLMAEI NEELNGDYSL EIEYPAQSKK AQYLEELNII KADGQLFRIY KVERTQDKIS KVKVWARHIF YDLAFYFIES AKVLNANMKE ALEASIPPEL QGLFLFKALE ENLAPFAVKE VNAVDAIFRL IEIYGGELFR DNFNIEIKES IGENNGILIK YGKNINGMKV IEDTSELATR IYAVGANNLL LPERYIEVEG ERAKLLPYPI TKRVEFKECK DVESLRARAE EYAEKAASPK VFITIDFMEL SKTEEYKNYS HLTKVSVGDF VKVRNEKIAV TTDLRVIKKK TDLINPINTK IELGDPLNTI IEKLDTSKLL EEINSAISGT LSSVIIKKNS DTITISTSSY PAMIIGITTK ADTNLNCNIT MTGKASADCT LTILFSLDGK YYDFKPIQKL ASGDNVIGLP LPMPQVTAGD HTFMVEMKVS NGTFTIEKNN LQVSIEGRDL EGGLSASIPR AEIVYTFLYN LFNIKIGTYS FNTGHSFKYY LDNNVSGVDS YSLENFNTRF NIAAISHGNP KLTLNVMGII EEFNNSKSSS YAFDADWVEF SSDYDKNKDG TYDFYNRVTI KEPILKREGG YIEDLGNGVI YAANVRDTTL YKDLIAINAY LKQG
|
| |