Gene Information |
Locus tag | Cthe_1719 |
Symbol | |
ID | 4808894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2039699 |
End bp | 2040988 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107132 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001038133 |
Protein GI | 125974223 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
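The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2039699, 2040988)
print(length, protein_length_aa(length))  # 1290 429
```

Under these assumptions the result matches the Gene Length (1290 bp) and Protein Length (429 aa) fields reported for Cthe_1719.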
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATATGT CAGAAAAAAT GAAAGAATTA TTAGCTCAGT TATCAAATTT AGAAACAGAA TCTAAAAACC TTATAAATAA AGAGGAGGCT ACAGCTGATG AAATTAATGC AAAGCTTTCT GAAATTAAGG CTTTAAAAGC TAAAATTGAG GCACAAAAAG AAATTGATGC ATTAAATGCA GAAAGAGAAA AACAGGCTAA GACGCCAGTG AATGAACCAA TATATGCTCA GCCAAAGAAT CACAATGAAA AGAAGTGGAA GTGCATGGGA GAATTTTTAA GTGCCGTTGC AAAGGCTTCA TCTCCCGGAG GAAGAATGGA CAACAGATTA ACTTATCAGA ACTCAGCAAC AGGACTTAAT GAAAGCATAG CTTCAGAAGG AGGATTTTTA CTAGAAAATG AGTTTATAAA TGACTTATTT GAATCCATGA TGGCACAAAG TCAGGTGGCA AACAGAATAA GAATGATACC AATAGGGGCT AATACCAATA GACTTAGGGC GCTTGGAATT GATGAAAACA GCAGAGCCAA TGGCTCAAGA TGGGGAGGTG TACAGGCTTA CTGGGTAGCT GAAGCAGAAA CAGCAGCTCA AAGCAAGCCA AAGTTTAGGG AAATTGAAAT GTCACTTCAA AAGCTTTTAG CACTTTGCTA TGTAACCGAT GACCTTTTAC AAGATACTAC AGCACTTGAA GCTATAGTAA GGCAAGCTTA TGCAGATGAA ATGAGTTTTA AAATAGATGA TGCAATCATT AATGGTACTG GTGTTGGAAT GCCCCTTGGA ATATTAAACT CTGATGCATT AGTTACAGTA CCCAAGGAAA AAGATCAAGG AGCAGGAACA ATTAAGTATG AAAATATACT TAAAATGTGG AGTTCAATGC CTGCAAGACT TAGAGCAAAT GCAGTATGGT ATATAAATCA AGAGATAGAA CCACAGCTTT ACACTATGGC TCTTAATATT GGAGCTGGTG GAGCACCTGT GTTTATGCCT TCCGGTGGAG CTGCAGCATC ACAGTACAGT ACCTTACTTA ATAGACCAAT AATTCCAATA GAGCAGTGTT CACCTCTTGG TAAAAAGGGA GATATTATTT TAGCTGACCC AACCCAGTAT ATTGGAATAG ATAAAAAAGG TTTAACTTCT GATGTATCTA TCCATGTAAG ATTTTTATAT GATGAGCAGG TATTCAGATT CATCTATAAG TTCAATGGAA TGCCTTATAA GAATAAGCCA ATTATGCCTT ACAAGGGTGC AAATCCACTA AGTCCTTTTG TAACTTTAGC AGATAGGTAG
|
Protein sequence | MYMSEKMKEL LAQLSNLETE SKNLINKEEA TADEINAKLS EIKALKAKIE AQKEIDALNA EREKQAKTPV NEPIYAQPKN HNEKKWKCMG EFLSAVAKAS SPGGRMDNRL TYQNSATGLN ESIASEGGFL LENEFINDLF ESMMAQSQVA NRIRMIPIGA NTNRLRALGI DENSRANGSR WGGVQAYWVA EAETAAQSKP KFREIEMSLQ KLLALCYVTD DLLQDTTALE AIVRQAYADE MSFKIDDAII NGTGVGMPLG ILNSDALVTV PKEKDQGAGT IKYENILKMW SSMPARLRAN AVWYINQEIE PQLYTMALNI GAGGAPVFMP SGGAAASQYS TLLNRPIIPI EQCSPLGKKG DIILADPTQY IGIDKKGLTS DVSIHVRFLY DEQVFRFIYK FNGMPYKNKP IMPYKGANPL SPFVTLADR
|
| |
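The gene and protein sequences above can be cross-checked by translating the nucleotide sequence, and the GC content field can be recomputed in the same way. The following is a minimal sketch using only the Python standard library. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard code for non-initiator positions. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGTATATGT CAGAAAAAAT GAAAGAATTA"))  # MYMSEKMKEL
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after removing spaces) gives a simple integrity check for the record, and `gc_percent` on the same input can be compared with the GC content field.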