Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1628 |
Symbol | |
ID | 4809323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1954690 |
End bp | 1955892 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640107044 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001038045 |
Protein GI | 125974135 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAA TACTGGAACT GCGTGAAAAA CGCGCGAAAG TAGGGGAAGC TGCTAAAGCT TTCCTCGACA GCAAACGCGG GAACGACGGA CTGCTTTCAC CGGAGGATAC CGCAACTTAT GAAAAAATGG AAGCCGACGT TATTGCGCTG GGCAAAGAAA TAGAGCGTCT TGAGCGTCAG GCTGCCATAG ATTTGGAACT GTCAAAACCG TTGAATATTC CTATTACAGA CAAACCCACT TCCATATCTG GCAACAATGA AAAAACCGGA CGTGCCAGCG ATGAGTACAG GCAGTCTTTC TGGAACATGA TGCGCGGCAG GCGCAAATAT GACGTACACA ACGCGCTGCA GATTGGAGAG GACACCGAAG GTGGATATCT TGTTCCCGAC GACTTTGAGC GTACTCTTGT GGAAGCACTG GAGGAGGAGA ATATCTTTAG GCAGATTGCC AATGTTATTA CCACGTCCAG CGGTGACAAG AAAATTCCTG TGGTGGCAAG CAAGGGTACT GCATCCTGGG TGGATGAGGA AGGCCAGATT CCCGAAAGCG ATGACTCCTT TGCACAGGTA TCCATCGGCG CATATAAGCT GGCTACTATG ATCAAGGTGT CAGAGGAATT GTTAAACGAC AGTGTATTTA ACCTTGAACA GTATATAGCC AAAGAATTCG CCCGCCGAAT CGGAGCAAAA GAGGAGGAAG CATTTTTTAT CGGCGACGGA TCTGGCAAGC CAACCGGTAT CTTGGCGGAT AACGGCGGTG GCGAGATAGG AGTAACCGCG GCGAGTGCAA CAGCCATTAC CCTTGACGAG ATCATGGACT TGTTCTACAG CCTAAAGTCT CCGTACCGCA GGAACGCTGT ATTCATTATG AATGATTCGA CAATTAAAGC TATAAGGAAG CTCAAAGACA ACAACGGTCA GTATCTCTGG CAGCCTTCTG TAACTGCTGG AACACCGGAT ACTATCCTCA ATCGTCCAGT TAAAACTTCT GCATTTATGC CAGCCATTGC CGCCGGAGCA AAAACGATTG TATTCGGCGA TTTTTCTTAT TACTGGGTGG CAGACCGTCA AGGCAGGGTT TTTAAGCGGC TTAATGAGTT GTATGCTGCG ACCGGACAAG TTGGATTCAT GGCAACCCAG CGTGTAGATG GCAAGCTGGT ACTGTCTGAA GCAGTCAAGA TACTGCAGCA GAAATCAACT TAA
|
Protein sequence | MSKILELREK RAKVGEAAKA FLDSKRGNDG LLSPEDTATY EKMEADVIAL GKEIERLERQ AAIDLELSKP LNIPITDKPT SISGNNEKTG RASDEYRQSF WNMMRGRRKY DVHNALQIGE DTEGGYLVPD DFERTLVEAL EEENIFRQIA NVITTSSGDK KIPVVASKGT ASWVDEEGQI PESDDSFAQV SIGAYKLATM IKVSEELLND SVFNLEQYIA KEFARRIGAK EEEAFFIGDG SGKPTGILAD NGGGEIGVTA ASATAITLDE IMDLFYSLKS PYRRNAVFIM NDSTIKAIRK LKDNNGQYLW QPSVTAGTPD TILNRPVKTS AFMPAIAAGA KTIVFGDFSY YWVADRQGRV FKRLNELYAA TGQVGFMATQ RVDGKLVLSE AVKILQQKST
|
| |