Gene Information |
Locus tag | Cthe_1721 |
Symbol | |
ID | 4808896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2041764 |
End bp | 2043023 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640107134 |
Product | HK97 family phage portal protein |
Protein accession | YP_001038135 |
Protein GI | 125974225 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
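The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of any database API: the function names are hypothetical, and it assumes that Start bp and End bp are inclusive 1-based coordinates and that Gene Length includes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its length
    includes exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2041764, 2043023)
print(length, protein_length_aa(length))
```

With the values from this record, the results agree with the listed Gene Length (1260 bp) and Protein Length (419 aa).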
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAACTAA AAGATAGAGT GAAGTTATTT TTAACGCCCC AGAATGCCTT GTTTGAAGTC CTTCAGAAAT ATTCGGAGGA TTTTTTAAAT GGTGAAGAAG TTGTAAAAGA TAATTTTAAA ATAGATACAG CAGCGGCTAT GAGTTTTTCT GCAGTTTTTG CCTGCAATAG AGTCCTTTCT GAAACCTTGG CAAGCTGTCC TATAATGCTT TATGAAAAAG ATGATAAAGG AAACAGAAGG CAGGTTACAG ATACTGCTGA ATATGGTGTA CTTCATTATG CACCAAATGC AGAAATGACA CCAGTGCAGT TTAAAGAGTT TGGTATGACA AATATAAACC TTGGGGGAAA CTTTATAGCA CAAAAGGTTT TTAATATGCA TGGAGAGCTT TTAGAACTTA GACCAATAGC ATGGGACAGA GTAAGAATTG ATATAGATAA ATCTACAGGA AGGCTTCTTT ATTATATTGA TGGAAAGCAA GAACCTAAAA CAAGAGATGA AATATTTCAT ATTCCGGGAC TCACTTTAGA CGGGTATATA GGAATAACAC CTCTTAGTTA TGCGGCACTT ACTATTGATA TTGGATTATC TCAGGACACC TTTGAAAGAA ATTTTTATCA TAACAGGGCT TCAACCAGCG GTATTTTTCA GTATCCTAAC GAGCTTTCAG ATGAAGCATT TCAAAGGCTT AAAAAGGATA TTAAGAAGAA CTACACAGGA CTTTCTAATG CAGGAGTTCC AATGATTCTT GAAGGCGGCG GTCAGTTTAA GGAAATAACC ATGAAGCTTA CAGATGCACA GTTTTTAGAA TCCAAGAGAT TCAGAATTGA AGATGTGTGC AGAATTTTCA GAGTACCACT TCATCTGGTG CAGGATTTAA CAAGATCCAC AAATAACAAT ATTGAACATC AGAGCTTAGA GTTTATTGTT TACACTATGC TGCCGTGGTT TAAAAAATGG GAAGAAAATT TAAATCTTCA GCTTTTATCA AAAGAATCAA GAAGAAAAAA CAGATATTTT GAATTTAATA TCAGTGGACT ACTCCGTGGA GATATTAAAT CAAGATATGA AGCCTATGCA CAAGGAAGAC AGTGGGGATG GCTTTCTGTT AATGATATTA GAAGGCTTGA AAATATGAAT CCTATTGATA ACGGTGACAG ATATCTCGAA CCTCTCAATA TGAGCGAAGC AGGAAAACAG GAAGAGCAGC TTAAAGCACT AAGGGAAGAA ATATTTAATA TGATTAATGA AAGGAAGTGA
|
Protein sequence | MKLKDRVKLF LTPQNALFEV LQKYSEDFLN GEEVVKDNFK IDTAAAMSFS AVFACNRVLS ETLASCPIML YEKDDKGNRR QVTDTAEYGV LHYAPNAEMT PVQFKEFGMT NINLGGNFIA QKVFNMHGEL LELRPIAWDR VRIDIDKSTG RLLYYIDGKQ EPKTRDEIFH IPGLTLDGYI GITPLSYAAL TIDIGLSQDT FERNFYHNRA STSGIFQYPN ELSDEAFQRL KKDIKKNYTG LSNAGVPMIL EGGGQFKEIT MKLTDAQFLE SKRFRIEDVC RIFRVPLHLV QDLTRSTNNN IEHQSLEFIV YTMLPWFKKW EENLNLQLLS KESRRKNRYF EFNISGLLRG DIKSRYEAYA QGRQWGWLSV NDIRRLENMN PIDNGDRYLE PLNMSEAGKQ EEQLKALREE IFNMINERK
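The relationship between the two sequences can be sketched in code. The example below is illustrative only: the helper names are hypothetical, and it assumes the Gene sequence field is given 5' to 3' in coding orientation (which its TTG start and TGA stop suggest, even though the gene lies on the minus strand). It uses the codon assignments and alternative start codons of NCBI translation table 11, under which an initiating TTG is read as methionine.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalize case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence up to the first stop."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any valid initiator codon encodes Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First 21 bases of the Gene sequence field, spacing preserved.
print(translate("TTGAAACTAA AAGATAGAGT G"))  # prints MKLKDRV
```

The seven residues produced match the start of the Protein sequence field. Applying `gc_fraction` to the full gene sequence gives a value that can be compared against the GC content field.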