Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2848 |
Symbol | |
ID | 4809128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3365247 |
End bp | 3366560 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108268 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_001039240 |
Protein GI | 125975330 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTTA TAGCAGATTT AGTACATGAG AAAAAGCAAT TGGTGGAGAA AGCAGAAGCT ATTTTAAACG AAGCGGAAAA AGCAGGTGGA AGTTTGACTA AGGAACAGGA GCAACAGTTT AACCGCTACA AAGACAGAAT AACACGCATC AATGATAGCA TTGATGAAGA ATTATCAAAA ATCAGAACCT CTGAGCCAAT ATTGAATATG CCGCACAATC CCATGGCTCG TGAAGATGTG TCAAAAATTC CGGTAACAAA GGCTATATCA AAATCATTCA GAGGGATGTT CTATGGAAAC GAAACTGTGA GCTTAAGCAA CAATGGTTTT CATTCCATGG ATGAATTCCT GAGAACACTT CACTCAGGCA GAGCCGACAA CAGGCTAATA AATGCCAGTA TGGTGGAAGG AATACCCGAA TTCGGCGGAT ATTCCGTACC GGAGGAATAC GGAGCCTTCC TGATGGATAA ATCCCTGGAG AATGAGATCA TTCGTCCCAG AGCAACAGTA TGGGCAATGG GAAGTGAAAC AAAGAAAGTA TCAGCCTTTG ATGGAGCAGA CAGAACCAAT CACCTATTCG GCGGTATCTC AGGAGAATGG CTGGAGGAAG GACAGACAGG CACACGAAAA ACAGCCAAGT TAAGGCTGAT TCAACTGAAA GCCAAGAAGC TGGCCTGCTT CTCACAGGCA TCCAATGAAC TTATTGCAGA TGGTATGTCC TTTGAGGAAA TGTTAGCTGG AGCGCTTATT AAAGGCTTGG GCTGGTACAT GGACTATGCC TTTATCAATG GAACCGGTGA AGGCCAGCCT CTTGGTATTA TAAATGACCC GGCACTGATT ACTGTAAATA AAGAGGCTTC TCAAGAACCA GCCACAATTA CCTATCAAAA CGTGGTCAAT ATGTTCTCAA GGCTTGCTCC GTCATGTTTT ACCAATGCGG TATGGCTTGC CAATCCATCG GTAATACCAC AATTGCTCAC CATGACCATT ACCATTGGTA CCGGTGGCGC TCAGATACCG GTATTCAGGG AAGAGAGCGG GAAATTCACA CTTCTGGGTA AGGAGGTCTT ATTCACTGAG AAATGCCCCA CATTGGGTGC TAAGGGAGAT TTAATCCTTG CAGATCTTTC CCAGTATGCC ATAGGCATGA GGAAAGAGAT CGCTCTTGAC CGCTCCAATG TCCCAGGCTG GATGGAGGAT ATGACCGACT ACAGGGTGAT AGTGCGTGTA GATGGTCAGG GAACCTGGGA TAAACCTATA ACACCGAAAA ACGGAGCAAC GCTCTCATGG GCAGTGGCTT TGGAAGCGAG ATGA
|
Protein sequence | MRFIADLVHE KKQLVEKAEA ILNEAEKAGG SLTKEQEQQF NRYKDRITRI NDSIDEELSK IRTSEPILNM PHNPMAREDV SKIPVTKAIS KSFRGMFYGN ETVSLSNNGF HSMDEFLRTL HSGRADNRLI NASMVEGIPE FGGYSVPEEY GAFLMDKSLE NEIIRPRATV WAMGSETKKV SAFDGADRTN HLFGGISGEW LEEGQTGTRK TAKLRLIQLK AKKLACFSQA SNELIADGMS FEEMLAGALI KGLGWYMDYA FINGTGEGQP LGIINDPALI TVNKEASQEP ATITYQNVVN MFSRLAPSCF TNAVWLANPS VIPQLLTMTI TIGTGGAQIP VFREESGKFT LLGKEVLFTE KCPTLGAKGD LILADLSQYA IGMRKEIALD RSNVPGWMED MTDYRVIVRV DGQGTWDKPI TPKNGATLSW AVALEAR
|
| |