Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2866 |
Symbol | |
ID | 4809146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3385500 |
End bp | 3386813 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640108285 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_001039257 |
Protein GI | 125975347 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTTA TAGCAGATTT GGTACATGAG AAAAAGCAAT TGGTGGAGAA AGCAGAAGCT ATTTTAAACG AAGCTGAAAA AGCAGGTGGA AGTTTGACGA ATGAACAGGA GCGACAGTTT AACCGCTACA CAGACAAAAT TAAGAGCATT AATGAAAGCA TTGACGAGGA ATTATTAAAT ATCAGAACCT CTGAGCCAAT TCTAATTACA CCACAAAAAG CTGTATCTCC TATTGAAGAA TCAAAAACAC CTGTAACAAA AGCCGTATCA AAATCATTCA GAGGGATGTT CTATGGAAAC GAAACTGTGA GCTTAAGCAA CAATGGTTTT CATTCCATGG ATGAATTCCT GAGAACACTT CACTCAGGCA GAGCCGACAA CAGGCTAATA AATGCCAGTA TGGTGGAAGG GATACCTGAA TTCGGCGGAT ATTCCGTACC GGAGGAATAC GGAGCCTTCC TGATGGATAA ATCCCTGGAG AATGAAATCA TCCGTCCAAG AGCAACGGTA TGGGCAATGG GAAGTGAAAC AAAGAAAGTA CCAGCCTTTG ACGGAGCAGA CAGAACCAAC AACCTATTCG GCGGCATCTC GGGCGAATGG CTTGAAGAAG GACAGACAGG CACACGAAAA ACAGCCAAGT TAAGGCTGAT TCAACTGAAA GCCAAGAAGC TGGCTTGTTT CTCACAGGCA TCCAATGAAC TTATTGCAGA TGGGATGTCC TTTGAAGAAA TGTTAGCTGG AGCACTCATT AAAGGCTTGG GCTGGTACAT GGACTATGCC TTTATCAATG GAACCGGTGA AGGCCAGCCT CTTGGTATTA TAAATGACCC GGCGCTGATT ACTGTAAATA AAGAGGACTC TCAAGAACCA GCTACAATTA CCTATCAGAA TGTTGTCAAT ATGTTCTCAA GGCTTGCTCC ATCCTGCTTT ACCAATGCGG TATGGCTTGC CAATCCATCG GTAATACCAC AATTACTTAC CATGACTATC ACCATTGGTA CCGGTGGCGC TCAGATACCG GTGTTCAGGG AAGAGAGCGG GAAATTCACG CTTCTGGGTA AGGAGGTCTT ATTCACTGAG AAATGCCCCG CATTGGGTGC TAAGGGAGAT TTAATCCTTG CAGATCTTTC CCAGTATGCC ATAGGCATGA GGAAAGAGAT CGCTCTTGAC CGCTCCAATG TCCCAGGCTG GATGGAGGAT ATGACCGACT ACAGGGTGAT AGTGCGTGTA GATGGTCAGG GAACCTGGGA TAAACCTATA ACACCGAAAA ACGGAGCAAC GCTCTCATGG GCAGTGGCTC TGGAGGCAAG ATAG
|
Protein sequence | MRFIADLVHE KKQLVEKAEA ILNEAEKAGG SLTNEQERQF NRYTDKIKSI NESIDEELLN IRTSEPILIT PQKAVSPIEE SKTPVTKAVS KSFRGMFYGN ETVSLSNNGF HSMDEFLRTL HSGRADNRLI NASMVEGIPE FGGYSVPEEY GAFLMDKSLE NEIIRPRATV WAMGSETKKV PAFDGADRTN NLFGGISGEW LEEGQTGTRK TAKLRLIQLK AKKLACFSQA SNELIADGMS FEEMLAGALI KGLGWYMDYA FINGTGEGQP LGIINDPALI TVNKEDSQEP ATITYQNVVN MFSRLAPSCF TNAVWLANPS VIPQLLTMTI TIGTGGAQIP VFREESGKFT LLGKEVLFTE KCPALGAKGD LILADLSQYA IGMRKEIALD RSNVPGWMED MTDYRVIVRV DGQGTWDKPI TPKNGATLSW AVALEAR
|
| |