Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1985 |
Symbol | |
ID | 4810917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2364837 |
End bp | 2366108 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107401 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_001038396 |
Protein GI | 125974486 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGAGAAAG CAGAAGCTAT TTTAAACGAA GCGGAAAAAG CGGGTGGAAG TTTGACAAAG GAACAGGAGC GACAGTTTAA TCGCTACACA GACAAAATAA AGAGCATTAA TGAAAGCATT GACGAGGAAT TATTAAATAT CAGAACCTCT GAGCCAATAC TGAATATGCC ACAAAAAGCT GTATTTCCTA TTGAAGAATC AAAAACTCCT GTAACAAAAG CCGTATCCAA ATCATTCAGA GGGATGTTCT ATGGCAATGA AACAGTAAAA CTCAGCAACA ATGGATTTCA TTCCATGGAT GAATTCCTGA GAACACTTCA CTCAGGCAGA GCCGACAACA GGCTAATAAA TGCCAGTATG GTGGAAGGAA TACCCGAATT CGGCGGATAT TCCGTACCGG AGGAATACGG AGCCTTCCTG ATGGATAAAT CCCTGGAGAA TGAAATCATC CGTCCCAGAG CAACAGTATG GGCAATGGGA AGCGAAACAA AGAAAGTACC AGCCTTCGAT GGAGCAGACA GAACCAATCA CCTATTCGGT GGTATTTCAG GAGAATGGCT GGAGGAAGGT CAGACAGGCA CACGAAAGAC CGCCAAGCTA AGACTGATCC AATTAAAGGC CAAGAAGCTT GCCTGCTTCT CACAGGCATC CAATGAACTC ATTGCAGATG GTATGTCCTT TGAAGAAATG CTTGCCGGAG CGCTTATTAA AGGCTTGGGC TGGTACATGG ACTATGCCTT TATCAATGGA ACCGGTGAAG GTCAGCCTCT TGGTATTATA AATGACCCGG CGCTGATTAC TGTAGATAAA GAGGACTCTC AAGCAGCAGC CACAATTACC TATCAAAACG TGGTCAATAT GTTTTCAAGG CTTGCTCCGT CCTGTTTTAC CAATGCGGTA TGGCTTGCCA ATCCATCGGT AATACCACAA TTGCTTACCA TGACTATCAC CATTGGTACC GGTGGCGCTC AGATACCGGT GTTCAGGGAA GAGAGCGGGA AATTCACGCT TCTGGGTAAG GAGGTCTTAT TCACTGAGAA ATGCCCCGCA TTGGGTGCTA AGGGAGATTT AATCCTCGCA GACCTTAGCC AGTATGCCAT AGGCATGAGG AAAGAGATCG CTCTTGACCG CTCCAATGTC CCAGGCTGGA TGGAGGATAT GACCGACTAC AGGGTGATAG TGCGTGTAGA TGGTCAGGGA ACCTGGGATA AACCTATAAC ACCGAAAAAC GGAGCAACGC TCTCATGGGC AGTGGCTTTG GAGGCAAGAT AG
|
Protein sequence | MEKAEAILNE AEKAGGSLTK EQERQFNRYT DKIKSINESI DEELLNIRTS EPILNMPQKA VFPIEESKTP VTKAVSKSFR GMFYGNETVK LSNNGFHSMD EFLRTLHSGR ADNRLINASM VEGIPEFGGY SVPEEYGAFL MDKSLENEII RPRATVWAMG SETKKVPAFD GADRTNHLFG GISGEWLEEG QTGTRKTAKL RLIQLKAKKL ACFSQASNEL IADGMSFEEM LAGALIKGLG WYMDYAFING TGEGQPLGII NDPALITVDK EDSQAAATIT YQNVVNMFSR LAPSCFTNAV WLANPSVIPQ LLTMTITIGT GGAQIPVFRE ESGKFTLLGK EVLFTEKCPA LGAKGDLILA DLSQYAIGMR KEIALDRSNV PGWMEDMTDY RVIVRVDGQG TWDKPITPKN GATLSWAVAL EAR
|
| |