Gene Information |
Locus tag | Cthe_1616 |
Symbol | |
ID | 4809311 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1945599 |
End bp | 1948121 |
Gene Length | 2523 bp |
Protein Length | 840 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107032 |
Product | phage minor structural protein |
Protein accession | YP_001038033 |
Protein GI | 125974123 |
COG category | [S] Function unknown |
COG ID | [COG4926] Phage-related protein |
TIGRFAM ID | [TIGR01665] phage minor structural protein, N-terminal region |
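The coordinate and length fields above are tied together by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive (the reported values are consistent with this) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1945599, End bp 1948121.
length = gene_length_bp(1945599, 1948121)
print(length, protein_length_aa(length))  # prints "2523 840"
```

Both results match the Gene Length and Protein Length fields of the record.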
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATAA AATCAATTCT AACGAGCCAA GAGGATTTTA CCGGTGAGTT TCCTGTAACA TCAAGGACGT CTGCTTTATG GCGATTTAAT GAAAAAACAC CAGACGAAAA TCTTCTGCTT ATGGATTCAT CGGGACATGG CAGACATTTT ACCATCTCCG GCTGGTCAGG GACATCGGCA AACCTTATTG CTGGAAGATT CGGAAGATAC TTTAGGCAAA ATATTGTTAA TCCGACTTCT GAAAAGACCC ATCTTATAGC AGAAAATGAT GGGAGTTTCT TTAGCAATCT GGGCGAAAAG ATTGTTGTAG GCGGTTGGAT TAATCCTACC ACCTATTCGG TCGGCCAGAC ATATATATCC ATATTCAATA CCCGCCAAGG ACCTGGTCAG CCAATTCTTT ATGTTTCACT TTATCAAGGA AGACTTAGGC TGATGTTGTA TAACTCCTCC GGCACACTAA TCTACGACCA GAGTGAAACA GCTACCATTA CCTTAAAAAA CGGCGGCTGG TACTTTATCG CCTCCATCAT TGAAGTAAAC AACAAAAAGG TGCAGAACAT CATATGCGAT CGCAGCGACG GGGCAACCTG GGTGTCACCT GTGCGTTCCT TTTCGGGAGA GCTGAATCGG GAATGTATAG CAGACATTAT TATGGGGATG CATGCAAATA CCTACTACTA TGCCGGAGGC TTCGACGACT GGTTTCTGGA AACGGACTCA CAGCTTACAG CTGATGATTT GCTGTTATAT TTTAAGTCGT CTTTACAAGC AAACGGTGGG GATGCGGCTT CGGATGTAGA TGCTTTGGCA GAGCCTGGCG CAGTCACCCT TAAAGCAACA GATGGCGAGT ATCCTGCAAG TGGCGTACTT TATACAAGGG CGGTTCCATG TGCATTATCG GGCAGCGGTC GTGTAGCTGT GACAAGCGAA TATACTGCAG GTGTTACTTC AGTGTCTCTA GTAGAGACCA GCACAAGCGA TGATCTTGAA GAATGGTCTG CATGGCAGGC TGTGGGAACC AGCGGTGAAC TTCAATCGCC AAATCGGCAA TATATAAGGT TCCGTGTTAC CCTTACCAGC AGCGATCCGT TGAAGACGCC AAAACTTCTG GAAATACAGC TTCATGATAT ACCGAAAGCG CCCTATGAGA AATTAGGCTT TGCCCGTCCT GTGATTTTGG ACAAAAACGG AGCATGGGAA GCTGTTCTTG AAAATGCCTT TGATATCATT GTCACTGGTG AGGTGAACGG CGCGGATACG CTGGAATTCA AGCTTCCGTT CCATGATCCA AAAAGAAGCA CACTGGAAAA TGAAAAACAA GTGCAAATCG TAAATGACAT TTACCGGATC CGAACCTTAA CGGACAATAA AGGCGAAGAT GGGCGTGTTA TCACGCAAGT ATATGCTGAA GCGGTATTTT ACGATCTGTC TTTCAGTGCG GAAAAAGAAC CTAGAGAATT CAATGCAGAT ACTGCAGATG TTCCGATGCA ATATGCACTT TTGGGTACAG GCTGGACAGT AGGAAATGTT ACTGTCACTA CGAAACGGAC ATGGCAGTGT ACAGAAAAAA ATGCCTTATC CATCCTTCGC ACCGTACAGA ATATTTATGG CGGCGATCTG GTGTTTGACA GCGCCAACCG CCAGGTACAC CTTTTGACTT TTAGTGGTAC TGATAGCGGA GCGCTTTTTT CATATAGAAA GAATTTGAAA AGTATTCAGC GGGTAGTCGA TACACGTGAA TTAGTGACAA AGCTCTATGC TTATGGAAAG GACGGATTGA CCTTCGCTTC AATTAATGGA 
GGTAAGGAAT ACGTGGAAGA TTACACTTTT TCCAGTGAAG TGAGGGTGTC GACGCTTGAT TGTTCGTCGT TTACAAATCC GTATCAGATG CTGGAATATG CAAAAATGCG GCTTGCAGAA TATTCGAAGC CTCGCGTCTC TTATGTGCTG TCGGCAATGG ATTTATCTGC GCTAACCGGT TATGAGCACG AAGCATGGAA ACTGGGTGAT ATTGTTACAG TGGACGATAA AGAACTAGGC CTTTTGGTAA AGACTCGTGT TGTGAGAAGG CAGTATAACT TGCAGGAACC ATGGAAAACA GTGATTGAGC TTTCAACTAA ACTGCGGGAA CTTGGCGATT CTTCAGCACA GTGGGACAAG GCAGCGGATG CGCTGTCCTC AGCAGAGTTG ATAAACCGTC AGGAAATTAA AGATATGGTA CCATTCAACC ATCTGCGCAA TTCCAGAGCG GATGATGGTT TTGCCTACTG GGTCAATTCC GGTTTTGAAG TGGATACTGA AAATGGTGTT TCGGGAACTG CTTCCTTCAA GGCTGTCGGT GTACCTGGTA TGAAAAAGAG CCTTTCACAG ACGGTATATC CAGCAACGCG TAAAAGCTAC ACTTTTTCAG CACAAATTGC TTCCGAAAGC CTCGAAAAGG GTGAAAACGG CCAAGTTGGT GTTGAGATAG TCATTGAATA CGAGGACGGT ACAACAGAAA CAAGATTTAT AGACCTGATT TGA
|
Protein sequence | MAIKSILTSQ EDFTGEFPVT SRTSALWRFN EKTPDENLLL MDSSGHGRHF TISGWSGTSA NLIAGRFGRY FRQNIVNPTS EKTHLIAEND GSFFSNLGEK IVVGGWINPT TYSVGQTYIS IFNTRQGPGQ PILYVSLYQG RLRLMLYNSS GTLIYDQSET ATITLKNGGW YFIASIIEVN NKKVQNIICD RSDGATWVSP VRSFSGELNR ECIADIIMGM HANTYYYAGG FDDWFLETDS QLTADDLLLY FKSSLQANGG DAASDVDALA EPGAVTLKAT DGEYPASGVL YTRAVPCALS GSGRVAVTSE YTAGVTSVSL VETSTSDDLE EWSAWQAVGT SGELQSPNRQ YIRFRVTLTS SDPLKTPKLL EIQLHDIPKA PYEKLGFARP VILDKNGAWE AVLENAFDII VTGEVNGADT LEFKLPFHDP KRSTLENEKQ VQIVNDIYRI RTLTDNKGED GRVITQVYAE AVFYDLSFSA EKEPREFNAD TADVPMQYAL LGTGWTVGNV TVTTKRTWQC TEKNALSILR TVQNIYGGDL VFDSANRQVH LLTFSGTDSG ALFSYRKNLK SIQRVVDTRE LVTKLYAYGK DGLTFASING GKEYVEDYTF SSEVRVSTLD CSSFTNPYQM LEYAKMRLAE YSKPRVSYVL SAMDLSALTG YEHEAWKLGD IVTVDDKELG LLVKTRVVRR QYNLQEPWKT VIELSTKLRE LGDSSAQWDK AADALSSAEL INRQEIKDMV PFNHLRNSRA DDGFAYWVNS GFEVDTENGV SGTASFKAVG VPGMKKSLSQ TVYPATRKSY TFSAQIASES LEKGENGQVG VEIVIEYEDG TTETRFIDLI
|
| |
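The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes GC content is defined as the percentage of G and C bases over all bases in the sequence; the record does not state how its value was computed or rounded, so this definition and the helper name `gc_percent` are assumptions. Whitespace is stripped first because the sequence is printed in space-separated blocks of ten bases.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First two ten-base blocks of the gene sequence, as printed in the record.
print(gc_percent("ATGGCGATAA AATCAATTCT"))  # prints 30.0
```

Passing the full gene sequence from the record to the same function would give a figure to compare against the reported GC content.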