Gene Cthe_1616 details


Gene Information

Locus tag: Cthe_1616
Symbol:
ID: 4809311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1945599
End bp: 1948121
Gene Length: 2523 bp
Protein Length: 840 aa
Translation table: 11
GC content: 45%
IMG OID: 640107032
Product: phage minor structural protein
Protein accession: YP_001038033
Protein GI: 125974123
COG category: [S] Function unknown
COG ID: [COG4926] Phage-related protein
TIGRFAM ID: [TIGR01665] phage minor structural protein, N-terminal region
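
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon.

```python
# Values copied from the record above; helper names are illustrative.
START_BP = 1945599
END_BP = 1948121


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2523
print(protein_length(length_bp))  # 840
```

Both results agree with the Gene Length and Protein Length fields in the record.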


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGATAA AATCAATTCT AACGAGCCAA GAGGATTTTA CCGGTGAGTT TCCTGTAACA 
TCAAGGACGT CTGCTTTATG GCGATTTAAT GAAAAAACAC CAGACGAAAA TCTTCTGCTT
ATGGATTCAT CGGGACATGG CAGACATTTT ACCATCTCCG GCTGGTCAGG GACATCGGCA
AACCTTATTG CTGGAAGATT CGGAAGATAC TTTAGGCAAA ATATTGTTAA TCCGACTTCT
GAAAAGACCC ATCTTATAGC AGAAAATGAT GGGAGTTTCT TTAGCAATCT GGGCGAAAAG
ATTGTTGTAG GCGGTTGGAT TAATCCTACC ACCTATTCGG TCGGCCAGAC ATATATATCC
ATATTCAATA CCCGCCAAGG ACCTGGTCAG CCAATTCTTT ATGTTTCACT TTATCAAGGA
AGACTTAGGC TGATGTTGTA TAACTCCTCC GGCACACTAA TCTACGACCA GAGTGAAACA
GCTACCATTA CCTTAAAAAA CGGCGGCTGG TACTTTATCG CCTCCATCAT TGAAGTAAAC
AACAAAAAGG TGCAGAACAT CATATGCGAT CGCAGCGACG GGGCAACCTG GGTGTCACCT
GTGCGTTCCT TTTCGGGAGA GCTGAATCGG GAATGTATAG CAGACATTAT TATGGGGATG
CATGCAAATA CCTACTACTA TGCCGGAGGC TTCGACGACT GGTTTCTGGA AACGGACTCA
CAGCTTACAG CTGATGATTT GCTGTTATAT TTTAAGTCGT CTTTACAAGC AAACGGTGGG
GATGCGGCTT CGGATGTAGA TGCTTTGGCA GAGCCTGGCG CAGTCACCCT TAAAGCAACA
GATGGCGAGT ATCCTGCAAG TGGCGTACTT TATACAAGGG CGGTTCCATG TGCATTATCG
GGCAGCGGTC GTGTAGCTGT GACAAGCGAA TATACTGCAG GTGTTACTTC AGTGTCTCTA
GTAGAGACCA GCACAAGCGA TGATCTTGAA GAATGGTCTG CATGGCAGGC TGTGGGAACC
AGCGGTGAAC TTCAATCGCC AAATCGGCAA TATATAAGGT TCCGTGTTAC CCTTACCAGC
AGCGATCCGT TGAAGACGCC AAAACTTCTG GAAATACAGC TTCATGATAT ACCGAAAGCG
CCCTATGAGA AATTAGGCTT TGCCCGTCCT GTGATTTTGG ACAAAAACGG AGCATGGGAA
GCTGTTCTTG AAAATGCCTT TGATATCATT GTCACTGGTG AGGTGAACGG CGCGGATACG
CTGGAATTCA AGCTTCCGTT CCATGATCCA AAAAGAAGCA CACTGGAAAA TGAAAAACAA
GTGCAAATCG TAAATGACAT TTACCGGATC CGAACCTTAA CGGACAATAA AGGCGAAGAT
GGGCGTGTTA TCACGCAAGT ATATGCTGAA GCGGTATTTT ACGATCTGTC TTTCAGTGCG
GAAAAAGAAC CTAGAGAATT CAATGCAGAT ACTGCAGATG TTCCGATGCA ATATGCACTT
TTGGGTACAG GCTGGACAGT AGGAAATGTT ACTGTCACTA CGAAACGGAC ATGGCAGTGT
ACAGAAAAAA ATGCCTTATC CATCCTTCGC ACCGTACAGA ATATTTATGG CGGCGATCTG
GTGTTTGACA GCGCCAACCG CCAGGTACAC CTTTTGACTT TTAGTGGTAC TGATAGCGGA
GCGCTTTTTT CATATAGAAA GAATTTGAAA AGTATTCAGC GGGTAGTCGA TACACGTGAA
TTAGTGACAA AGCTCTATGC TTATGGAAAG GACGGATTGA CCTTCGCTTC AATTAATGGA
GGTAAGGAAT ACGTGGAAGA TTACACTTTT TCCAGTGAAG TGAGGGTGTC GACGCTTGAT
TGTTCGTCGT TTACAAATCC GTATCAGATG CTGGAATATG CAAAAATGCG GCTTGCAGAA
TATTCGAAGC CTCGCGTCTC TTATGTGCTG TCGGCAATGG ATTTATCTGC GCTAACCGGT
TATGAGCACG AAGCATGGAA ACTGGGTGAT ATTGTTACAG TGGACGATAA AGAACTAGGC
CTTTTGGTAA AGACTCGTGT TGTGAGAAGG CAGTATAACT TGCAGGAACC ATGGAAAACA
GTGATTGAGC TTTCAACTAA ACTGCGGGAA CTTGGCGATT CTTCAGCACA GTGGGACAAG
GCAGCGGATG CGCTGTCCTC AGCAGAGTTG ATAAACCGTC AGGAAATTAA AGATATGGTA
CCATTCAACC ATCTGCGCAA TTCCAGAGCG GATGATGGTT TTGCCTACTG GGTCAATTCC
GGTTTTGAAG TGGATACTGA AAATGGTGTT TCGGGAACTG CTTCCTTCAA GGCTGTCGGT
GTACCTGGTA TGAAAAAGAG CCTTTCACAG ACGGTATATC CAGCAACGCG TAAAAGCTAC
ACTTTTTCAG CACAAATTGC TTCCGAAAGC CTCGAAAAGG GTGAAAACGG CCAAGTTGGT
GTTGAGATAG TCATTGAATA CGAGGACGGT ACAACAGAAA CAAGATTTAT AGACCTGATT
TGA
 
Protein sequence
MAIKSILTSQ EDFTGEFPVT SRTSALWRFN EKTPDENLLL MDSSGHGRHF TISGWSGTSA 
NLIAGRFGRY FRQNIVNPTS EKTHLIAEND GSFFSNLGEK IVVGGWINPT TYSVGQTYIS
IFNTRQGPGQ PILYVSLYQG RLRLMLYNSS GTLIYDQSET ATITLKNGGW YFIASIIEVN
NKKVQNIICD RSDGATWVSP VRSFSGELNR ECIADIIMGM HANTYYYAGG FDDWFLETDS
QLTADDLLLY FKSSLQANGG DAASDVDALA EPGAVTLKAT DGEYPASGVL YTRAVPCALS
GSGRVAVTSE YTAGVTSVSL VETSTSDDLE EWSAWQAVGT SGELQSPNRQ YIRFRVTLTS
SDPLKTPKLL EIQLHDIPKA PYEKLGFARP VILDKNGAWE AVLENAFDII VTGEVNGADT
LEFKLPFHDP KRSTLENEKQ VQIVNDIYRI RTLTDNKGED GRVITQVYAE AVFYDLSFSA
EKEPREFNAD TADVPMQYAL LGTGWTVGNV TVTTKRTWQC TEKNALSILR TVQNIYGGDL
VFDSANRQVH LLTFSGTDSG ALFSYRKNLK SIQRVVDTRE LVTKLYAYGK DGLTFASING
GKEYVEDYTF SSEVRVSTLD CSSFTNPYQM LEYAKMRLAE YSKPRVSYVL SAMDLSALTG
YEHEAWKLGD IVTVDDKELG LLVKTRVVRR QYNLQEPWKT VIELSTKLRE LGDSSAQWDK
AADALSSAEL INRQEIKDMV PFNHLRNSRA DDGFAYWVNS GFEVDTENGV SGTASFKAVG
VPGMKKSLSQ TVYPATRKSY TFSAQIASES LEKGENGQVG VEIVIEYEDG TTETRFIDLI
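
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that relationship, not the database's own pipeline: the `translate` helper is hypothetical, and the codon-to-residue mapping is the one table 11 shares with the standard code (table 11 differs only in which codons may act as start codons, which does not matter here because this gene begins with ATG).

```python
from itertools import product

BASES = "TCAG"
# Residues listed in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above, with its block spacing left in place.
first_line = "ATGGCGATAA AATCAATTCT AACGAGCCAA GAGGATTTTA CCGGTGAGTT TCCTGTAACA"
print(translate(first_line))  # MAIKSILTSQEDFTGEFPVT
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same function would be the way to check the remaining residues.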