Gene Cthe_1628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1628 
Symbol 
ID4809323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1954690 
End bp1955892 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content48% 
IMG OID640107044 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001038045 
Protein GI125974135 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAA TACTGGAACT GCGTGAAAAA CGCGCGAAAG TAGGGGAAGC TGCTAAAGCT 
TTCCTCGACA GCAAACGCGG GAACGACGGA CTGCTTTCAC CGGAGGATAC CGCAACTTAT
GAAAAAATGG AAGCCGACGT TATTGCGCTG GGCAAAGAAA TAGAGCGTCT TGAGCGTCAG
GCTGCCATAG ATTTGGAACT GTCAAAACCG TTGAATATTC CTATTACAGA CAAACCCACT
TCCATATCTG GCAACAATGA AAAAACCGGA CGTGCCAGCG ATGAGTACAG GCAGTCTTTC
TGGAACATGA TGCGCGGCAG GCGCAAATAT GACGTACACA ACGCGCTGCA GATTGGAGAG
GACACCGAAG GTGGATATCT TGTTCCCGAC GACTTTGAGC GTACTCTTGT GGAAGCACTG
GAGGAGGAGA ATATCTTTAG GCAGATTGCC AATGTTATTA CCACGTCCAG CGGTGACAAG
AAAATTCCTG TGGTGGCAAG CAAGGGTACT GCATCCTGGG TGGATGAGGA AGGCCAGATT
CCCGAAAGCG ATGACTCCTT TGCACAGGTA TCCATCGGCG CATATAAGCT GGCTACTATG
ATCAAGGTGT CAGAGGAATT GTTAAACGAC AGTGTATTTA ACCTTGAACA GTATATAGCC
AAAGAATTCG CCCGCCGAAT CGGAGCAAAA GAGGAGGAAG CATTTTTTAT CGGCGACGGA
TCTGGCAAGC CAACCGGTAT CTTGGCGGAT AACGGCGGTG GCGAGATAGG AGTAACCGCG
GCGAGTGCAA CAGCCATTAC CCTTGACGAG ATCATGGACT TGTTCTACAG CCTAAAGTCT
CCGTACCGCA GGAACGCTGT ATTCATTATG AATGATTCGA CAATTAAAGC TATAAGGAAG
CTCAAAGACA ACAACGGTCA GTATCTCTGG CAGCCTTCTG TAACTGCTGG AACACCGGAT
ACTATCCTCA ATCGTCCAGT TAAAACTTCT GCATTTATGC CAGCCATTGC CGCCGGAGCA
AAAACGATTG TATTCGGCGA TTTTTCTTAT TACTGGGTGG CAGACCGTCA AGGCAGGGTT
TTTAAGCGGC TTAATGAGTT GTATGCTGCG ACCGGACAAG TTGGATTCAT GGCAACCCAG
CGTGTAGATG GCAAGCTGGT ACTGTCTGAA GCAGTCAAGA TACTGCAGCA GAAATCAACT
TAA
 
Protein sequence
MSKILELREK RAKVGEAAKA FLDSKRGNDG LLSPEDTATY EKMEADVIAL GKEIERLERQ 
AAIDLELSKP LNIPITDKPT SISGNNEKTG RASDEYRQSF WNMMRGRRKY DVHNALQIGE
DTEGGYLVPD DFERTLVEAL EEENIFRQIA NVITTSSGDK KIPVVASKGT ASWVDEEGQI
PESDDSFAQV SIGAYKLATM IKVSEELLND SVFNLEQYIA KEFARRIGAK EEEAFFIGDG
SGKPTGILAD NGGGEIGVTA ASATAITLDE IMDLFYSLKS PYRRNAVFIM NDSTIKAIRK
LKDNNGQYLW QPSVTAGTPD TILNRPVKTS AFMPAIAAGA KTIVFGDFSY YWVADRQGRV
FKRLNELYAA TGQVGFMATQ RVDGKLVLSE AVKILQQKST