Gene Cthe_1721 details

Gene Information

Locus tag: Cthe_1721
Symbol:
ID: 4808896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2041764
End bp: 2043023
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 35%
IMG OID: 640107134
Product: HK97 family phage portal protein
Protein accession: YP_001038135
Protein GI: 125974225
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
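
The start and end coordinates, gene length, and protein length in this record agree with one another under the usual 1-based inclusive coordinate convention. A minimal Python sketch of that arithmetic follows. It assumes inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2041764, 2043023)
print(length)                     # 1260
print(protein_length_aa(length))  # 419
```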


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAACTAA AAGATAGAGT GAAGTTATTT TTAACGCCCC AGAATGCCTT GTTTGAAGTC 
CTTCAGAAAT ATTCGGAGGA TTTTTTAAAT GGTGAAGAAG TTGTAAAAGA TAATTTTAAA
ATAGATACAG CAGCGGCTAT GAGTTTTTCT GCAGTTTTTG CCTGCAATAG AGTCCTTTCT
GAAACCTTGG CAAGCTGTCC TATAATGCTT TATGAAAAAG ATGATAAAGG AAACAGAAGG
CAGGTTACAG ATACTGCTGA ATATGGTGTA CTTCATTATG CACCAAATGC AGAAATGACA
CCAGTGCAGT TTAAAGAGTT TGGTATGACA AATATAAACC TTGGGGGAAA CTTTATAGCA
CAAAAGGTTT TTAATATGCA TGGAGAGCTT TTAGAACTTA GACCAATAGC ATGGGACAGA
GTAAGAATTG ATATAGATAA ATCTACAGGA AGGCTTCTTT ATTATATTGA TGGAAAGCAA
GAACCTAAAA CAAGAGATGA AATATTTCAT ATTCCGGGAC TCACTTTAGA CGGGTATATA
GGAATAACAC CTCTTAGTTA TGCGGCACTT ACTATTGATA TTGGATTATC TCAGGACACC
TTTGAAAGAA ATTTTTATCA TAACAGGGCT TCAACCAGCG GTATTTTTCA GTATCCTAAC
GAGCTTTCAG ATGAAGCATT TCAAAGGCTT AAAAAGGATA TTAAGAAGAA CTACACAGGA
CTTTCTAATG CAGGAGTTCC AATGATTCTT GAAGGCGGCG GTCAGTTTAA GGAAATAACC
ATGAAGCTTA CAGATGCACA GTTTTTAGAA TCCAAGAGAT TCAGAATTGA AGATGTGTGC
AGAATTTTCA GAGTACCACT TCATCTGGTG CAGGATTTAA CAAGATCCAC AAATAACAAT
ATTGAACATC AGAGCTTAGA GTTTATTGTT TACACTATGC TGCCGTGGTT TAAAAAATGG
GAAGAAAATT TAAATCTTCA GCTTTTATCA AAAGAATCAA GAAGAAAAAA CAGATATTTT
GAATTTAATA TCAGTGGACT ACTCCGTGGA GATATTAAAT CAAGATATGA AGCCTATGCA
CAAGGAAGAC AGTGGGGATG GCTTTCTGTT AATGATATTA GAAGGCTTGA AAATATGAAT
CCTATTGATA ACGGTGACAG ATATCTCGAA CCTCTCAATA TGAGCGAAGC AGGAAAACAG
GAAGAGCAGC TTAAAGCACT AAGGGAAGAA ATATTTAATA TGATTAATGA AAGGAAGTGA
 
Protein sequence
MKLKDRVKLF LTPQNALFEV LQKYSEDFLN GEEVVKDNFK IDTAAAMSFS AVFACNRVLS 
ETLASCPIML YEKDDKGNRR QVTDTAEYGV LHYAPNAEMT PVQFKEFGMT NINLGGNFIA
QKVFNMHGEL LELRPIAWDR VRIDIDKSTG RLLYYIDGKQ EPKTRDEIFH IPGLTLDGYI
GITPLSYAAL TIDIGLSQDT FERNFYHNRA STSGIFQYPN ELSDEAFQRL KKDIKKNYTG
LSNAGVPMIL EGGGQFKEIT MKLTDAQFLE SKRFRIEDVC RIFRVPLHLV QDLTRSTNNN
IEHQSLEFIV YTMLPWFKKW EENLNLQLLS KESRRKNRYF EFNISGLLRG DIKSRYEAYA
QGRQWGWLSV NDIRRLENMN PIDNGDRYLE PLNMSEAGKQ EEQLKALREE IFNMINERK
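
The protein sequence can be re-derived from the gene sequence as a consistency check. The sketch below is a hedged illustration, not the database's own pipeline. It assumes the standard sense-codon assignments (which translation table 11 shares) and treats an alternative initiator codon such as the TTG that opens this gene as methionine. The helper names `translate` and `gc_percent` are hypothetical, and the whitespace between 10-base blocks is stripped before processing.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiator codons assumed for translation table 11 (bacterial); each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("TTGAAACTAA AAGATAGAGT GAAGTTATTT"))  # MKLKDRVKLF
```

Applied to the first 30 bases, the result matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the reported GC content.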