Gene Cthe_2474 details


Gene Information

Locus tag: Cthe_2474
Symbol:
ID: 4809854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2945643
End bp: 2946899
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 42%
IMG OID: 640107889
Product: PBSX family phage terminase large subunit
Protein accession: YP_001038869
Protein GI: 125974959
COG category: [R] General function prediction only
COG ID: [COG1783] Phage terminase large subunit
TIGRFAM ID: [TIGR01547] phage terminase, large subunit, PBSX family
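
The start, end, gene length and protein length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: one codon per residue, minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Cthe_2474.
gene_len = span_length(2945643, 2946899)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 1257 418
```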


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGTGA TGACGCAAGT TAGACTTAGC GAATTAGTCG CACCGAGTTT CTACGAGATC 
CACAATGACA TAAAGCATAA TAGATATACT CATTACTGGC TTAAAGGTGG CCGTGGCTCA
ACCAAATCCT CTTTCGTGAG CATTGAAATC ATCCTCGGCG TAATGAAGGA CCCTAACGCT
AATGCAGTGG CCCTGAGAAA AGTTAAGGAG ACTATCAAAG ATAGCGTATT CGAGCAGTTA
GTGTGGGCAA TTGAGAAGCT GAAAGTTACT GAATACTGGG AGATAAAGCA CAACCCTATG
GAATTGACAT ATCTACCTAC GGGACAAAAA ATATTGTTCC GTGGCGCTGA TAAGCCAAGG
AAGATTAAAT CCATCAAAGT AAGCCGGGGA TATGTAAAGT TTATCTGGTA TGAAGAAGTT
GACGAATTCC TCGGAATGGA AGAAATCCGA ATCATTAATC AGTCCTTGAT GCGTGGCGGA
GAGCAGTTTG TCGTCTTTTA TACTTACAAT CCTCCAAACA GGGTTAACGC TTGGGTGAAT
GAAGAAATAC TGATTGATAG ACCGGACAGA AAGGTCCATC ATAGCACGTA TTTGACTGTT
CCTCGAGATT GGCTTGGGGA ACAGTTTTTT ATTGAGGCAG AACATCTTAA AAAAGTTAAC
GAGAAAGCGT ATAGGCACGA GTATTTAGGT GAAGTCACCG GCACAGGCGG CGAGGTATTT
ACAAACGTGA AAGCAAGGAA GATAAATGAC GAGGAAATAA AAGCATTTGA CAGGATAAAA
AGAGGACTGG ACTTTGGCTA TGCTGTTGAC CCGGCAGCTT ACATTGTGTG CCACTTTGAT
AAAACAAGGC GGCGGCTTTA TATATTTCAC GAGATATTCC AGGTCGGCTT GAGCAATAGG
AAATTGGCAG AGTTAATTAA GAAAGAAAAC AAAAGCAATA AGTTAGTGGT TGCGGACAGC
GCGGAGCCAA AGTCAATAGC CGAATTGCGT GGTTATGGAA TCAACATAAG GGGAGCGAAA
AAAGGACCGG ACAGCGTTGA ATATGGAATA AAGTTTTTGC AAGACCTTGA AGAGATAATA
ATTGACCCTG AGCGATGTCC AAATACATTG CGAGAGTTCG TAAATTATGA ACTTGAGAAA
GACAAAGACG GCAATTTTAA AGCTGAATTC CCGGATAAAA ACAACCACAC GATCGATGCT
GTTAGGTATG CGCTTGAGGA TGATATGAGG ACGGGCGGCC TATCAATTTT AAAGTGA
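
The gene sequence above is printed in blocks of ten bases, so the spaces and line breaks have to be stripped before it can be measured. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names are hypothetical, and only the first line of the listing is used as input here. Running it on the full 1257 bp sequence is how the record's GC content figure could be checked, but the sketch does not assert that value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied verbatim (six blocks of ten bases).
first_line = "ATGATAGTGA TGACGCAAGT TAGACTTAGC GAATTAGTCG CACCGAGTTT CTACGAGATC "
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(gc_fraction("ATGATAGTGA"))  # 0.3 (three G, no C, out of ten bases)
```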
 
Protein sequence
MIVMTQVRLS ELVAPSFYEI HNDIKHNRYT HYWLKGGRGS TKSSFVSIEI ILGVMKDPNA 
NAVALRKVKE TIKDSVFEQL VWAIEKLKVT EYWEIKHNPM ELTYLPTGQK ILFRGADKPR
KIKSIKVSRG YVKFIWYEEV DEFLGMEEIR IINQSLMRGG EQFVVFYTYN PPNRVNAWVN
EEILIDRPDR KVHHSTYLTV PRDWLGEQFF IEAEHLKKVN EKAYRHEYLG EVTGTGGEVF
TNVKARKIND EEIKAFDRIK RGLDFGYAVD PAAYIVCHFD KTRRRLYIFH EIFQVGLSNR
KLAELIKKEN KSNKLVVADS AEPKSIAELR GYGINIRGAK KGPDSVEYGI KFLQDLEEII
IDPERCPNTL REFVNYELEK DKDGNFKAEF PDKNNHTIDA VRYALEDDMR TGGLSILK
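
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions, not the method the database used. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the standard codon assignments are used here. The gene starts with ATG, so no alternative-start handling is needed. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so blocked listings can be pasted directly.
    Codons containing unexpected characters are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence; the result matches the first
# block of the protein sequence listed in the record.
prefix = translate("ATGATAGTGA TGACGCAAGT TAGACTTAGC")
print(prefix)  # MIVMTQVRLS
```

Pasting the complete gene sequence into `translate` should give a product whose length can be compared with the Protein Length field.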