Gene Cthe_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1722 
Symbol 
ID4808897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2043035 
End bp2045584 
Gene Length2550 bp 
Protein Length849 aa 
Translation table11 
GC content35% 
IMG OID640107135 
Productphage terminase 
Protein accessionYP_001038136 
Protein GI125974226 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID[TIGR01443] intein C-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.879362 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTATGATG AAGCAAAAGC ACAGCATGCC GTAAACTTTA TTAACTGCTT AAAGCATACA 
AAGGGTCAGT GGCGTGGTGT TCCTTTTGAT CTTCTGCCTT GGCAGGATAA AATTATAAGG
GATATATTCG GAACAGTAAA AGAAAATGGA TACAGGCAGT ATAATACTGC TTATGTTGAA
ATTCCTAAGA AAAATGGAAA ACAGTTAGCC CTTGATACTC CGATTCCAAC ACCTGATGGA
TGGACTACAA TGGGGGAAAT AAAAGCAGGA GATAAGGTAA TTGATGAAAA GGGAAGACCT
TGTAATGTTG TTGCAATAAG TGAAATTGAT GATACGGAGC AGGCATATAA AATAAATTTT
AGAGATGGAA CAAGTATAGT AGCTGGAGAA AGGCATCTAT GGAAGGTTCA AGTTACTAAT
AATGGCAGAA GAGAAAAACT ATTAACAACA GGAGAAATGT ATCAAAAGCA GTTTAAAACT
AAAAGTAAAG AAAATAGAGC ATTATTTCGC ATCCCAATAG CGGATGCTTT TATTTTGCCT
GAAAATAAAC TTCCTATAGA TCCGTATCTA TTTGGGTACT GGATAGGAAA TGGTAATGCT
GTAAAGCCTG AAATAACTGT AATGAGAGAT GATGTTGACG AAGTTATTAA AAATATACCA
TATAAACTTC ATAATAGATA TAAGCAGGAG GGTAACAGCG ATATTTTAGT ATATAAAGAA
CTTAAAAGTA TATTAGTTAA AAACTTTAGG GAAAAAAGGA TACCTATTGA ATATTTAAGA
GCATCAGCTC AGCAAAGAAA AAGATTATTA CAAGGGTTAA TAGATTCTGA TGGATGTGTA
AGCACTGCTA AAAGCCAGGC AATATATGTG ACAATTCTTT TTGAACTTGC CAAGGATGTT
CAGGATTTAT TATGGTCATT GGGAATAAAG AATACGTTAA AAACAGCTCC ATCAGCTAGA
TATGGAATTG AAACAGGTGA AATATGTTAT TTAATAAAGT TTACTGCTTT TAATGACTTA
GAAGTATCAG GATTAGATAG AAAGCTTAAA AGAGGCAGAG AAAGAAATAT TAAAACAAGA
TCACATTTTC ATTATATAAA GTCTATTGAA AAAACAGGAA AGACAAAAAT GAGATGTATT
CAGGTTGACA GCCCATCAAG ATTATATTTA GCAGGTAAAT CCATGATTCC TACACATAAT
AGCGAGCTTG CTGCTGCAGT TGCTCTTTAT ATGACCTGCG GAGATGGAGA ATGGGGAGCT
GAAGTTTACG GCTGTGCTGC AGACAGACAA CAGGCTTCTA TCGTTTTTGA TGTAGCTGTT
GAAATGGTAG AACAGTGTCC TGCTCTTAAG AAAAGAATTA AACCTGTTCT TTCTGTAAAA
AGATTGATAT ATAAGCCTAC AAACAGCTTT TATCAGGTAT TATCTGCTGA AGCTTATTCA
AAACATGGAC TTAATGTTCA TGGAGTTGTA ATGGATGAAC TTCATGCTCA GCCTAACAGG
GATTTATATG ATGTTATGAC TAAAGGAAGT GGTGATGCAA GATTGCAGCC GCTGTTTTTT
CTTATAACCA CAGCCGGAAC AGATAGAAAT TCTATATGCT ATGAAGTACA TCAAAAGGCA
GTAGATATAT TAGAAGGAAG AAAAATCGAT CCAACATTTT ATCCTGTTAT TTATGGAATA
GATGACAATG ACGATTGGAC ATTAGAGAAA AACTGGTATA AAGCAAACCC TTCTCTTGGG
CATACCATAG ATATAGAAAA AGTGAGAAAT GCCTTTAACA GTGCAAAAGA AAATCCTGCT
GAAGAAAATA TATTCCGTCA GCTTAGATTA AATCAATGGG TGAAGCAGTC CACAAGATGG
ATGCAGATGG ACAAGTGGGA TGAGTGTGCT TTTAAAGTTG ATATAGATAG TTTAAAAGGA
AGAGAGTGTT ATGGGGGACT TGACCTTTCA AGTACCACAG ATATCACAGC CTTTGTTTTA
GTATTTCCTC CAAGAACATC AGATGAAAAA TATATTGTTC TTCCTCACTT TTGGATACCA
GAGGATAATT TAAATTTAAG AGTAAGACGA GATCATGTAC CTTATGATAT TTGGAAAAAG
CAGGGATACT TAAAAACTAC TGAAGGAAAT GTAGTTCATT ATGGCTATAT AGAAACCTTT
ATTGAAGAGC TTGGGAAAAA ATACAACATA AAAGAAATTG CCTTTGACAG ATGGGGTGCT
GTGCAGATGG TACAGAACCT GGAGGGAATG GGTTTTACAG TTGTACCTTT TGGGCAGGGG
TATAAGGATA TGTCTCCTCC TACAAAGGAG CTTATGAAAA TTACTCTTGA AAAGAAAATA
GCCCATGGAG GACATCCTGT TTTAAGGTGG ATGATGGATA ATATTTATGT AAAAACTGAT
CCTGCAGGCA ATATAAAGCC TGATAAAGAA AAGTCTACTG AAAAGATAGA TGGTGCTGTA
GCACTTATTA TGGCACTTGA TAGATCCATA AGACATGAAA ATAAAGAAAG TGTCTATGAA
AAAAGAGGAA TGAGAAGTTT TCTTGATTAG
 
Protein sequence
MYDEAKAQHA VNFINCLKHT KGQWRGVPFD LLPWQDKIIR DIFGTVKENG YRQYNTAYVE 
IPKKNGKQLA LDTPIPTPDG WTTMGEIKAG DKVIDEKGRP CNVVAISEID DTEQAYKINF
RDGTSIVAGE RHLWKVQVTN NGRREKLLTT GEMYQKQFKT KSKENRALFR IPIADAFILP
ENKLPIDPYL FGYWIGNGNA VKPEITVMRD DVDEVIKNIP YKLHNRYKQE GNSDILVYKE
LKSILVKNFR EKRIPIEYLR ASAQQRKRLL QGLIDSDGCV STAKSQAIYV TILFELAKDV
QDLLWSLGIK NTLKTAPSAR YGIETGEICY LIKFTAFNDL EVSGLDRKLK RGRERNIKTR
SHFHYIKSIE KTGKTKMRCI QVDSPSRLYL AGKSMIPTHN SELAAAVALY MTCGDGEWGA
EVYGCAADRQ QASIVFDVAV EMVEQCPALK KRIKPVLSVK RLIYKPTNSF YQVLSAEAYS
KHGLNVHGVV MDELHAQPNR DLYDVMTKGS GDARLQPLFF LITTAGTDRN SICYEVHQKA
VDILEGRKID PTFYPVIYGI DDNDDWTLEK NWYKANPSLG HTIDIEKVRN AFNSAKENPA
EENIFRQLRL NQWVKQSTRW MQMDKWDECA FKVDIDSLKG RECYGGLDLS STTDITAFVL
VFPPRTSDEK YIVLPHFWIP EDNLNLRVRR DHVPYDIWKK QGYLKTTEGN VVHYGYIETF
IEELGKKYNI KEIAFDRWGA VQMVQNLEGM GFTVVPFGQG YKDMSPPTKE LMKITLEKKI
AHGGHPVLRW MMDNIYVKTD PAGNIKPDKE KSTEKIDGAV ALIMALDRSI RHENKESVYE
KRGMRSFLD