Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1722 |
Symbol | |
ID | 4808897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2043035 |
End bp | 2045584 |
Gene Length | 2550 bp |
Protein Length | 849 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640107135 |
Product | phage terminase |
Protein accession | YP_001038136 |
Protein GI | 125974226 |
COG category | [R] General function prediction only |
COG ID | [COG4626] Phage terminase-like protein, large subunit |
TIGRFAM ID | [TIGR01443] intein C-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.879362 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTATGATG AAGCAAAAGC ACAGCATGCC GTAAACTTTA TTAACTGCTT AAAGCATACA AAGGGTCAGT GGCGTGGTGT TCCTTTTGAT CTTCTGCCTT GGCAGGATAA AATTATAAGG GATATATTCG GAACAGTAAA AGAAAATGGA TACAGGCAGT ATAATACTGC TTATGTTGAA ATTCCTAAGA AAAATGGAAA ACAGTTAGCC CTTGATACTC CGATTCCAAC ACCTGATGGA TGGACTACAA TGGGGGAAAT AAAAGCAGGA GATAAGGTAA TTGATGAAAA GGGAAGACCT TGTAATGTTG TTGCAATAAG TGAAATTGAT GATACGGAGC AGGCATATAA AATAAATTTT AGAGATGGAA CAAGTATAGT AGCTGGAGAA AGGCATCTAT GGAAGGTTCA AGTTACTAAT AATGGCAGAA GAGAAAAACT ATTAACAACA GGAGAAATGT ATCAAAAGCA GTTTAAAACT AAAAGTAAAG AAAATAGAGC ATTATTTCGC ATCCCAATAG CGGATGCTTT TATTTTGCCT GAAAATAAAC TTCCTATAGA TCCGTATCTA TTTGGGTACT GGATAGGAAA TGGTAATGCT GTAAAGCCTG AAATAACTGT AATGAGAGAT GATGTTGACG AAGTTATTAA AAATATACCA TATAAACTTC ATAATAGATA TAAGCAGGAG GGTAACAGCG ATATTTTAGT ATATAAAGAA CTTAAAAGTA TATTAGTTAA AAACTTTAGG GAAAAAAGGA TACCTATTGA ATATTTAAGA GCATCAGCTC AGCAAAGAAA AAGATTATTA CAAGGGTTAA TAGATTCTGA TGGATGTGTA AGCACTGCTA AAAGCCAGGC AATATATGTG ACAATTCTTT TTGAACTTGC CAAGGATGTT CAGGATTTAT TATGGTCATT GGGAATAAAG AATACGTTAA AAACAGCTCC ATCAGCTAGA TATGGAATTG AAACAGGTGA AATATGTTAT TTAATAAAGT TTACTGCTTT TAATGACTTA GAAGTATCAG GATTAGATAG AAAGCTTAAA AGAGGCAGAG AAAGAAATAT TAAAACAAGA TCACATTTTC ATTATATAAA GTCTATTGAA AAAACAGGAA AGACAAAAAT GAGATGTATT CAGGTTGACA GCCCATCAAG ATTATATTTA GCAGGTAAAT CCATGATTCC TACACATAAT AGCGAGCTTG CTGCTGCAGT TGCTCTTTAT ATGACCTGCG GAGATGGAGA ATGGGGAGCT GAAGTTTACG GCTGTGCTGC AGACAGACAA CAGGCTTCTA TCGTTTTTGA TGTAGCTGTT GAAATGGTAG AACAGTGTCC TGCTCTTAAG AAAAGAATTA AACCTGTTCT TTCTGTAAAA AGATTGATAT ATAAGCCTAC AAACAGCTTT TATCAGGTAT TATCTGCTGA AGCTTATTCA AAACATGGAC TTAATGTTCA TGGAGTTGTA ATGGATGAAC TTCATGCTCA GCCTAACAGG GATTTATATG ATGTTATGAC TAAAGGAAGT GGTGATGCAA GATTGCAGCC GCTGTTTTTT CTTATAACCA CAGCCGGAAC AGATAGAAAT TCTATATGCT ATGAAGTACA TCAAAAGGCA GTAGATATAT TAGAAGGAAG AAAAATCGAT CCAACATTTT ATCCTGTTAT TTATGGAATA GATGACAATG ACGATTGGAC ATTAGAGAAA AACTGGTATA AAGCAAACCC TTCTCTTGGG CATACCATAG ATATAGAAAA AGTGAGAAAT GCCTTTAACA GTGCAAAAGA AAATCCTGCT GAAGAAAATA TATTCCGTCA GCTTAGATTA AATCAATGGG TGAAGCAGTC CACAAGATGG ATGCAGATGG ACAAGTGGGA TGAGTGTGCT TTTAAAGTTG ATATAGATAG TTTAAAAGGA AGAGAGTGTT ATGGGGGACT TGACCTTTCA AGTACCACAG ATATCACAGC CTTTGTTTTA GTATTTCCTC CAAGAACATC AGATGAAAAA TATATTGTTC TTCCTCACTT TTGGATACCA GAGGATAATT TAAATTTAAG AGTAAGACGA GATCATGTAC CTTATGATAT TTGGAAAAAG CAGGGATACT TAAAAACTAC TGAAGGAAAT GTAGTTCATT ATGGCTATAT AGAAACCTTT ATTGAAGAGC TTGGGAAAAA ATACAACATA AAAGAAATTG CCTTTGACAG ATGGGGTGCT GTGCAGATGG TACAGAACCT GGAGGGAATG GGTTTTACAG TTGTACCTTT TGGGCAGGGG TATAAGGATA TGTCTCCTCC TACAAAGGAG CTTATGAAAA TTACTCTTGA AAAGAAAATA GCCCATGGAG GACATCCTGT TTTAAGGTGG ATGATGGATA ATATTTATGT AAAAACTGAT CCTGCAGGCA ATATAAAGCC TGATAAAGAA AAGTCTACTG AAAAGATAGA TGGTGCTGTA GCACTTATTA TGGCACTTGA TAGATCCATA AGACATGAAA ATAAAGAAAG TGTCTATGAA AAAAGAGGAA TGAGAAGTTT TCTTGATTAG
|
Protein sequence | MYDEAKAQHA VNFINCLKHT KGQWRGVPFD LLPWQDKIIR DIFGTVKENG YRQYNTAYVE IPKKNGKQLA LDTPIPTPDG WTTMGEIKAG DKVIDEKGRP CNVVAISEID DTEQAYKINF RDGTSIVAGE RHLWKVQVTN NGRREKLLTT GEMYQKQFKT KSKENRALFR IPIADAFILP ENKLPIDPYL FGYWIGNGNA VKPEITVMRD DVDEVIKNIP YKLHNRYKQE GNSDILVYKE LKSILVKNFR EKRIPIEYLR ASAQQRKRLL QGLIDSDGCV STAKSQAIYV TILFELAKDV QDLLWSLGIK NTLKTAPSAR YGIETGEICY LIKFTAFNDL EVSGLDRKLK RGRERNIKTR SHFHYIKSIE KTGKTKMRCI QVDSPSRLYL AGKSMIPTHN SELAAAVALY MTCGDGEWGA EVYGCAADRQ QASIVFDVAV EMVEQCPALK KRIKPVLSVK RLIYKPTNSF YQVLSAEAYS KHGLNVHGVV MDELHAQPNR DLYDVMTKGS GDARLQPLFF LITTAGTDRN SICYEVHQKA VDILEGRKID PTFYPVIYGI DDNDDWTLEK NWYKANPSLG HTIDIEKVRN AFNSAKENPA EENIFRQLRL NQWVKQSTRW MQMDKWDECA FKVDIDSLKG RECYGGLDLS STTDITAFVL VFPPRTSDEK YIVLPHFWIP EDNLNLRVRR DHVPYDIWKK QGYLKTTEGN VVHYGYIETF IEELGKKYNI KEIAFDRWGA VQMVQNLEGM GFTVVPFGQG YKDMSPPTKE LMKITLEKKI AHGGHPVLRW MMDNIYVKTD PAGNIKPDKE KSTEKIDGAV ALIMALDRSI RHENKESVYE KRGMRSFLD
|
| |