Gene Information |
Locus tag | Cthe_2474 |
Symbol | |
ID | 4809854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2945643 |
End bp | 2946899 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107889 |
Product | PBSX family phage terminase large subunit |
Protein accession | YP_001038869 |
Protein GI | 125974959 |
COG category | [R] General function prediction only |
COG ID | [COG1783] Phage terminase large subunit |
TIGRFAM ID | [TIGR01547] phage terminase, large subunit, PBSX family |
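
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the values listed but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2945643
end_bp = 2946899

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1257, matching "Gene Length"
print(gene_length_bp % 3)  # 0, so the gene is a whole number of codons
print(protein_length_aa)   # 418, matching "Protein Length"
```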
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGTGA TGACGCAAGT TAGACTTAGC GAATTAGTCG CACCGAGTTT CTACGAGATC CACAATGACA TAAAGCATAA TAGATATACT CATTACTGGC TTAAAGGTGG CCGTGGCTCA ACCAAATCCT CTTTCGTGAG CATTGAAATC ATCCTCGGCG TAATGAAGGA CCCTAACGCT AATGCAGTGG CCCTGAGAAA AGTTAAGGAG ACTATCAAAG ATAGCGTATT CGAGCAGTTA GTGTGGGCAA TTGAGAAGCT GAAAGTTACT GAATACTGGG AGATAAAGCA CAACCCTATG GAATTGACAT ATCTACCTAC GGGACAAAAA ATATTGTTCC GTGGCGCTGA TAAGCCAAGG AAGATTAAAT CCATCAAAGT AAGCCGGGGA TATGTAAAGT TTATCTGGTA TGAAGAAGTT GACGAATTCC TCGGAATGGA AGAAATCCGA ATCATTAATC AGTCCTTGAT GCGTGGCGGA GAGCAGTTTG TCGTCTTTTA TACTTACAAT CCTCCAAACA GGGTTAACGC TTGGGTGAAT GAAGAAATAC TGATTGATAG ACCGGACAGA AAGGTCCATC ATAGCACGTA TTTGACTGTT CCTCGAGATT GGCTTGGGGA ACAGTTTTTT ATTGAGGCAG AACATCTTAA AAAAGTTAAC GAGAAAGCGT ATAGGCACGA GTATTTAGGT GAAGTCACCG GCACAGGCGG CGAGGTATTT ACAAACGTGA AAGCAAGGAA GATAAATGAC GAGGAAATAA AAGCATTTGA CAGGATAAAA AGAGGACTGG ACTTTGGCTA TGCTGTTGAC CCGGCAGCTT ACATTGTGTG CCACTTTGAT AAAACAAGGC GGCGGCTTTA TATATTTCAC GAGATATTCC AGGTCGGCTT GAGCAATAGG AAATTGGCAG AGTTAATTAA GAAAGAAAAC AAAAGCAATA AGTTAGTGGT TGCGGACAGC GCGGAGCCAA AGTCAATAGC CGAATTGCGT GGTTATGGAA TCAACATAAG GGGAGCGAAA AAAGGACCGG ACAGCGTTGA ATATGGAATA AAGTTTTTGC AAGACCTTGA AGAGATAATA ATTGACCCTG AGCGATGTCC AAATACATTG CGAGAGTTCG TAAATTATGA ACTTGAGAAA GACAAAGACG GCAATTTTAA AGCTGAATTC CCGGATAAAA ACAACCACAC GATCGATGCT GTTAGGTATG CGCTTGAGGA TGATATGAGG ACGGGCGGCC TATCAATTTT AAAGTGA
|
Protein sequence | MIVMTQVRLS ELVAPSFYEI HNDIKHNRYT HYWLKGGRGS TKSSFVSIEI ILGVMKDPNA NAVALRKVKE TIKDSVFEQL VWAIEKLKVT EYWEIKHNPM ELTYLPTGQK ILFRGADKPR KIKSIKVSRG YVKFIWYEEV DEFLGMEEIR IINQSLMRGG EQFVVFYTYN PPNRVNAWVN EEILIDRPDR KVHHSTYLTV PRDWLGEQFF IEAEHLKKVN EKAYRHEYLG EVTGTGGEVF TNVKARKIND EEIKAFDRIK RGLDFGYAVD PAAYIVCHFD KTRRRLYIFH EIFQVGLSNR KLAELIKKEN KSNKLVVADS AEPKSIAELR GYGINIRGAK KGPDSVEYGI KFLQDLEEII IDPERCPNTL REFVNYELEK DKDGNFKAEF PDKNNHTIDA VRYALEDDMR TGGLSILK
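
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It assumes that the space-separated 10-character blocks are display formatting only. It uses the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_percent`, and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table built in the conventional TCAG order. Translation table 11
# uses these same amino acid assignments; alternative start codons are
# not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGATAGTGA TGACGCAAGT TAGACTTAGC"
print(translate(prefix))  # MIVMTQVRLS, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the result to be compared with the listed protein sequence and the 42% GC content field.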