Gene Information |
Locus tag | Cthe_2475 |
Symbol | |
ID | 4809855 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2946915 |
End bp | 2948348 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640107890 |
Product | SPP1 family phage portal protein |
Protein accession | YP_001038870 |
Protein GI | 125974960 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01538] phage portal protein, SPP1 family |
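
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2946915, 2948348)
print(length_bp)                  # 1434
print(protein_length(length_bp))  # 477
```

Both values match the Gene Length and Protein Length rows of this record.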
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGCTGCTTA ATCTTTTTAA TTTCAGGAAC TTTAAAGACT TATTCCGCAA TGATATAAAC ATGATGACTG TAGAAGAAAT TTTATATAAC GAAATCAAAG AGTTCCAGGC ATCCGATAGA AGGGCCTGGA TGGTTATTGG CGATAGATAT TACCGGTGCG AAAATGACAT CCTTAACAGG CGTATAGTAC GCCATACAGA GAGCGGAGAC ATTGAAGATA GGTCAAAAGC AAACAATAGG TTGGCCCATG GTTTTGTTAA AAACCTTGTG GATGAAAAAA TAGGATATCT GCTTACAAAG GATTATTCGC TGAAGTGCGA CAATAAAGAA TATATTGAGA AAGTTAAAAA CGTCTTGGGT AAATATTTTC AATACACCCT TACCAGGCTC GGATATGAAG CGTCGAATAA AGGCATAGCA TGGTTACAAG TTTACATAAA TGAGCAGGGC AAATTTGGAA TGATGATAAT TCCTGCTGAA CAGTGCGTTC CACTCTGGAA AGATAACACT CACACTGAAC TTTATGGCAT GATTAGATAT TATGTGCAGA CAGTTTATGA AGGCAAGGAA AAGAAGCAGA TCACTCGCGT GGAATATTAC ACGGATAAAG AGGTTTATTT TTATGTTCTC GATAATGACC ATCTTATCCC GGATATAGAG CAATATGAAG GAGGGCCCAT ACTACACTAT AAAAAAGGGG AAGAAGGCCG AAGTTGGGGG AAAGTGCCTT TTATTGCCTG GAAGAATAAC CATCTTGAAT ATCCGGATGT TAAATTCATT AAATCGCTTG TGGACGCTTA CGATAAGTCA CGGAGTGAAA TAGATAATTT CATTGAAGAA ACAAAAAATC TTATCTATGT TTTAAAAGGC TATGGCGGAG AAAATTTATC TGATTTCATG AAAGACCTTA ATTACTACCG GGCTATAAAA ATAGATGATC CAGAGCATGG TGGAGTTGAT ACACTAACAC CGAAAATAGA TATTCAGGCA GCAAAGGAAC ATTTCGAACA ATTAAAGCGG GATATAAATG AGTTTGGCCA AGGTGTGCCC AAGGACCTTG ACAAATATGG CAATTCTCCC AGTGGGACAG CATTGAAGTT TTTATATAGT GGGCTGGATT TAAAATGCAA CCACTTGGAA GTAGAATTTA GACAGTCATT TAATCAGCTT TTGTATTTTG TAAACAGATA TCTCGCAGAA AACGGTCAGG GAAATTATGA GAATGAAAAT GTAGAGCTAA TTTTCAATAG AGATATACAG ATTAATGAAA CTGAAACTAT CAATAATTGT GTTAACAGTA AAGGCATTAT TAGCGATGAG ACTATCCTTG CAAATCATCC ATGGGTGTCT GATGTAGAAG AAGAATTAAA GCAGATTGAG AAAGAAAGAA AATCAGAGGA ACCGCCAATG TTTGGTGAGG GGGATGAAGA GTGA
Protein sequence | MLLNLFNFRN FKDLFRNDIN MMTVEEILYN EIKEFQASDR RAWMVIGDRY YRCENDILNR RIVRHTESGD IEDRSKANNR LAHGFVKNLV DEKIGYLLTK DYSLKCDNKE YIEKVKNVLG KYFQYTLTRL GYEASNKGIA WLQVYINEQG KFGMMIIPAE QCVPLWKDNT HTELYGMIRY YVQTVYEGKE KKQITRVEYY TDKEVYFYVL DNDHLIPDIE QYEGGPILHY KKGEEGRSWG KVPFIAWKNN HLEYPDVKFI KSLVDAYDKS RSEIDNFIEE TKNLIYVLKG YGGENLSDFM KDLNYYRAIK IDDPEHGGVD TLTPKIDIQA AKEHFEQLKR DINEFGQGVP KDLDKYGNSP SGTALKFLYS GLDLKCNHLE VEFRQSFNQL LYFVNRYLAE NGQGNYENEN VELIFNRDIQ INETETINNC VNSKGIISDE TILANHPWVS DVEEELKQIE KERKSEEPPM FGEGDEE
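
The protein sequence can be derived from the gene sequence using translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch, not the database's own pipeline. It assumes that the spaces in the record are formatting only, that the first codon is read as the initiator methionine (TTG is an alternative start codon in table 11), and that translation stops at the first stop codon. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Amino acid assignments in TCAG codon order. Table 11 shares these with the
# standard code; it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS, ignoring whitespace used for layout."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        # Assumption: the first codon is an initiator and is read as Met.
        residues[0] = "M"
    # Stop at the first stop codon ('*'), which is not part of the protein.
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCTGCTTA ATCTTTTTAA TTTCAGGAAC"))  # MLLNLFNFRN
```

The printed residues match the first ten residues of the protein sequence listed in this record.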