Gene Information |
Locus tag | Cthe_1709 |
Symbol | |
ID | 4808884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2031877 |
End bp | 2034156 |
Gene Length | 2280 bp |
Protein Length | 759 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640107122 |
Product | Phage-related protein-like protein |
Protein accession | YP_001038123 |
Protein GI | 125974213 |
COG category | [S] Function unknown |
COG ID | [COG5412] Phage-related protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
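
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 2031877
end_bp = 2034156


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 2280 759
```

Under these assumptions the result agrees with the Gene Length (2280 bp) and Protein Length (759 aa) fields.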
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAAGAG ATGCAAATAC CGTAGTTGCA AGGGTAGGAC TTGATGATAG AGGTTTTCAA GAAGGTGTAG CAAAAATTCA AAGAAGTCTA AAGGTTGTTC AAAGTGAATT TGCAGCAGCT TCTTCTAAGC TTGGTGATTT TGGCAAATCT GAAGAAGGAC TAAGACTTAA ATCAGATACC TTAAATAAAC AGATAGAACT TCAGAAGGAT AAAGTTGCGG CATTAGAAAA AGCATATCAA AAGAGTGTAG AAACAAAGGG TGAAGATGCA AAGGCTACTG AAAATCTTAA AATTAAGCTT AATTATGCTA CAGCAGAACT AAATAAAATG GAGAATGAGC TGAAAGAGGC AACAAGAGAA CTTAAGGAAA AAAGCTCGGC TTGGTATAAG CTGTCTGAAA GCATGAATAG TGCAGGAGAA AAGATGAAAT CTGTAGGAGA TAAGATGTCT TCTATAGGAA GTAAGCTTTC TACTGCTGTA ACACTTCCTT TAGTTGGAAT AGGAACTGCT GCAACAAAAA TGGCTATGGA TGCAGTGGAA TCTGAAAATC TCTTTGAAGT AGCTATGGGT TCAATGGCAG GCGATGCAAG AAAGTGGTCA GAAGAAACCT CAAAAGCTCT AGGACTCAAT GCTTTCAATG TAAGAAAAAA TGTAGCAACT TATAATGCCA TGCTTACCTC TATGGGGTTA ACTTCACAAG AGTCATTAAA GATGTCAGAA GGATTAACTC AGCTTTCCTA TGATATGGCT TCTTTCTATA ACTTAAAACC AGAAGAGGCA TTTGAGAAAT TAAAATCTGG TATTAGTGGA GAGGCAGAAC CACTTAAAGC TTTAGGTATA TTAGTTAATG ATAATACAAT TAAAACCTAT GCTTATTCTC ATGGAATTGC AAAGCAGGGT GAACAGCTTA CTGAAGCACA AAAGGTTCAA GCAAGGTATG GTGCTATAAT GGAAGCTACA AAAAATGCTC AAGGTGACCT TGCAAGAACT ATGGATTCAC CAACCAATAA GCTTAGAGTT ATGAAAGAGC AAACACAGCA GCTTGGCATT CAGTTTGGAC AACTTTTAAT TCCTATACTT GAAAAACTAA TGAACACTAT AAAACCTCTT TTAGATAAGT TCCAAGGGCT ATCAAAGGAA CAGCAAGAAA CAATTATTAA AATCGGATTA GTAGTTGCAG CAATAGGTCC AGTAATCATG ATTATAGGTA AGGTAATAAG TATTGCAGGA ACTCTTTCTA CTGTAATTGG AACAGTGAGT GGAGCAATGG CAGCAGCAGG TGGTGCATCT GGAGCCTTAG GAGCTGCTTT TGCAGCAATA ACTGGTCCAG TTGGCATTGC AGTAGCGGCT ATTACAGGTC TTATTGCTAT TTTTGTAGCC TTATACAAAA ATAATGAGGA CTTTAGGAAT TCAGTAAATA CAGTATGGAA TGGAGTTAAA GCTTTAATAA GTGGTGTCAT TGAAAGCTTA AAGGCTATGT TTCAAGCCTT TATTACCTTA GCAAATCAAA TATGGAAAAA GTATGGTGAT GATTTTGTAA AGATAATAAC AACTGCTTTT AATTTAGTAG CAACTATTGT AAATACCACA CTTAAAGCCA TTCAAGATGT TATAAAAATA GTTACCAGTG CAATAAAAGG TGATTGGAAG GGTGTATGGG AAGGAATAAA AAATCTTACC TCTGACTTAT GGAATGGAAT AAAGAATGTG ATAAAATCAG CCATTGATTT AGTTAAAGGA ACTATAAAAA CAGAATTTGA ATTTATCAAA GGCATAATCT TAGGAATATG GAATGGCATT 
AAAGGAATAA CTTCAGCAGT TTGGAATGAG ATAAAATCAG CTATTGAAAA TCCAATAAAT GCAGCAAAGA ATGCTGTAGG TAATGCTATA AATGCAATTA AAGGATTTTT CAGCAATCTA CATTTACCAG AAATAAAAAT ACCTAAAATA AAACTTCCTC ATTTTAGTAT TGAGGGAGAG TTTAGTTTGA AACCTCCAAG TGTACCTTAC CTAGGTGTAG ATTGGTATGC GAAGGGTGGT ATATTTAATA GACCTAGTAT AATCGGTGTC GGTGAAGCAG GAACTGAAGC TGTACTTCCT ATAGATAGGT TAGATGAGCT TATGGCAAGG GCAATTGAAA AAGCAAAAGG AGGAAGTGGA AGCGGATTAA CACTTCATAT AGAAAATTTC ATTAATAATT CAGATAAGGA TATAGAGCAG CTTGCCTATG AGCTTGAATT TTACAGGCAG AGAGTTTCAA TGGGAAGGGG TGGTGCTTAA
Protein sequence | MARDANTVVA RVGLDDRGFQ EGVAKIQRSL KVVQSEFAAA SSKLGDFGKS EEGLRLKSDT LNKQIELQKD KVAALEKAYQ KSVETKGEDA KATENLKIKL NYATAELNKM ENELKEATRE LKEKSSAWYK LSESMNSAGE KMKSVGDKMS SIGSKLSTAV TLPLVGIGTA ATKMAMDAVE SENLFEVAMG SMAGDARKWS EETSKALGLN AFNVRKNVAT YNAMLTSMGL TSQESLKMSE GLTQLSYDMA SFYNLKPEEA FEKLKSGISG EAEPLKALGI LVNDNTIKTY AYSHGIAKQG EQLTEAQKVQ ARYGAIMEAT KNAQGDLART MDSPTNKLRV MKEQTQQLGI QFGQLLIPIL EKLMNTIKPL LDKFQGLSKE QQETIIKIGL VVAAIGPVIM IIGKVISIAG TLSTVIGTVS GAMAAAGGAS GALGAAFAAI TGPVGIAVAA ITGLIAIFVA LYKNNEDFRN SVNTVWNGVK ALISGVIESL KAMFQAFITL ANQIWKKYGD DFVKIITTAF NLVATIVNTT LKAIQDVIKI VTSAIKGDWK GVWEGIKNLT SDLWNGIKNV IKSAIDLVKG TIKTEFEFIK GIILGIWNGI KGITSAVWNE IKSAIENPIN AAKNAVGNAI NAIKGFFSNL HLPEIKIPKI KLPHFSIEGE FSLKPPSVPY LGVDWYAKGG IFNRPSIIGV GEAGTEAVLP IDRLDELMAR AIEKAKGGSG SGLTLHIENF INNSDKDIEQ LAYELEFYRQ RVSMGRGGA
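
The gene and protein sequences above can be checked against each other by translating the gene sequence. This is a minimal sketch, not the database's own method. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in which codons may act as starts), and it assumes the gene sequence is already given in coding orientation, since it begins with ATG even though the gene lies on the minus strand. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in directly.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence above.
print(translate("ATGGCAAGAG ATGCAAATAC CGTAGTTGCA"))  # MARDANTVVA
print(translate("AGAGTTTCAA TGGGAAGGGG TGGTGCTTAA"))  # RVSMGRGGA
```

The two outputs match the first ten and last nine residues of the protein sequence listed above. The second fragment starts at base 2251, which is in frame because 2250 is a multiple of 3.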