Gene Information |
Locus tag | Cthe_1618 |
Symbol | |
ID | 4809313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1948913 |
End bp | 1951201 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640107034 |
Product | hypothetical protein |
Protein accession | YP_001038035 |
Protein GI | 125974125 |
COG category | [S] Function unknown |
COG ID | [COG5412] Phage-related protein |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
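The length fields above follow from the coordinates. The sketch below is a minimal check, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon, which is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1948913, 1951201)
print(length)                     # 2289
print(protein_length_aa(length))  # 762
```

Both values agree with the Gene Length and Protein Length rows of this record.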
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAGACA ATTTTGGCTT GAAGATTGGG ATTGAAGGCG AAAAGGAATT TAAGAACGCC ATTCGTGAGA TCAACCAAAG TTTTAAGGTA CTGGGCAGCG AGATGAACCT GGTTGCATCT CAGTTTGATA AGCAAGATAA ATCAGTTGAA GCTGTTACTG CAAGAAACAA GGTGCTGAAT AAAGAAATCG AATTGCAGAA AGAAAAAATT GCTACTTTGG AGAAAGCGCT TGCCAACGCC GCCTCCTCTT TCGGGGAGAC CGACAAGCGG ACGCAGTCCT GGCAGATACA GCTTAACAAC GCCAAAGCAG AGCTGAACAA AATGGAGCGC GAGCTGGAAG CGAACAACAA AGCGCTGGAC AATGCGGGAA AAGAGTTTGA CGAAGCGGAA AAACAGGCGG GCGAATTTGG CAGAGAAATT AAAAAGGCCG CGGATCAGGC GGATGACGCA GGCGGGCGCT TTGAAAAACT GGGAGGTGTA TTGAAAGGTA TCGGTGTGGC CATGGGAGCA GCCCTGGCCG CTATTGGTAC AGCAGCGGTC GGTGCGGGAA AAGCTCTTGT GGATATGTCG GTAAATTCGG CGGCCTATGC CGATGAAATC CTTACCGCCT CGACCGTAAC CGGCATGTCC ACCGACAGCC TGCAGGCGTA TAAATACGCC GCGGAGCTTG TGGATGTCTC CTTAGATACT TTAACCGGCA GCATGGCAAG GAACGTCAGA TCCATGTCTT CAGCACGGAA AGGCACCGGT GAGATCGCGG ACGCTTACCG GAAGCTCGGC GTTTCGGTCA CGGACGCCAA CGGCAACCTG CGCGACAGCG AAGCCGTATA CTGGGAAACC ATAGACGCAC TTGGCAAGGT GTCCAACGAA ACTGAGCGTG ACGCGCTGGC CATGCAGATT TTCGGAAAGT CCGCACAGGA ACTCAATCCC CTGATTGCGC AGGGTTCGGC AGGGATAGCG GAGCTGACCG AGGAAGCAAA GCGCATGGGC GCAGTGATGA GCGAGGATTC ATTGAACGCT CTCGGGAAAT TTGACGACAG CATCCAGCGG CTCAAGGCGG GCGGCGCGGC GGCCAAAAAC ATGCTGGGCA CCGTGCTGCT TCCCCAGCTT CAGATATTGG CCGACGACGG GGTTGTGCTT CTCGGGGAAT TTACTCGTGG ATTATCTGAA GCAAACGGAG ACTGGACGAA GATCAGCGGG GTCATCGGCA ATACGGTGGG AAGCCTTGTA AACATGCTGA TGGAAAACCT GCCGAAGCTT ATCCAGGTAG GATTGGATAT CGTCACCTCC ATCGGCGGGG CTATTGTGGA CAATCTGCCG GTTATTATCG ACGCGGCGGT GCGGATTGTC ATGACGCTGC TGCAGGCTTT AATCGATGCA CTGCCGCAGA TAACCGACGG TGCTTTGCAG CTTGTTATGG CGCTGGTGCA GGGAATTATT GACAACCTTC CCGCTTTGGT GGAAGCCGCG GTGCAAATGA TTGCCACACT GGCGTCCGGT ATCGGGGAGG CGCTTCCGGA GCTGATACCC GCTGTTGTCG AAGCCATTAT CCTCATTGCC GAGGTACTTC TTGACAATAT GGATAAAATT CTTGACGCAG CGTTTCAGAT CATACAGGGG TTGGCGCAGG GACTTTTAAA TGCATTGCCA GAACTAATTG AAGCACTGCC GAGGATAATT ACAACAATCA TTGACTTTGT GACGAACAAT ATGCCGAAGA TCATAGAATT GGGAATTACG CTTATCGTAC ATCTTGCTGC CGGGCTTGTG AAAGCCATTC CAGAACTAGT AAAGTCTTTA 
CCTCAGATTG TTGCGGCAAT TATTGAAGGT TTGGGCAAGG CGGTTGTTTC AGTAGTTGAG ATTGGTAAGA ACATTGTAAA AGGCATCTGG GAAGGTATTA AAAGCCTTGG TAGCTGGATT AAGGATAAGG TTTCCGGTTT CTTTTCCGGT ATTGTTGATG GAGTAAAGAA TTTTCTTGGA ATCAGATCTC CGTCCACTGT TTTTGAAGGC ATTGGCGGCA ATATGGCACT GGGTATTGGT GAGGGATTTG ACAAGGCTAT GGCCAGAGTG GCAGACGATA TGCAAAATGC AGTGCCGACA GATTTTAATA TATCTCCTGA TATTAATGTA AGTGGAAGAG GTGAATTTAG CGGTTTAGCT TCTGGGCCGC TTGTTGTGGT GCAGCAGATG ATTGTTCGTG GTGAAGAAGA CATACGTAGG ATTTCACAGG AGTTATATAA CCTGATGCAG ACAGGTTCAA GGGCGCAGGG ACGTTTTATA ACAGCGTAA
|
Protein sequence | MADNFGLKIG IEGEKEFKNA IREINQSFKV LGSEMNLVAS QFDKQDKSVE AVTARNKVLN KEIELQKEKI ATLEKALANA ASSFGETDKR TQSWQIQLNN AKAELNKMER ELEANNKALD NAGKEFDEAE KQAGEFGREI KKAADQADDA GGRFEKLGGV LKGIGVAMGA ALAAIGTAAV GAGKALVDMS VNSAAYADEI LTASTVTGMS TDSLQAYKYA AELVDVSLDT LTGSMARNVR SMSSARKGTG EIADAYRKLG VSVTDANGNL RDSEAVYWET IDALGKVSNE TERDALAMQI FGKSAQELNP LIAQGSAGIA ELTEEAKRMG AVMSEDSLNA LGKFDDSIQR LKAGGAAAKN MLGTVLLPQL QILADDGVVL LGEFTRGLSE ANGDWTKISG VIGNTVGSLV NMLMENLPKL IQVGLDIVTS IGGAIVDNLP VIIDAAVRIV MTLLQALIDA LPQITDGALQ LVMALVQGII DNLPALVEAA VQMIATLASG IGEALPELIP AVVEAIILIA EVLLDNMDKI LDAAFQIIQG LAQGLLNALP ELIEALPRII TTIIDFVTNN MPKIIELGIT LIVHLAAGLV KAIPELVKSL PQIVAAIIEG LGKAVVSVVE IGKNIVKGIW EGIKSLGSWI KDKVSGFFSG IVDGVKNFLG IRSPSTVFEG IGGNMALGIG EGFDKAMARV ADDMQNAVPT DFNISPDINV SGRGEFSGLA SGPLVVVQQM IVRGEEDIRR ISQELYNLMQ TGSRAQGRFI TA
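The sequence rows are printed in blocks of 10 characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to clean the gene sequence, compute its GC fraction, and translate it with translation table 11. It assumes the Gene sequence row is already in coding orientation, which appears to hold here because its first codons translate to the start of the Protein sequence row. The helper names are hypothetical. The codon table is written out by hand from the NCBI genetic code (table 11 uses the standard amino acid assignments with additional start codons), so no external library is needed.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiator codons permitted by translation table 11; each is read as Met at position 0.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons still encode Met
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the Gene sequence row above.
prefix = clean_sequence("GTGGCAGACA ATTTTGGCTT GAAGATTGGG")
print(translate_cds(prefix))  # MADNFGLKIG
```

The printed residues match the first block of the Protein sequence row. Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content row.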
|
| |