Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0451 |
Symbol | |
ID | 4808379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 565307 |
End bp | 567178 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640105865 |
Product | hypothetical protein |
Protein accession | YP_001036882 |
Protein GI | 125972972 |
COG category | [S] Function unknown |
COG ID | [COG2604] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATGAAG TATTGGCTAA AAATCTGTCC CTTTTAAAAG AGTATCAGCC TGAAACATAC CTGAAGCTTG ACAGATACAT AAAGGGAGAG TATGTTCCAA AGGACAATTC AGTGGAAAAA ATACTTTTGG CCCGTCAGGA CGATCTCATA ATAAATATTT TGGTAAAATG CTCTGACAAA GATTTTCTGC TTTGTGACCA TGAAAACCCG ATTAATGAAG CTTATGCCTG GATTGACAAA TATATTGACC CTTCCAACAA GGCGGACATT GTCTTTGGAA TGGGATTGGC ATTTCACCTT GAAGTTCTTC TTACAAGTTT TCCTAACAAA AAAGTAATTG TAATTGAACC CAATATAAAC TTGTTTTATC AGATTGCCTG TATAAGAAAT CTTGAGCCGG TGATTAAAAA GGCTGAAATA ATTGTGGATG AAGACTTGGA TGTTATACTT GAGAGAATAA ATTCCTTGTT CTGGGATACG GAAAAGGGCG GGATTCAGGT ACAGCCTCTT GAGGTGTATG GTGAAATGTT TCCCGAAATG TGGGACAAGC TTCGGGACAG TTTCATAAAG CTTGCAAACA ATTTCACTGT CGATATTGCA ACCAGAAGGA AATTTGGAGA GCTGTGGGTG CACAATAATA TAAAAAATCT CAACAAAATT TGTGAAGCCT CCAATGCCGG TGTTCTGGTC GGAAAATTCA AGGGCATTCC CGGTATATTG GTATCGGCCG GGCCTTCCCT TGAAAAAAAC ATCCACCTTT TAAAAGGTCT TGAGGATAAG TGTGTGATTA TGGCAGCGGG AACGGCAGTA CGAATTATGG AGGATTTCGG TCTGGCACCG CATTTTATGG TGGGAATTGA CGCGGGAGCT AAAGAAGGGG AAATACATTC CAACGTAAAA AACAAAGATA TATATTTTAT TTATTCAAAC CAGGTTTCAA CATATTCTGT GGATGGCTAC AAAGGCCCCA AATTTGTTAT GAATTATCCT ATTGACATGT ATACGGCAGG CTTTTTTGAG TATGCCGGTA TCAAGTCGGA TTTTTTCCTA AGCGGGCCCT CGGTTGCCAA TACCTGCTTT GACATCTTGT TTAAGATGGG CTGCGACCCG ATTATAATTA TCGGTCAGGA CATGGCGTTT ACATACGGAA GCATGTATGC AGGTGAGGTG CCGGGAACGG TTGTAGACGG TGCCGGGGAA GCAAAAAGAA GGGGATATGT TCTTGCAAAA GATATTTACG GAAATGAAGT TTATACCACC CGGCCATTTC TTGCAATGAG AAACTGGTTT GAAGGATATT TTGAAAAAGT GCGGGACAAA ACGACAATAA TAAATGCCAC CGAAGGAGGA CTGAATATTT CGTATGCCAG AAATGAAACT CTTGAGGCCG CACTAAAGAG CTGTAATTTA TCAGAATCAG GCATTAAAGA TCATATCAGG TCGCTGCATG AGGAAGGAAA ATTTGCGGAT ACGGTAGCTT CAAAAATCGA GGAATACAGA GCATACGTTC AAAGGGAGAT AAGACGGCTG GAACAGCTTT CCAAAAGGCA GCTTGAGACA GCAGAGGATT TAAAGAGAGA TGTTTATCAT CCGTCAAAAA GCCGCTCAAG GTTTATTAAG GCTGTCAATT CAATAAATGA AATGTCGGAC AGGGTTCTTC AATCTCCGAT ATACAATTCC CTTCTTAAAA ATCTTGTTGA AATTGACTTT TATATTATAA AAGCGGAAGT GGACAGGTTA GTAAAGATAT TAACAAAATA TGACGATATA AAGAATGTAT ATGTGAATGC CATATTGAGT CAGAATCAAA AACTCAACGC AAGCCTTGGC AAAATCAAGA AATTCTTTGA CGAATCGGAT GTTACTGCTT AA
|
Protein sequence | MNEVLAKNLS LLKEYQPETY LKLDRYIKGE YVPKDNSVEK ILLARQDDLI INILVKCSDK DFLLCDHENP INEAYAWIDK YIDPSNKADI VFGMGLAFHL EVLLTSFPNK KVIVIEPNIN LFYQIACIRN LEPVIKKAEI IVDEDLDVIL ERINSLFWDT EKGGIQVQPL EVYGEMFPEM WDKLRDSFIK LANNFTVDIA TRRKFGELWV HNNIKNLNKI CEASNAGVLV GKFKGIPGIL VSAGPSLEKN IHLLKGLEDK CVIMAAGTAV RIMEDFGLAP HFMVGIDAGA KEGEIHSNVK NKDIYFIYSN QVSTYSVDGY KGPKFVMNYP IDMYTAGFFE YAGIKSDFFL SGPSVANTCF DILFKMGCDP IIIIGQDMAF TYGSMYAGEV PGTVVDGAGE AKRRGYVLAK DIYGNEVYTT RPFLAMRNWF EGYFEKVRDK TTIINATEGG LNISYARNET LEAALKSCNL SESGIKDHIR SLHEEGKFAD TVASKIEEYR AYVQREIRRL EQLSKRQLET AEDLKRDVYH PSKSRSRFIK AVNSINEMSD RVLQSPIYNS LLKNLVEIDF YIIKAEVDRL VKILTKYDDI KNVYVNAILS QNQKLNASLG KIKKFFDESD VTA
|
| |