Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1613 |
Symbol | |
ID | 4809308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1942344 |
End bp | 1944767 |
Gene Length | 2424 bp |
Protein Length | 807 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640107029 |
Product | glycosyl hydrolase-like protein |
Protein accession | YP_001038030 |
Protein GI | 125974120 |
COG category | [R] General function prediction only |
COG ID | [COG3858] Predicted glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.5027 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAATG CCCGAATGTA TGAAGCACTT AGAGACTACG GCGACCGCTT TGATACGGTA GGCATTTTTA CTTTTGAGGT TGACGAAACA GGCACAATCA CTGAAACCGG TACCAGCATC AGCAGCATGC TTCCGTATAT TCAGAAATGG CCGCACATTA AGTGGCTGCT CACTATTATG AATCATGGAA TAGCCAATAT TTTTACTGCA CTTCGCAACA ACGAAAACGG TGCAAAGGAT AAGTTTCTCA CTGAAATCAT CCGAATAATG AACAAGTATC CATGGTGCGC TGGGGTAGAT ATTGACCTGG AGCGCGGCGG CGGTTATGAA AACAAGGATG CGGCGAATGC ACTATTTAGG GATATATACA ATACAGTTAA GTCTTATGAT GCAACAAAGC TTGTCAACAT CTGCCTGCCG GGTATGATCG GTGTTCAAGG CTCGGTGGGC GGTGAAAACT GGTGTGTTTA TGCCGATCTT AACGACTATT GCGATACCGC CGCCATCATG AGCTACGGCA TGGCATGGGC GGGTTCTGCT CCCGGTCCGG TATCTCCCCG TGACTGGCTT GAGGGCATAT ATGATTATGC TGTTTCCGTT ATGTCGCCGG ACAAGATATT CATGGGTTTG CCTGCTTATG GCTGGAACTG GAGGATCCAT GATACGCCTG AAAACCTCGG AATAACCTAT CGAGGAGTGT CTAATACCTA CTATGCGGCT AAATACTGGA TGACTGGGGT TTACAATTTC ACAGGTGATG CACCGCCCCA GCCGTTTATT CCAATTGTGG CTTACTGGGA TGACTATAAC AAAGTACCTT GGGCTCTTCC TCATGTATAT GACTATATGG AAGGATGGGA TGCTGTATCC TGGGAATATC CGCTGCTAAA AGGGGTTTAC AACAGGCGAA GATATTTGAC AAGCTATGGC AAGGAGCAGA AAGCGGAGTT CGGAACCATT TATATTGACA GGAACGGAGT TCCGGATGAA TACGAAGGAA ATGTCATTAT TACTGATGAG ATGACCTCAC TGGGAGATGC CCAGGCGTCA GCAGAGTACC GTTTTGAGAT AAGAGAAGCG GGATATTACG ATATTGCAGT ACAGCTTTGC TTTCCTTACT GGGACAAAAA TGCGATTATT GTTTCCCTTG ATGGTATATC AAAGACTTTC AGCGAGAACC GTTTATGGTG GCCATACTGG AGAAGAGTTT GCTGGTTGAC ACTTGCAAAA GGTGTATTTC TCCAAGAGGG AACGCATGCT ATCAGCATAA GCGGTGGTGT GCCGGGAGTC CAGTTTTACG GTTTTAGGGT TTGCAGTGGA TTTTCGGAGT ATCCCTTTGC CGGTGAAGCC AGCTTTATGC TCTCTCCTCG TCGGTTCAAG GATGTAAATG GTGTGATGGT TGAGCCGGAT CGAGGTTTTA AACTGACCTT TGAAATGTTG CGAAGAAAAC CCGACTCGGC GCTTATTTGG TATGAGGATT TTCGAGACAG GAACATCCTG CCTGAAAACT ACTGGACTGT GCTGGATGGC GAATGGGATG TTTGGCAAGA CCCAGACAGC ACAGAAAACC GTCCATATTC CCAGCTCGAG GGATATGGCA AACTTGCATG GAAATACGAC GGGTTTTCCG ATATTCATAT CCGGGCAAGG CTGGCTTTCC CTCAAAATAG CAGTGGACGG GCTGGGGTGT TCCTTGGGGA TATTTTCTGC TGCTTAAATT ATGACACGCA AAGAGTCGAG CTTTATCAAG GTAATTCCTT GCTTGGTAGC TATTCCACCA GTTTCTCAAA AACTGTAGAT GCCGATCTTC GTGCTAATCC GAATATGTAT ACTATAGAGA TGCGAAAACG CGGCAATAAG GTAAGAGTAT ATTCAGGTGC AGCTTCAACC CTGCGCTTCA CAGTGAATGT AACCGGTGGT AGTGGTTATG TAGGTTACTG CTCGGACAAC CGGACGGTAT GCGAGCTGCT GCGACTGGGC GATGCATGGG TATATGAACC ATACGAGCGT TTTGATGTGG AACTTCCGGA TGGAAATATA ACCAGCTTTG GCAGGCTTGC TCGCACTGGT GTCACGTGGG ATGATGAATT TCAGGTGTTT TCAGTAAATA ACGATGTGGA GGAATCGATA ACTCGCAGTG AGGACATTTC GATGGACTAT GACTTCTTCC ACTCACAACT TTTGGCTCTT TCTTGCGGTA ATGACTATGA AGTAAAGATT ATACTGAAAG ACATCAATAT CTGGATATCC CGTCTCTTCC TCGGAGATGC AGATGGTTTT TCTATTCTGT ATTATCAGGA TGTGGACAGC CTTGTTTACT GGGCAAACGA AGCGGCTTAT CGATGGAAAC TGCGAGGTAT AGCCATCTGG TCTCTTGGGC AGGAGGATAT GCGGTTGTGG GAGGCGCTTC CGAAGCAAAT ATAG
|
Protein sequence | MGNARMYEAL RDYGDRFDTV GIFTFEVDET GTITETGTSI SSMLPYIQKW PHIKWLLTIM NHGIANIFTA LRNNENGAKD KFLTEIIRIM NKYPWCAGVD IDLERGGGYE NKDAANALFR DIYNTVKSYD ATKLVNICLP GMIGVQGSVG GENWCVYADL NDYCDTAAIM SYGMAWAGSA PGPVSPRDWL EGIYDYAVSV MSPDKIFMGL PAYGWNWRIH DTPENLGITY RGVSNTYYAA KYWMTGVYNF TGDAPPQPFI PIVAYWDDYN KVPWALPHVY DYMEGWDAVS WEYPLLKGVY NRRRYLTSYG KEQKAEFGTI YIDRNGVPDE YEGNVIITDE MTSLGDAQAS AEYRFEIREA GYYDIAVQLC FPYWDKNAII VSLDGISKTF SENRLWWPYW RRVCWLTLAK GVFLQEGTHA ISISGGVPGV QFYGFRVCSG FSEYPFAGEA SFMLSPRRFK DVNGVMVEPD RGFKLTFEML RRKPDSALIW YEDFRDRNIL PENYWTVLDG EWDVWQDPDS TENRPYSQLE GYGKLAWKYD GFSDIHIRAR LAFPQNSSGR AGVFLGDIFC CLNYDTQRVE LYQGNSLLGS YSTSFSKTVD ADLRANPNMY TIEMRKRGNK VRVYSGAAST LRFTVNVTGG SGYVGYCSDN RTVCELLRLG DAWVYEPYER FDVELPDGNI TSFGRLARTG VTWDDEFQVF SVNNDVEESI TRSEDISMDY DFFHSQLLAL SCGNDYEVKI ILKDINIWIS RLFLGDADGF SILYYQDVDS LVYWANEAAY RWKLRGIAIW SLGQEDMRLW EALPKQI
|
| |