Gene Information |
Locus tag | Cthe_0269 |
Symbol | |
ID | 4808552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 331571 |
End bp | 333004 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105681 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036701 |
Protein GI | 125972791 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
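
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. Both assumptions are consistent with the listed values, but the record itself does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(331571, 333004)
print(length)                  # 1434
print(protein_length(length))  # 477
```

Both results agree with the Gene Length (1434 bp) and Protein Length (477 aa) fields.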
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.275199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAACG TAAAAAAAAG AGTAGGTGTG GTTTTGCTGA TTCTTGCAGT GTTGGGGGTT TATATGTTGG CAATGCCGGC AAACACTGTG TCAGCGGCAG GTGTGCCTTT TAACACAAAA TACCCCTATG GTCCTACTTC TATTGCCGAT AATCAGTCGG AAGTAACTGC AATGCTCAAA GCAGAATGGG AAGACTGGAA GAGCAAGAGA ATTACCTCGA ACGGTGCAGG AGGATACAAG AGAGTACAGC GTGATGCTTC CACCAATTAT GATACGGTAT CCGAAGGTAT GGGATACGGA CTTCTTTTGG CGGTTTGCTT TAACGAACAG GCTTTGTTTG ACGATTTATA CCGTTACGTA AAATCTCATT TCAATGGAAA CGGACTTATG CACTGGCACA TTGATGCCAA CAACAATGTT ACAAGTCATG ACGGCGGCGA CGGTGCGGCA ACCGATGCTG ATGAGGATAT TGCACTTGCG CTCATATTTG CGGACAAGTT ATGGGGTTCT TCCGGTGCAA TAAACTACGG GCAGGAAGCA AGGACATTGA TAAACAATCT TTACAACCAT TGTGTAGAGC ATGGATCCTA TGTATTAAAG CCCGGTGACA GATGGGGAGG TTCATCAGTA ACAAACCCGT CATATTTTGC GCCTGCATGG TACAAAGTGT ATGCTCAATA TACAGGAGAC ACAAGATGGA ATCAAGTGGC GGACAAGTGT TACCAAATTG TTGAAGAAGT TAAGAAATAC AACAACGGAA CCGGCCTTGT TCCTGACTGG TGTACTGCAA GCGGAACTCC GGCAAGCGGT CAGAGTTACG ACTACAAATA TGATGCTACA CGTTACGGCT GGAGAACTGC CGTGGACTAT TCATGGTTTG GTGACCAGAG AGCAAAGGCA AACTGCGATA TGCTGACCAA ATTCTTTGCC AGAGACGGGG CAAAAGGAAT CGTTGACGGA TACACAATTC AAGGTTCAAA AATTAGCAAC AATCACAACG CATCATTTAT AGGACCTGTT GCGGCAGCAA GTATGACAGG TTACGATTTG AACTTTGCAA AGGAACTTTA TAGGGAGACT GTTGCTGTAA AGGACAGTGA ATATTACGGA TATTACGGAA ACAGCTTGAG ACTGCTCACT TTGTTGTACA TAACAGGAAA CTTCCCGAAT CCTTTGAGTG ACCTTTCCGG CCAACCGACA CCACCGTCGA ATCCGACACC TTCATTGCCT CCTCAGGTTG TTTACGGTGA TGTAAATGGC GACGGTAATG TTAACTCCAC TGATTTGACT ATGTTAAAAA GATATCTGCT GAAGAGTGTT ACCAATATAA ACAGAGAGGC TGCAGACGTT AATCGTGACG GTGCGATTAA CTCCTCTGAC ATGACTATAT TAAAGAGATA TCTGATAAAG AGCATACCCC ACCTACCTTA TTAG
Protein sequence | MKNVKKRVGV VLLILAVLGV YMLAMPANTV SAAGVPFNTK YPYGPTSIAD NQSEVTAMLK AEWEDWKSKR ITSNGAGGYK RVQRDASTNY DTVSEGMGYG LLLAVCFNEQ ALFDDLYRYV KSHFNGNGLM HWHIDANNNV TSHDGGDGAA TDADEDIALA LIFADKLWGS SGAINYGQEA RTLINNLYNH CVEHGSYVLK PGDRWGGSSV TNPSYFAPAW YKVYAQYTGD TRWNQVADKC YQIVEEVKKY NNGTGLVPDW CTASGTPASG QSYDYKYDAT RYGWRTAVDY SWFGDQRAKA NCDMLTKFFA RDGAKGIVDG YTIQGSKISN NHNASFIGPV AAASMTGYDL NFAKELYRET VAVKDSEYYG YYGNSLRLLT LLYITGNFPN PLSDLSGQPT PPSNPTPSLP PQVVYGDVNG DGNVNSTDLT MLKRYLLKSV TNINREAADV NRDGAINSSD MTILKRYLIK SIPHLPY
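
The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11: GTG is an accepted alternative start codon there and is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the database's own pipeline. It builds the standard codon table (table 11 uses the same codon-to-residue assignments), and as a simplification it assumes the first codon is a valid start and always emits M for it. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Standard codon-to-residue assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met (assumed valid start)."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping spaces
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":                   # stop codon ends translation
            break
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAAGAACG TAAAAAAAAG AGTAGGTGTG"))  # MKNVKKRVGV
```

The output matches the first ten residues of the listed protein sequence.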