Gene Information |
Locus tag | Cthe_0795 |
Symbol | |
ID | 4810413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 959624 |
End bp | 961351 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106212 |
Product | alpha amylase, catalytic region |
Protein accession | YP_001037223 |
Protein GI | 125973313 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
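
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 959624
end_bp = 961351
reported_gene_length_bp = 1728
reported_protein_length_aa = 575

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length_aa = codons - 1

print(gene_length_bp == reported_gene_length_bp)        # gene length check
print(remainder == 0)                                   # whole codons only
print(protein_length_aa == reported_protein_length_aa)  # protein length check
```

Because the gene lies on the minus strand, the start and end coordinates describe its position on the replicon, not the direction of transcription.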
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTTG AGGCAATATA TCACAAACCT TACAGCGAGT TTGCATTTCC TGTTGCTCCC GACACTCTGG TAATACGGCT CAGGACGGCT AAAAACGATG TAAACACCTG CATTCTCATC TATCATGAAA AGTATGATAC GTCACAAAGA GGAAAAGTGA AAATGGATAA AGTTGCAAGC GATGGAATGT TTGACTATTA CGAGGTGGAG CTTAATGTCG GTATAAAGAG AATTAAGTAT ATGTTTTATC TTGAGGATAA TTATTCAATA AAATGGTACA GCAGCGACGG ATTTTTTGAT TATATGCCTC AGTGGGGACA TTTTACCTAT TCTTATATAT GCAAAGACGA CATATTTCAT GAGGTTGAAT GGTTTCGTAA TTCAACAATA TATCAGATAT TTCCCGACAG ATTTGCAAAG TTTCCGCCGG ATACCGAAAA TTCGGGCAAA AGAACGATAC ATGGGGGCAA TATTAAAGGG ATAATCGACC GGTTTGATCA TTTAGTCAAA CTTGGAGTTG ATGTGGTTTA TTTAAATCCC ATATTCAAAT CGGAATCTTA TCATCGCTAT GATGTTGTCG ATTATTATGA AATTGACCCG ATGTTTGGAA GTAAAGAAGA GCTCAGGGAA TTGATGGACT TGTGTCACAA AAACGGTATA AAAGTAATAT TTGACGGAGT TTTTAATCAC TCCGGAGACA AGTTTTTTGC ATTCAGAGAT GTTGTGGAAA AAGGAGAAAA ATCAAAGTAT GCAAACTGGT ATTTTATAAA TTCTTTTCCC GTTCAGGGAT ATCCAAGACC CAATTATGAA TGCTTTTCCT TTTATGGAGG AATGCCGAAG CTTAACACCG GAAATCCTGA GACGGCAAAA TACTTTCTTG ATGTGGTAAA GTATTGGACA GTGGAGTTTG GGGTGGACGG ATGGAGACTT GATGCGGCGG ATGAGGTGGA CCGAAAGTTT TGGAGAAAGC TTAGGGATAT GCTGAAAGAT TTGAACAAGG ATGTGGTGCT GATTGGCGAA ATATTTGACG AGGCGTCTTC GTGGCTTTGG GGAGACCAGT TTGACTCGGT GATAAATTAT CCGTTGAAAG CCATGATAAA TGACCTTTTT GCCTATAGGT CCATTGATGT GGAGACTTTC AGGAACAGAA TAAGCGGCTA TATTATGAAG TTTAACAAAA AGGTGCTAAG CAGCCTGGTA AATATAATAA GCACTCATGA CACGCCAAGG TTTTTGACTC TTTGCAACGG AGATGAAAAG AGGTTTGAGA TGGCAGTAGT GTTCCAGTTT ACCTTTCCTG GAGTTCCCCT CATATACTAC GGCGATGAAA TAGGGATGGA AGGCGAAGGA GACCCTGATT GCAGAAGACC GATGATATGG GACGAAGCGA AATGGAACAA AAAAACTCTC GAGTTGTATA AATTCCTGAT TGGCTTGAGG AAGAGGTTCG ATGCCTTGAG AACTGGAGAA TATGGAGAAC TTCCTGTAAC AGGATGCAAT GGGATACTGG CATACAGAAG GGGCCGGGGA GAAAACGGAA TTATTGTTGC CATGAATACA TTGGACCGAA AGGAAAATGT CGTAGTAGAA ACAGGAGATT CTTTTGACAC GGTAAAAGCT TTTGAGTCTT TGAAAGACGA AGAAAGACTG AATGTTGACA AAAAAAGGAT AAACATATGC TTAAATCCGT TTGAGTGGAG GATTTACAAA GCCTGCGGCG AATTATAA
|
Protein sequence | MKLEAIYHKP YSEFAFPVAP DTLVIRLRTA KNDVNTCILI YHEKYDTSQR GKVKMDKVAS DGMFDYYEVE LNVGIKRIKY MFYLEDNYSI KWYSSDGFFD YMPQWGHFTY SYICKDDIFH EVEWFRNSTI YQIFPDRFAK FPPDTENSGK RTIHGGNIKG IIDRFDHLVK LGVDVVYLNP IFKSESYHRY DVVDYYEIDP MFGSKEELRE LMDLCHKNGI KVIFDGVFNH SGDKFFAFRD VVEKGEKSKY ANWYFINSFP VQGYPRPNYE CFSFYGGMPK LNTGNPETAK YFLDVVKYWT VEFGVDGWRL DAADEVDRKF WRKLRDMLKD LNKDVVLIGE IFDEASSWLW GDQFDSVINY PLKAMINDLF AYRSIDVETF RNRISGYIMK FNKKVLSSLV NIISTHDTPR FLTLCNGDEK RFEMAVVFQF TFPGVPLIYY GDEIGMEGEG DPDCRRPMIW DEAKWNKKTL ELYKFLIGLR KRFDALRTGE YGELPVTGCN GILAYRRGRG ENGIIVAMNT LDRKENVVVE TGDSFDTVKA FESLKDEERL NVDKKRINIC LNPFEWRIYK ACGEL
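
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, checked for GC content, and translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons). Only the first three blocks of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Standard amino acid assignments in TCAG order; translation table 11
# uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGAAACTTG AGGCAATATA TCACAAACCT")
print(translate(fragment))  # MKLEAIYHKP, the first block of the protein sequence
print(round(gc_fraction(fragment), 3))
```

Applied to the full gene sequence, the same functions could be used to compare the result against the listed protein sequence and the reported GC content.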