Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2073 |
Symbol | |
ID | 4810671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2466588 |
End bp | 2467769 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640107480 |
Product | amidohydrolase |
Protein accession | YP_001038473 |
Protein GI | 125974563 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGTACTT TGGAAATAAA AGAAAAATGT TCTGAAATAA TGGATGAGGT CATCCGCATA AGAAGGGACA TTCACAAAAA TCCTGAACTG GGCTTTAATG AATACAGGAC ATCCTCCATC GCATCGGATT TTATGAAAAA CCTCGGTTTC AGTGTCCGCA CAAACGTAGC CAAAACAGGT GTTGTCGGCG TCCTTGAAGG TGAAAGACCC GGTAAGACAA TTGCAATAAG AGCCGACATG GATGCCATCC CCATAGCCGA GGAAAACGAT TTTGAATATG CATCCCAAAA TAAAAATGTC ATGCATGCCT GCGGGCACGA TGCCCACATC GCCATAGCGC TGGGAACTGC AAAGATACTT TATCATTTTA AAGACAGAAT ATCCGGCAAT GTCAAATTTA TTTTCCAGCC TGCGGAGGAA GGGCTGGGAG GAGCCTCTTT TATGATTGAA GAAGGGGCGT TGGACAATCC CGCAACCGAT GCCATAATCG CCCTTCATGT CTCCCCGCTT TTAAAGTCGG GTCAAATTTC AGTCGGCGCA GGACCGGTAA TGGCTTCGCC CGCCGAGTTC GACATAGTCA TAAAAGGCAG GGGTGGTCAT GCGGCCCAGC CCAACAAATG CGTTAATCCA ATATCCATAG GGGCAAATAT TATAAACATG TTTTCATCCA TTATTCCAAA AACCCTGAGT CCTTTTAAAA GCGCCGTTCT GTCGGTTACA TGCTTTGAAG CGGGCAACAC CTACAACGTT ATTCCCTCAC AGGCTGTCAT CAAAGGCACC GTCAGGGCTT TCGACCGGGA AACCCACAAT GTAATATACA ATAAAATGTA TTCTGTAATC GCCTCATTAA CGTCGGCGGA GGGAGCGGAC TTCTCTTTTG ACTACAACCT CGGCTATCCT CCTGTCGTAA ACAATGCAGA AATTGCAAAG CTTGTTGCAA ATGCCGCGAA AAAAATTGTA GGGGACGACA ACGTAGTGGA AAATCCGGAG CCTTCCATGC TTGCGGAAGA TTTTTCCTAC TACGCTTTAA AAATCCCGGG GGCAATTTTC AACTTAGGCT GCAGACACCC TCACGATGAA AATTTTTACA ACCTTCACTC CTCCAAATTC AACCTTGACG AAAGCTGCAT AATCACAGGA ATACAGATAT TATCCCAGTG CGTACTGGAT TTTCTGGGAT AA
|
Protein sequence | MCTLEIKEKC SEIMDEVIRI RRDIHKNPEL GFNEYRTSSI ASDFMKNLGF SVRTNVAKTG VVGVLEGERP GKTIAIRADM DAIPIAEEND FEYASQNKNV MHACGHDAHI AIALGTAKIL YHFKDRISGN VKFIFQPAEE GLGGASFMIE EGALDNPATD AIIALHVSPL LKSGQISVGA GPVMASPAEF DIVIKGRGGH AAQPNKCVNP ISIGANIINM FSSIIPKTLS PFKSAVLSVT CFEAGNTYNV IPSQAVIKGT VRAFDRETHN VIYNKMYSVI ASLTSAEGAD FSFDYNLGYP PVVNNAEIAK LVANAAKKIV GDDNVVENPE PSMLAEDFSY YALKIPGAIF NLGCRHPHDE NFYNLHSSKF NLDESCIITG IQILSQCVLD FLG
|
| |