Gene Information |
Locus tag | Cthe_1777 |
Symbol | |
ID | 4810022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2098540 |
End bp | 2099742 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640107191 |
Product | amidohydrolase |
Protein accession | YP_001038191 |
Protein GI | 125974281 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
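The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the record: Start bp 2098540, End bp 2099742.
length = gene_length_bp(2098540, 2099742)
print(length, protein_length_aa(length))  # prints: 1203 400
```

Both results agree with the Gene Length (1203 bp) and Protein Length (400 aa) fields of the record.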
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000349054 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCTGA TAAGAAACGG TAAAATTTTA ACCATGGCCG GAGTAAATTA TGAAAACGGA TATATTTTGA TTGATGCGGG GAAAATAGTT GAAGTGGGAG AATATCCTGC CGCATTTAAT CAGGAAGTTT TGAATTCCGG TGATTTGGAA GTTATTGATG CAAAGAATAA ATACATACTT CCTGGGCTTA TTGATGCGCA CTGTCATGTG GGAATGTGGG AAGATTCCGT TGGCTTTGAA GGGGATGACG GCAATGAAGC AACAGATCCT GTTACTCCTC ATCTTAGGGC TATAGATGCG GTGTATTATT TGGACCGAGC ATTTGAGGAG GCGCGGGAAA ACGGAGTTAC CACCGTGGTT ACAGGGCCGG GGAGCGCCAA TGTGATAGGT GGACAGTTTG TTGCCTTGAA AACTTACGGA AGACGAATAG AGGAAATGGT GGTAAAAGAC CCTGTAGCCA TGAAAGTGGC CTTTGGAGAA AACCCAAAGA CAGTGTACAA TGAAAGAAAA ACGGCGCCTA CAACCCGTAT GGCCACTGCG GCCATTCTCA GGGAAAACCT GATGAAAGCC AAAGAGTACA AGGAATTGAT GGATGAGTAC AACAAAAATC CGGAAGAAAA CGACAAACCG GAATATGATA TGAAAATGGA AGCTCTGCTG AAGGTTTTAA ACAGGGAAAT TCCGATAAAA GCACATGCGC ACAGGGCGGA TGACATCCTT ACCGCCATAA GGATAGCAAA GGAATTTGGG CTAAGGCTTA CAATAGAGCA TTGCACCGAA GGCCATCTTA TAAAGGACAT TCTTGCAGAG GAAGGAGTTT CGGCAATTGT GGGGTCGTCA CTTACCGACA GGTCAAAAGT GGAGCTTCGG AACCTCAGTT TGAAAACACC TGGAATTTTG GCGAAGGCGG GAGTCAAGGT GGCCATAATG ACGGACCATC CATGTACTCC GATACAGTAT TTGATACTGT GTGCGGCTAT GGCGGTAAGA GAGGGTATGG ACGAAATGGA GGCCCTCAGG GCAGTTACCA TAAATGCCGC CGAACTTACA GGAATAGCCG ACCGGGTGGG AAGCATAGAA GTGGGGAAGG ATGCGGACAT TGCCATCTAT GACGGTCATC CCTTTGACAT AAGGTCTAAA GTTTCCACAA CCATTATTAA CGGAAAGGTT GTTTACGAGA GGAAGAAACA TGAAAGAGAT TAG
|
Protein sequence | MLLIRNGKIL TMAGVNYENG YILIDAGKIV EVGEYPAAFN QEVLNSGDLE VIDAKNKYIL PGLIDAHCHV GMWEDSVGFE GDDGNEATDP VTPHLRAIDA VYYLDRAFEE ARENGVTTVV TGPGSANVIG GQFVALKTYG RRIEEMVVKD PVAMKVAFGE NPKTVYNERK TAPTTRMATA AILRENLMKA KEYKELMDEY NKNPEENDKP EYDMKMEALL KVLNREIPIK AHAHRADDIL TAIRIAKEFG LRLTIEHCTE GHLIKDILAE EGVSAIVGSS LTDRSKVELR NLSLKTPGIL AKAGVKVAIM TDHPCTPIQY LILCAAMAVR EGMDEMEALR AVTINAAELT GIADRVGSIE VGKDADIAIY DGHPFDIRSK VSTTIINGKV VYERKKHERD
|
| |
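The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one. Translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons are permitted; this sketch does not handle alternative start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGCTGCTGA TAAGAAACGG TAAAATTTTA"
print(translate(first_blocks))  # prints: MLLIRNGKIL
```

The ten residues produced match the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.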