Gene Information |
Locus tag | Cthe_3074 |
Symbol | |
ID | 4809948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3614597 |
End bp | 3615517 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108498 |
Product | Cof-like hydrolase |
Protein accession | YP_001039463 |
Protein GI | 125975553 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily; [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
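The length fields in the table above are consistent with the listed coordinates. The following is a minimal Python sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3614597, 3615517)
print(length, protein_length(length))  # 921 306
```

Under these assumptions the result matches the Gene Length (921 bp) and Protein Length (306 aa) rows.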
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAAAA AGTTTGAAGG TTTGATGCTT GTCAGTGATT TGGACGGGAC TTTGCTAAAC AGCAAGTTGG AAGTAAGTGA GGAAAATATA AAAGCCGTTT CATATTTTGT GGACAATGGA GGAATATTTA CCATTGCGAC GGGCAGGATG GAGCTTGGGA CAAGAAAATA TTTGGAGGTC TTGCCGGTAA ATGCGCCCGT GATTTTATAC AACGGTGCCC TTATATATGA TTTTAATAAA GAAGAAAGAA TTTGGACCTG TGGTTTCAGA CAGGATATAA GGAATTTGTT GAAAGAGCTT TTGGATAGGT TTCCTTATCT TGGGATTGAA ATATTCCCGG GAGGGGACAA TGTTTATTTG CTGCGGGAGA ATGAAGAGAC CGAAAAACAC AGTAAAAAAG AAGGATTCAA ACCTGTGGTG ATTTCTGTGG ATGAAATGCC CAAAAACTTT TACAAGATTA TACTTACAGC TAACCCGGAC AGGCTTCCCG AAGTGGAGGA ATTTTTAAAG CCTATGGCTC AAGGATTCAG GACGGTGTAT TCAGAAAAAC AGTTTTTAGA AATTCTTGAC AATGAAACTT CAAAAGGTAG GGCTTTGGCC GAGCTTGCAA AAATAATGGG AATTGAAAAA GATAATGTGA TTTCTGTTGG TGACAATCAA AATGATTTAG AAATGATAAA GGTGTCGGGA ACAGGGTTTG CCGTTGAAAA CGCGCATCCG GAATTGATAG AAGCTTGTGA CTTTGTTTGC GTCCATCATG ACAGACATGC CGCTTCTTAT GTGATTGACT GGATTGAAAA AAATATTGTG AATAAAAACA AAACGTGGAA CCAATCTTTA GAAACTCCGT CTAATACAAA AAACATAAAA AGACGGAGGG AATTGTATGA AAATGAATAT TGTTTCAAAA TACAGAAGTA A
|
Protein sequence | MEKKFEGLML VSDLDGTLLN SKLEVSEENI KAVSYFVDNG GIFTIATGRM ELGTRKYLEV LPVNAPVILY NGALIYDFNK EERIWTCGFR QDIRNLLKEL LDRFPYLGIE IFPGGDNVYL LRENEETEKH SKKEGFKPVV ISVDEMPKNF YKIILTANPD RLPEVEEFLK PMAQGFRTVY SEKQFLEILD NETSKGRALA ELAKIMGIEK DNVISVGDNQ NDLEMIKVSG TGFAVENAHP ELIEACDFVC VHHDRHAASY VIDWIEKNIV NKNKTWNQSL ETPSNTKNIK RRRELYENEY CFKIQK
|
| |
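The sequences above are displayed in space-separated blocks of up to 10 characters. The following is a minimal sketch, assuming plain Python with no external libraries, of removing that display spacing and computing a GC percentage that could be compared against the GC content row. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(displayed: str) -> str:
    """Remove display whitespace from a blocked sequence and upper-case it."""
    return "".join(displayed.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first three blocks of the gene sequence are used here for brevity;
# paste the full Gene sequence field to evaluate the whole gene.
first_blocks = "ATGGAGAAAA AGTTTGAAGG TTTGATGCTT"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_percent(seq), 1))
```

The same `clean_sequence` helper applies to the protein sequence field, since it only strips whitespace.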