Gene Information |
Locus tag | Cthe_1506 |
Symbol | |
ID | 4810544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1829489 |
End bp | 1830511 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106926 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001037927 |
Protein GI | 125974017 |
COG category | [R] General function prediction only; [S] Function unknown |
COG ID | [COG1418] Predicted HD superfamily hydrolase; [COG4905] Predicted membrane protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 43 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATTTGTA TGATATATGG ATGCGGAGCG TTATTGGCTT TTTTTACCTA TGATTTGATG GAATCCGGTC GTCTGCCGAA TATTAAATGG TGGATGATAT TTATATTTGC TTTTGTTACA AGCATGGTTT TAGGATACCC TGCTTCGTGG GCTTTTGAAA AACTGTTCAA GGAACGGTTG TGTGATTGCA CTAATGTTCC ACTGAACATT AACGGAAGAA TAAGTGTGCC TACGTCTGTT GTATTTGGTG CTGTATCAAT ACTTATGGTT AAAGCTTTGG TACCGTTGGT GAACAAAGGA CTTAACACGT TATCTGAAGC TTTGCTGGAT ATTCTTGCTT ATGTTCTTGT TTCAATTGTG TTAATAGATA CCACATTGAT AATATCGTTA ATGACGGATT TTCGAAGATA TGTTGTTTTG GTAGACGGGG GATTTCAAAA TCATATAGCA GTTTTTGCAG AACACTTTTA TGCCAATCCG GATTCTTATT ACAATAGAGT AATGCAACGT GTTGGAGATT TTAAACTTTC AGTAAGCAAA AATCTTATTG AAAAGCAGCT TTGTGAGGAG GAGTTTGCTG AATTAATTAA AGATTACCTG GAATATGATG TGATAAAGCA GATGGATGAG CATATTCATC ATGGTACAAC TACAACATTG CAGCACTGCG AAAATGTAGC ATGGATTTGT TACCTGCTTA ATAAAAAACT GAATTTGAAT GCGAACGAAA AGGAACTGGT GGAAGTGGCA ATGCTTCATG ATTTGTTTCT CTACGACTGG CACGACGGTG ATCCAGCGAG AAGGATACAT GGCTTTGTTC ACGCTGACAT TGCATGCAAT AACGCAATAA AACATTTTGG CATACCGGAA AAACAGCAGG AGGCTATACG CAGTCATATG TGGCCGCTAA ATATTACGAA AATTCCGAAA AGCAGGGAAG CTGTAATTTT ATGCATTGTA GACAAATATT GCGCTCTTAT TGAGACAGTG CGCTTAAACA AGCATTTTGG ATTAAGACAT TGA
Protein sequence | MICMIYGCGA LLAFFTYDLM ESGRLPNIKW WMIFIFAFVT SMVLGYPASW AFEKLFKERL CDCTNVPLNI NGRISVPTSV VFGAVSILMV KALVPLVNKG LNTLSEALLD ILAYVLVSIV LIDTTLIISL MTDFRRYVVL VDGGFQNHIA VFAEHFYANP DSYYNRVMQR VGDFKLSVSK NLIEKQLCEE EFAELIKDYL EYDVIKQMDE HIHHGTTTTL QHCENVAWIC YLLNKKLNLN ANEKELVEVA MLHDLFLYDW HDGDPARRIH GFVHADIACN NAIKHFGIPE KQQEAIRSHM WPLNITKIPK SREAVILCIV DKYCALIETV RLNKHFGLRH
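
The record's coordinates, lengths, and sequences can be cross-checked against each other. The following is a minimal sketch, not part of the source database: the function names `gene_length` and `translate_cds` are hypothetical, and it assumes the gene sequence is given 5'->3' in coding orientation as a complete CDS, read with NCBI translation table 11 (whose alternative start codons, such as TTG here, are read as Met at the first position).

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in TCAG codon order (identical for NCBI tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
# Initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def translate_cds(dna):
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # Assumption: a leading initiator codon is translated as Met.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a terminal stop codon, if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# Coordinates from the record: Start bp 1829489, End bp 1830511.
print(gene_length(1829489, 1830511))            # 1023
# First seven codons of the gene sequence above.
print(translate_cds("TTGATTTGTA TGATATATGG A"))  # MICMIYG
```

A 1023 bp CDS holds 341 codons; removing the stop codon leaves 340 residues, which agrees with the reported protein length. Passing the full gene sequence to `translate_cds` should reproduce the protein sequence once the spaces in the listing are removed.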