Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1091 |
Symbol | |
ID | 4811389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1297090 |
End bp | 1298664 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640106513 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_001037516 |
Protein GI | 125973606 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000612767 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGTGCTA TAGCTTATCT TGGTGGATTG CTGACAGGTA TAGTAATTGC AATAATTGCT TCAATTATTG CTTCAGTAAT AAGTTACCGA AAAGGTATTG AATTTAGAAA GAAAAAAGCA GAAGCCAAAA TTGGCAGTGC TGAACAGGAA GCAGAGCGAA TAATCAGCGA AGCTCAAAAA ATTGCGGAAG CTAAAAAAAG GGAAGTACTG CTTGAGGCAA AGGAAGAGAT TCATAAAAGC AGGTTGGAGC TCGATAGAGA AATTAAGGAA AGAAGAAATG AAATCCAGCG TCTGGAGAGA AGACTTGTTC AAAAGGAAGA GGCTCTTGAC AGAAAAGTCG AATCCTTGGA ACAAAAAGAA GAACTTCTTA ATAAAAAGAC GAAAGAGATT CAGGAACTTT ATGAACAGAC ACTTGAGACA CAAAGACAAC AGGTGGCCGA GCTTGAAAGA ATATCCGGGC TGTCTGTTGA CGAGGCAAAA GAAGTCCTGC TGAAAAATGT TGAAAATGAA GTAAAACATG AAATGGCAAT CCTAATTAAG GACATTGAGG CAAAGGCTAA AGAAGAGGCA GAGATCAGGG CAAAGAATAT TATTGCCATG GCGATTCAAA AATGTGCGGC TGATCACGTA TCTGAAGTTA CCGTTTCTGT TGTTCCACTT CCGAATGATG AGATGAAGGG TAGAATAATA GGCCGCGAGG GAAGAAATAT CAGGACCCTC GAAACGCTTA CGGGAATCGA CCTCATTATT GATGACACGC CTGAAGCCGT TATCCTTTCC GGGTTTGATC CAATAAGGAG AGAAATAGCG AGAATTACTC TCGAAAAGCT CATTCTTGAC GGGAGAATTC ATCCTGCAAG AATTGAAGAA ATGGTTGAAA AAGCCAGGAA AGAAGTTGAA AACACTATTC GCCAGGAAGG GGAAAATGCC ACATTTGAAA CAGGAGTCCA TGGATTGCAT CCTGAGATTG TCAGATTGCT TGGTAAGCTT AAGTTTAGAA CAAGCTATGG CCAAAATGTT TTGAGCCATT CCATTGAAGT GGCTCGTTTG GCTGGTTTGA TGGCGGCAGA GCTTGGAGTT GATGTTAATC TTGCAAAGAG GGCAGGCTTG CTGCACGATA TCGGCAAGGC CGTTGACCAT GAAGTTGAAG GGTCACACGT TACGATTGGA GCTGACATTG CTAAAAAGTA TAAAGAATCC AATGAAGTTG TCAATGCAAT TGCTTCGCAC CATGGCGATG TAGAAGCCAC CTCCATCATA GCGGTGCTTG TACAAGCTGC GGATTCTATT TCGGCTGCAA GGCCCGGAGC AAGAAGGGAA ACTCTTGAAT CATATATCAA GAGATTGGAG AAGCTTGAGG AAATTGCGAA TTCTTTTGAC GGTGTTGATA AGTGTTTTGC TATTCAGGCA GGTAGAGAAA TTCGTATCAT GGTGAAACCT GAGGATGTAT CGGATTCAGA TATTGCATTG ATTGCAAGAG ATATTGTGAA AAGGATTGAA AATGAGCTTG ATTATCCCGG ACAGATAAAA GTGAATGTTA TCAGGGAGAC CAGGTATATA GAATATGCTA AATAG
|
Protein sequence | MCAIAYLGGL LTGIVIAIIA SIIASVISYR KGIEFRKKKA EAKIGSAEQE AERIISEAQK IAEAKKREVL LEAKEEIHKS RLELDREIKE RRNEIQRLER RLVQKEEALD RKVESLEQKE ELLNKKTKEI QELYEQTLET QRQQVAELER ISGLSVDEAK EVLLKNVENE VKHEMAILIK DIEAKAKEEA EIRAKNIIAM AIQKCAADHV SEVTVSVVPL PNDEMKGRII GREGRNIRTL ETLTGIDLII DDTPEAVILS GFDPIRREIA RITLEKLILD GRIHPARIEE MVEKARKEVE NTIRQEGENA TFETGVHGLH PEIVRLLGKL KFRTSYGQNV LSHSIEVARL AGLMAAELGV DVNLAKRAGL LHDIGKAVDH EVEGSHVTIG ADIAKKYKES NEVVNAIASH HGDVEATSII AVLVQAADSI SAARPGARRE TLESYIKRLE KLEEIANSFD GVDKCFAIQA GREIRIMVKP EDVSDSDIAL IARDIVKRIE NELDYPGQIK VNVIRETRYI EYAK
|
| |