Gene Cthe_1091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1091 
Symbol 
ID4811389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1297090 
End bp1298664 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content42% 
IMG OID640106513 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001037516 
Protein GI125973606 
COG category[R] General function prediction only 
COG ID[COG1418] Predicted HD superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR03319] conserved hypothetical protein YmdA/YtgF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000612767 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTGTGCTA TAGCTTATCT TGGTGGATTG CTGACAGGTA TAGTAATTGC AATAATTGCT 
TCAATTATTG CTTCAGTAAT AAGTTACCGA AAAGGTATTG AATTTAGAAA GAAAAAAGCA
GAAGCCAAAA TTGGCAGTGC TGAACAGGAA GCAGAGCGAA TAATCAGCGA AGCTCAAAAA
ATTGCGGAAG CTAAAAAAAG GGAAGTACTG CTTGAGGCAA AGGAAGAGAT TCATAAAAGC
AGGTTGGAGC TCGATAGAGA AATTAAGGAA AGAAGAAATG AAATCCAGCG TCTGGAGAGA
AGACTTGTTC AAAAGGAAGA GGCTCTTGAC AGAAAAGTCG AATCCTTGGA ACAAAAAGAA
GAACTTCTTA ATAAAAAGAC GAAAGAGATT CAGGAACTTT ATGAACAGAC ACTTGAGACA
CAAAGACAAC AGGTGGCCGA GCTTGAAAGA ATATCCGGGC TGTCTGTTGA CGAGGCAAAA
GAAGTCCTGC TGAAAAATGT TGAAAATGAA GTAAAACATG AAATGGCAAT CCTAATTAAG
GACATTGAGG CAAAGGCTAA AGAAGAGGCA GAGATCAGGG CAAAGAATAT TATTGCCATG
GCGATTCAAA AATGTGCGGC TGATCACGTA TCTGAAGTTA CCGTTTCTGT TGTTCCACTT
CCGAATGATG AGATGAAGGG TAGAATAATA GGCCGCGAGG GAAGAAATAT CAGGACCCTC
GAAACGCTTA CGGGAATCGA CCTCATTATT GATGACACGC CTGAAGCCGT TATCCTTTCC
GGGTTTGATC CAATAAGGAG AGAAATAGCG AGAATTACTC TCGAAAAGCT CATTCTTGAC
GGGAGAATTC ATCCTGCAAG AATTGAAGAA ATGGTTGAAA AAGCCAGGAA AGAAGTTGAA
AACACTATTC GCCAGGAAGG GGAAAATGCC ACATTTGAAA CAGGAGTCCA TGGATTGCAT
CCTGAGATTG TCAGATTGCT TGGTAAGCTT AAGTTTAGAA CAAGCTATGG CCAAAATGTT
TTGAGCCATT CCATTGAAGT GGCTCGTTTG GCTGGTTTGA TGGCGGCAGA GCTTGGAGTT
GATGTTAATC TTGCAAAGAG GGCAGGCTTG CTGCACGATA TCGGCAAGGC CGTTGACCAT
GAAGTTGAAG GGTCACACGT TACGATTGGA GCTGACATTG CTAAAAAGTA TAAAGAATCC
AATGAAGTTG TCAATGCAAT TGCTTCGCAC CATGGCGATG TAGAAGCCAC CTCCATCATA
GCGGTGCTTG TACAAGCTGC GGATTCTATT TCGGCTGCAA GGCCCGGAGC AAGAAGGGAA
ACTCTTGAAT CATATATCAA GAGATTGGAG AAGCTTGAGG AAATTGCGAA TTCTTTTGAC
GGTGTTGATA AGTGTTTTGC TATTCAGGCA GGTAGAGAAA TTCGTATCAT GGTGAAACCT
GAGGATGTAT CGGATTCAGA TATTGCATTG ATTGCAAGAG ATATTGTGAA AAGGATTGAA
AATGAGCTTG ATTATCCCGG ACAGATAAAA GTGAATGTTA TCAGGGAGAC CAGGTATATA
GAATATGCTA AATAG
 
Protein sequence
MCAIAYLGGL LTGIVIAIIA SIIASVISYR KGIEFRKKKA EAKIGSAEQE AERIISEAQK 
IAEAKKREVL LEAKEEIHKS RLELDREIKE RRNEIQRLER RLVQKEEALD RKVESLEQKE
ELLNKKTKEI QELYEQTLET QRQQVAELER ISGLSVDEAK EVLLKNVENE VKHEMAILIK
DIEAKAKEEA EIRAKNIIAM AIQKCAADHV SEVTVSVVPL PNDEMKGRII GREGRNIRTL
ETLTGIDLII DDTPEAVILS GFDPIRREIA RITLEKLILD GRIHPARIEE MVEKARKEVE
NTIRQEGENA TFETGVHGLH PEIVRLLGKL KFRTSYGQNV LSHSIEVARL AGLMAAELGV
DVNLAKRAGL LHDIGKAVDH EVEGSHVTIG ADIAKKYKES NEVVNAIASH HGDVEATSII
AVLVQAADSI SAARPGARRE TLESYIKRLE KLEEIANSFD GVDKCFAIQA GREIRIMVKP
EDVSDSDIAL IARDIVKRIE NELDYPGQIK VNVIRETRYI EYAK