Gene Cthe_1506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1506 
Symbol 
ID4810544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1829489 
End bp1830511 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content38% 
IMG OID640106926 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001037927 
Protein GI125974017 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG1418] Predicted HD superfamily hydrolase
[COG4905] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTTGTA TGATATATGG ATGCGGAGCG TTATTGGCTT TTTTTACCTA TGATTTGATG 
GAATCCGGTC GTCTGCCGAA TATTAAATGG TGGATGATAT TTATATTTGC TTTTGTTACA
AGCATGGTTT TAGGATACCC TGCTTCGTGG GCTTTTGAAA AACTGTTCAA GGAACGGTTG
TGTGATTGCA CTAATGTTCC ACTGAACATT AACGGAAGAA TAAGTGTGCC TACGTCTGTT
GTATTTGGTG CTGTATCAAT ACTTATGGTT AAAGCTTTGG TACCGTTGGT GAACAAAGGA
CTTAACACGT TATCTGAAGC TTTGCTGGAT ATTCTTGCTT ATGTTCTTGT TTCAATTGTG
TTAATAGATA CCACATTGAT AATATCGTTA ATGACGGATT TTCGAAGATA TGTTGTTTTG
GTAGACGGGG GATTTCAAAA TCATATAGCA GTTTTTGCAG AACACTTTTA TGCCAATCCG
GATTCTTATT ACAATAGAGT AATGCAACGT GTTGGAGATT TTAAACTTTC AGTAAGCAAA
AATCTTATTG AAAAGCAGCT TTGTGAGGAG GAGTTTGCTG AATTAATTAA AGATTACCTG
GAATATGATG TGATAAAGCA GATGGATGAG CATATTCATC ATGGTACAAC TACAACATTG
CAGCACTGCG AAAATGTAGC ATGGATTTGT TACCTGCTTA ATAAAAAACT GAATTTGAAT
GCGAACGAAA AGGAACTGGT GGAAGTGGCA ATGCTTCATG ATTTGTTTCT CTACGACTGG
CACGACGGTG ATCCAGCGAG AAGGATACAT GGCTTTGTTC ACGCTGACAT TGCATGCAAT
AACGCAATAA AACATTTTGG CATACCGGAA AAACAGCAGG AGGCTATACG CAGTCATATG
TGGCCGCTAA ATATTACGAA AATTCCGAAA AGCAGGGAAG CTGTAATTTT ATGCATTGTA
GACAAATATT GCGCTCTTAT TGAGACAGTG CGCTTAAACA AGCATTTTGG ATTAAGACAT
TGA
 
Protein sequence
MICMIYGCGA LLAFFTYDLM ESGRLPNIKW WMIFIFAFVT SMVLGYPASW AFEKLFKERL 
CDCTNVPLNI NGRISVPTSV VFGAVSILMV KALVPLVNKG LNTLSEALLD ILAYVLVSIV
LIDTTLIISL MTDFRRYVVL VDGGFQNHIA VFAEHFYANP DSYYNRVMQR VGDFKLSVSK
NLIEKQLCEE EFAELIKDYL EYDVIKQMDE HIHHGTTTTL QHCENVAWIC YLLNKKLNLN
ANEKELVEVA MLHDLFLYDW HDGDPARRIH GFVHADIACN NAIKHFGIPE KQQEAIRSHM
WPLNITKIPK SREAVILCIV DKYCALIETV RLNKHFGLRH