Gene Cthe_2353 details


Gene Information

Locus tag: Cthe_2353
Symbol:
ID: 4808987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2806134
End bp: 2807282
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 39%
IMG OID: 640107760
Product: PGAP1-like protein
Protein accession: YP_001038748
Protein GI: 125974838
COG category: [R] General function prediction only
COG ID: [COG1075] Predicted acetyltransferases and hydrolases with the alpha/beta hydrolase fold
TIGRFAM ID:
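
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive, and that the protein length excludes a single terminating stop codon. The function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def cds_length_from_protein(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


# Values taken from the record above.
print(gene_length(2806134, 2807282))  # 1149
print(cds_length_from_protein(382))   # 1149
```

Both figures agree with the reported gene length of 1149 bp.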


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.808395
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGTGC GCCAAGTACG TAACCGCAAA ATTTTTATAA TGATTGCCCT GTCACTTTTT 
CTTTTATTTG TGCATTTCAG GCAAATTAAT GCTTTCTTTA TTGGTGCCTT TTACAATAAA
AACAATCTGA ACATTTACCT TGACCTTATA AACAGCCAGG TTCACACAAT TTCATCAGAG
CCGTCAAACA AAGTATATCT AAAAGCTGTA GTTAAAGATG CAAACGGAAA ACTCATTCCC
CATATTGAAG TTAACTTTGA AGCTTCAAAA GGCATGGGTA CGGTACAGCC TGCAAAAGCC
GCAACCGACA GCCGGGGTGA ATGTTTTGTC ACTTACGTCC CCGAATACTA TTACAACCTT
AGTCCTGATG CGAATCCACG GCATGTTGTC ATTACTGCTT CAATTGCCGG CACCGACACA
AACTCAACGG TAAAACTGAA CCTTGTTCCG GCACCCGTCG TCTTTGTGCA TGGATACAGG
GAAACTGCGG ATGTTTTTGA CAATTTGAAT GAATTTATTT CATCAAAAGG GTATACTTGC
ATTTCCCTGA ATTATGACTC AACTTTGGGA ATAGAGCATA GTGCCAAAGA ACTGGAGCTG
TTTTTGCAAA AGCAAAAAAA GGATTTTTTA AGTCAGGGAA TCCTTGTAAA CAAATTTGAC
CTGATTACCC ACAGTATGGG GGGATTGGTG GCAAGGTACT ATTCAGCAAG TCAGAACTAT
CTTAAAAATG ACGATATCAA TAAAATAATT TTTCTTTCAG TGCCTCACAA AGGCTCGGTT
TTGGCATCAA TAGGCGAGGA ATATTTCAAA GACAAATCTA TTAAAGAACT GGTTCCTGAC
AACGAATTGT TCGTAAGCAT ATTCCCCAAT ACAATTAACG GCGGGCTCAA CAATTCAATA
CAAACAGGTA ATCTTTTAAG CCAGTACGAT GAAGTGGTTA CAAATGAAAG TGCCGCTCTT
GACAAATGGG GGATTAAGAC TGAAATATTC AACGTGGGGG AAAACAGTTT CACTGTGCAC
AATCTGCTAA GCGGCAACAT TCTTGATGCT CCGAACCATA AAGGCATATT AAACAACAGC
ACAGTCTTTA ACCGCATCGC TGAAATGTTA AATACCAATC TTCCTTATCC TGCCGTTATA
AACAAATAA
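
The gene sequence above is printed in blocks of ten bases separated by spaces, so it needs to be joined before any computation. The following is a minimal sketch of how one might strip that spacing and compute GC content; the helper names `clean_sequence` and `gc_content` are hypothetical, and the example uses only the first line of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTCTGTGC GCCAAGTACG TAACCGCAAA ATTTTTATAA TGATTGCCCT GTCACTTTTT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_content(seq), 2))  # GC fraction of this line only
```

Applying the same two functions to the full sequence would give a value comparable to the GC content field in the gene information, assuming that field is computed over the whole gene.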
 
Protein sequence
MSVRQVRNRK IFIMIALSLF LLFVHFRQIN AFFIGAFYNK NNLNIYLDLI NSQVHTISSE 
PSNKVYLKAV VKDANGKLIP HIEVNFEASK GMGTVQPAKA ATDSRGECFV TYVPEYYYNL
SPDANPRHVV ITASIAGTDT NSTVKLNLVP APVVFVHGYR ETADVFDNLN EFISSKGYTC
ISLNYDSTLG IEHSAKELEL FLQKQKKDFL SQGILVNKFD LITHSMGGLV ARYYSASQNY
LKNDDINKII FLSVPHKGSV LASIGEEYFK DKSIKELVPD NELFVSIFPN TINGGLNNSI
QTGNLLSQYD EVVTNESAAL DKWGIKTEIF NVGENSFTVH NLLSGNILDA PNHKGILNNS
TVFNRIAEML NTNLPYPAVI NK