Gene Cthe_3165 details

Gene Information

Locus tag: Cthe_3165
Symbol:
ID: 4809615
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3740168
End bp: 3741316
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 38%
IMG OID: 640108598
Product: PpiC-type peptidyl-prolyl cis-trans isomerase
Protein accession: YP_001039553
Protein GI: 125975643
COG category:
COG ID:
TIGRFAM ID:
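
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The helper name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are inclusive, and the final codon is a stop
    codon that does not contribute a residue to the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(3740168, 3741316)
print(gene_len, protein_len)  # 1149 382
```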


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.106005
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTGTGGG TCATAAGGAT AATATTGCAG GAAGGTTTGG TGATGATGGA AAACAATGCC 
GGTATAAGTA AAAAGCCTGT CGTTAAGGTG CACGTCATCT TGCTGGTGGC AGGAATACTT
GTTCTGTCTG CCGTATTGGC AATTTTGGTT GCATATCAGG CAGGACTTAT ATATGGAGAC
TTTTCAGAAC TTGCCAGGGT AAACGGTGAG CCTGTCTATG TCAAAGAATA TAAAATGAAG
CTTTTAAGCA ATACCACCGA AGTAATCAAT TATTTCAGTC AAAGTTACGC AGTTGAAACC
AAAGAAAATT TCCGCACCGA CAGCTACAGT GGCGAAGCGC CGGTTGAAAT GGCAAGAAAA
AAGGCATTGG ACGACATTGT GGAAGTAAAG GTTCAACAGA TACTTGCAAA GGAAAAGGGA
ATTATTGAAA GTACTGATTA TAGAGAGTTT TTAAAAGAAC TGGAAAATGA AAACCGACAA
AGAAAGGATG CACTCAAAAG CAACAAGGTA GTGTACGGGC CCGACAATTA CGGAGAGATT
GAGTATTTCA ACTATTCTTT TGACAACATG GTTTCAAAGC TTAAGGAGAA GCTGAAGGAA
AATGAATTGT CCATACCGGA GGAAAAGCTT GAAAGCATGT ATAATTTGCT TAAAGACACG
AGATTCAAGC TTCCGGATGA TATAAAGATT CAGGTTATAA GCATTGGTTT TACCGATGAA
AAGGGTATTA TTAATGATGA CCTGAAGAGT AAGGCAAGGG TTAAGATTGA AGAGGCAAAA
AAGAGGATTG ACAACGGAGA GCCTTTTGAA GAAGTGGCAC TGGATTATAA TCCGAAAAGC
GGAGTTTTGG AGTACGTCTT TACAAAAGAG AAGCAGATGG CAAAAGACAT TTCGCATCCC
GAACTTTTGG ATGAAGCGTT GAAGTTAAAG CCGGGACAGG TGAGTGAAAT AATAGAAAGA
AGTACGGATT TCGTTTTAAT ATTGTGCAAG GAGAAAAAAA GTACGGGTTA CCTTCCTTAT
AAGGACGCAC GAAAACAACT TTTGGACGAA TTGATAGAAA AGGATTATCA AGAGTATATA
GACAAACTTG TTGAACAGGC GGATGTAAAA ATAAATGAAA AATTATATAG GCGGATAAAT
GTAAATTAA
 
Protein sequence
MLWVIRIILQ EGLVMMENNA GISKKPVVKV HVILLVAGIL VLSAVLAILV AYQAGLIYGD 
FSELARVNGE PVYVKEYKMK LLSNTTEVIN YFSQSYAVET KENFRTDSYS GEAPVEMARK
KALDDIVEVK VQQILAKEKG IIESTDYREF LKELENENRQ RKDALKSNKV VYGPDNYGEI
EYFNYSFDNM VSKLKEKLKE NELSIPEEKL ESMYNLLKDT RFKLPDDIKI QVISIGFTDE
KGIINDDLKS KARVKIEEAK KRIDNGEPFE EVALDYNPKS GVLEYVFTKE KQMAKDISHP
ELLDEALKLK PGQVSEIIER STDFVLILCK EKKSTGYLPY KDARKQLLDE LIEKDYQEYI
DKLVEQADVK INEKLYRRIN VN
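
The gene sequence starts with TTG while the protein starts with M. This is consistent with translation table 11, where TTG is an accepted start codon and the initiating codon is read as methionine. The following is a minimal sketch of how the protein sequence can be derived from the gene sequence under that table. The function name `translate_cds` is hypothetical, and the codon table is built by hand rather than taken from a library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, first base varying slowest, in TCAG order.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons accepted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met if it is a valid start."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Illustration only: the first 30 bases of the gene joined to its last 9.
print(translate_cds("TTGCTGTGGG TCATAAGGAT AATATTGCAG GTAAATTAA"))  # MLWVIRIILQVN
```

Passing the full gene sequence with its spaces and line breaks intact should work the same way, since whitespace is removed before the sequence is split into codons.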