Gene Cthe_0046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0046 
Symbol 
ID4808811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp59535 
End bp62237 
Gene Length2703 bp 
Protein Length900 aa 
Translation table11 
GC content35% 
IMG OID640105455 
Producthypothetical protein 
Protein accessionYP_001036480 
Protein GI125972570 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.63373 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA TATTATGTTG TATAATATTG TTGTGTATTT TGGCACCTTT TTTGTTGATG 
AATGCATATG CAGTAAGTTA TAAAAGTGCA AAAGAAGCAA TAGATGATGC TAATGACTTT
ATATTTCAGA AAATGGGCTA TGAAAATTAC TATTTGTTTA AAATTGGTAA TATGGATATA
AATGAGAAAC TTGCTCAATA TGGTTCAGAA ACATTCTACA ACAGGCCGGT ATTTGTATAT
GGGGACAGTG TGGAAGCTAG TAAAAGGACG ACAACAGGAG GCCGGGATAT AGTTAAAAAA
GTTAATGGTA AGGATGAATA TCGTGCATTA GGGTATGCAG TAGATGGTTC AGTTTTTCCA
AATCCTGCCT TCCTATATGA TAATGAAGGC CATGCAGCGA AGGACAAGAA GTGGGTTAAA
GAACCGTGGA ATGGTAAAAA TATCAAATAT TTATATGGCG AGGATGGAAG CATTATAAAA
AAAACTTTAT CAGATAGTGT TCTTAATTAT ATAAAAAACT GGATCCAAAT TAACGGTTTA
ACACCTACTG AAGTTGAAGC TTGGACAGGC AAAAGAAACT ATTTCGTGGA AAATGCAGTT
GGTGTACCGG AAAAATTAAA GAACAATTTT GAAGACTTTC TGTACATAAT ACAACCTCCG
ACGGAACATG CATGGGGACT GGGTATAGCA TTTTACTACT GGAACGGATA TAACAACCTC
AACTATAAAT CGTTCCTCAT AAAGCCATTT GATATGGTTG CTGATGATTT GGATGTTGGT
TTTTATACGA TACCTGGAAG TGTGACAGAA GGCAGCAAAG TATTGGTTGC AGTAAAAGTT
AAATCACACT TCGATATAGA TTTGGAAAAC GTTAGCTTCA AGTGGAATAT TACCACAAAA
GACAGCGATG ATAAGGATAT TCCATTGGAT GCGGGAAAAT ACGAACTTCA ATTTGTCGAT
TCGGCGGATA ATAATAATCA AAGCGGAACC ATAAATATAT CTGCAAAATA TAAGGAAGAG
TTTTTTTATG CTCAGTTTAA AATGCCTGAC CGTGATGTGT ATATAGAATT TGCAATAAAT
GAAGATGGGA AAAATCCTGT GGAAAGTAAT ATGGAAAATA ATACTGTTTC TACGGTGGTT
AAAGCTGAAA AGCCTATAAA TATGGCAGTT AAAAAATATG ATTTGCCATA TTATGCATTG
TCCAGGGAGA TAAGCTATCC ATTGGCAGAC CAAAATATTG TAGTTAATTT AAAAAAGCCA
GGTAGTGGCT GGTGGAGTGG AAATGCCAAA GTAGATGCTT TTAATGTAGA TATAGACACT
AAGTTTTTAC ACAATTACCA AGTAGGAAGT CCGGTTCTTG AAGATGAAGA AGACGATGTA
ATTGTTAGTT TGCCGAATGT GAGAGCAAAA ATCCAAAGAA GTGACTTTGG AGATGACCCT
GAAAACAAAA AGTGGTTGGC TGGTAATAAT TCAACTGATA TATTAACGGC AGCTTTAAAG
ACATTCTATG ATGTCTTTGT ATTGAAAGAG TATAAATATG AAACGAAATG TAATAGACAT
GAAGACTGCG AAGCGGAAGG CTGTTCCGGC TATGTAAAGG AAACAGGTTA TGCAAGGGGC
AGTAGAAGCG GAACTGCTCC GATAGAAGTA AATACATACG TATACAATGG GAAAAAAGAT
TTAAACAAAA AGCAATATGA AAATAAAATT GTCAATAATT ATGACAAAGA CTTTAAAGCA
AGAATTCTAT GGACAAACAG TCCTATCAAA TTCAATGTAA TACGTTATAT GTGTGACTTG
GATGTAAACG GAAACGCGAT AAATTGGAGA GCTGTGCCCG GCAAATATGA AAGGGAGTTT
GTGCAGCAGT GTAGTGCAGA TATTGACTGG AATATAGCAA GATCAATGCA ACAAGATTAC
GAGCAGGCCC GGAAAGCGGC AAGAGAAGGT AAACAAGACA AGTCATTTTA TAACAAGGCG
GTATTTGCGA CAGATAAAAA CCTGCAACGT TATGATTACC CGATAAAATC AGGATATTAC
TTTAATCCGA CGGGTACATA TACATTTGAA ATTACAACTG TTACTTACAA AAATAACCAA
AATGATACAG AAGAGCACAA AAAGCTTGTA AATGCATTAG TAAACTCTTT TAGATATGAA
TCGAACTTAA TATATGTTGA TGACAACGGT AATCCTGTAA ATATTGCAAA TGGATCCTAC
AAAACTCTGG GAGTATTAAC TGCAGCAAAA AATACGGGCA TTGGTGGTAA AAAACTAATA
TCTGTAAATC GTGACTACAA AAAGGTAACG GATGAGATAT ACTACGATTC TAAGAGGACA
GAGGATGAAA ATAAAAACGG AAGTCATGAT TTTTGGAAAA TGTCAATGGA AGGTTATTCT
TTGTCAGGAA GCTTGGACAG TTATAACAAG TACAAATACC GGGAATATGT TGCAGGTGGG
AAAGTATTTA AAATAACTGA AACAACAAAG GTAACATTTA TAATAAACGA AGGAAATGCA
AAATTTTATA CTCATCCCAA AATGCCTGAT GGTGAATATT ATATAACAGT AAGGCTTTCT
GACATAGACT TAAGTAAAAT GTCCGGGATG GACTACAGTT CGATAAAAGA TGTATTGAAG
GGAGTAATTT TAGACAGAAT CAAAGTCAAC GTAAAAGGTT CAATATATGG TGATATAAGT
TAA
 
Protein sequence
MKRILCCIIL LCILAPFLLM NAYAVSYKSA KEAIDDANDF IFQKMGYENY YLFKIGNMDI 
NEKLAQYGSE TFYNRPVFVY GDSVEASKRT TTGGRDIVKK VNGKDEYRAL GYAVDGSVFP
NPAFLYDNEG HAAKDKKWVK EPWNGKNIKY LYGEDGSIIK KTLSDSVLNY IKNWIQINGL
TPTEVEAWTG KRNYFVENAV GVPEKLKNNF EDFLYIIQPP TEHAWGLGIA FYYWNGYNNL
NYKSFLIKPF DMVADDLDVG FYTIPGSVTE GSKVLVAVKV KSHFDIDLEN VSFKWNITTK
DSDDKDIPLD AGKYELQFVD SADNNNQSGT INISAKYKEE FFYAQFKMPD RDVYIEFAIN
EDGKNPVESN MENNTVSTVV KAEKPINMAV KKYDLPYYAL SREISYPLAD QNIVVNLKKP
GSGWWSGNAK VDAFNVDIDT KFLHNYQVGS PVLEDEEDDV IVSLPNVRAK IQRSDFGDDP
ENKKWLAGNN STDILTAALK TFYDVFVLKE YKYETKCNRH EDCEAEGCSG YVKETGYARG
SRSGTAPIEV NTYVYNGKKD LNKKQYENKI VNNYDKDFKA RILWTNSPIK FNVIRYMCDL
DVNGNAINWR AVPGKYEREF VQQCSADIDW NIARSMQQDY EQARKAAREG KQDKSFYNKA
VFATDKNLQR YDYPIKSGYY FNPTGTYTFE ITTVTYKNNQ NDTEEHKKLV NALVNSFRYE
SNLIYVDDNG NPVNIANGSY KTLGVLTAAK NTGIGGKKLI SVNRDYKKVT DEIYYDSKRT
EDENKNGSHD FWKMSMEGYS LSGSLDSYNK YKYREYVAGG KVFKITETTK VTFIINEGNA
KFYTHPKMPD GEYYITVRLS DIDLSKMSGM DYSSIKDVLK GVILDRIKVN VKGSIYGDIS