Gene Cthe_0451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0451 
Symbol 
ID4808379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp565307 
End bp567178 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content40% 
IMG OID640105865 
Producthypothetical protein 
Protein accessionYP_001036882 
Protein GI125972972 
COG category[S] Function unknown 
COG ID[COG2604] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATGAAG TATTGGCTAA AAATCTGTCC CTTTTAAAAG AGTATCAGCC TGAAACATAC 
CTGAAGCTTG ACAGATACAT AAAGGGAGAG TATGTTCCAA AGGACAATTC AGTGGAAAAA
ATACTTTTGG CCCGTCAGGA CGATCTCATA ATAAATATTT TGGTAAAATG CTCTGACAAA
GATTTTCTGC TTTGTGACCA TGAAAACCCG ATTAATGAAG CTTATGCCTG GATTGACAAA
TATATTGACC CTTCCAACAA GGCGGACATT GTCTTTGGAA TGGGATTGGC ATTTCACCTT
GAAGTTCTTC TTACAAGTTT TCCTAACAAA AAAGTAATTG TAATTGAACC CAATATAAAC
TTGTTTTATC AGATTGCCTG TATAAGAAAT CTTGAGCCGG TGATTAAAAA GGCTGAAATA
ATTGTGGATG AAGACTTGGA TGTTATACTT GAGAGAATAA ATTCCTTGTT CTGGGATACG
GAAAAGGGCG GGATTCAGGT ACAGCCTCTT GAGGTGTATG GTGAAATGTT TCCCGAAATG
TGGGACAAGC TTCGGGACAG TTTCATAAAG CTTGCAAACA ATTTCACTGT CGATATTGCA
ACCAGAAGGA AATTTGGAGA GCTGTGGGTG CACAATAATA TAAAAAATCT CAACAAAATT
TGTGAAGCCT CCAATGCCGG TGTTCTGGTC GGAAAATTCA AGGGCATTCC CGGTATATTG
GTATCGGCCG GGCCTTCCCT TGAAAAAAAC ATCCACCTTT TAAAAGGTCT TGAGGATAAG
TGTGTGATTA TGGCAGCGGG AACGGCAGTA CGAATTATGG AGGATTTCGG TCTGGCACCG
CATTTTATGG TGGGAATTGA CGCGGGAGCT AAAGAAGGGG AAATACATTC CAACGTAAAA
AACAAAGATA TATATTTTAT TTATTCAAAC CAGGTTTCAA CATATTCTGT GGATGGCTAC
AAAGGCCCCA AATTTGTTAT GAATTATCCT ATTGACATGT ATACGGCAGG CTTTTTTGAG
TATGCCGGTA TCAAGTCGGA TTTTTTCCTA AGCGGGCCCT CGGTTGCCAA TACCTGCTTT
GACATCTTGT TTAAGATGGG CTGCGACCCG ATTATAATTA TCGGTCAGGA CATGGCGTTT
ACATACGGAA GCATGTATGC AGGTGAGGTG CCGGGAACGG TTGTAGACGG TGCCGGGGAA
GCAAAAAGAA GGGGATATGT TCTTGCAAAA GATATTTACG GAAATGAAGT TTATACCACC
CGGCCATTTC TTGCAATGAG AAACTGGTTT GAAGGATATT TTGAAAAAGT GCGGGACAAA
ACGACAATAA TAAATGCCAC CGAAGGAGGA CTGAATATTT CGTATGCCAG AAATGAAACT
CTTGAGGCCG CACTAAAGAG CTGTAATTTA TCAGAATCAG GCATTAAAGA TCATATCAGG
TCGCTGCATG AGGAAGGAAA ATTTGCGGAT ACGGTAGCTT CAAAAATCGA GGAATACAGA
GCATACGTTC AAAGGGAGAT AAGACGGCTG GAACAGCTTT CCAAAAGGCA GCTTGAGACA
GCAGAGGATT TAAAGAGAGA TGTTTATCAT CCGTCAAAAA GCCGCTCAAG GTTTATTAAG
GCTGTCAATT CAATAAATGA AATGTCGGAC AGGGTTCTTC AATCTCCGAT ATACAATTCC
CTTCTTAAAA ATCTTGTTGA AATTGACTTT TATATTATAA AAGCGGAAGT GGACAGGTTA
GTAAAGATAT TAACAAAATA TGACGATATA AAGAATGTAT ATGTGAATGC CATATTGAGT
CAGAATCAAA AACTCAACGC AAGCCTTGGC AAAATCAAGA AATTCTTTGA CGAATCGGAT
GTTACTGCTT AA
 
Protein sequence
MNEVLAKNLS LLKEYQPETY LKLDRYIKGE YVPKDNSVEK ILLARQDDLI INILVKCSDK 
DFLLCDHENP INEAYAWIDK YIDPSNKADI VFGMGLAFHL EVLLTSFPNK KVIVIEPNIN
LFYQIACIRN LEPVIKKAEI IVDEDLDVIL ERINSLFWDT EKGGIQVQPL EVYGEMFPEM
WDKLRDSFIK LANNFTVDIA TRRKFGELWV HNNIKNLNKI CEASNAGVLV GKFKGIPGIL
VSAGPSLEKN IHLLKGLEDK CVIMAAGTAV RIMEDFGLAP HFMVGIDAGA KEGEIHSNVK
NKDIYFIYSN QVSTYSVDGY KGPKFVMNYP IDMYTAGFFE YAGIKSDFFL SGPSVANTCF
DILFKMGCDP IIIIGQDMAF TYGSMYAGEV PGTVVDGAGE AKRRGYVLAK DIYGNEVYTT
RPFLAMRNWF EGYFEKVRDK TTIINATEGG LNISYARNET LEAALKSCNL SESGIKDHIR
SLHEEGKFAD TVASKIEEYR AYVQREIRRL EQLSKRQLET AEDLKRDVYH PSKSRSRFIK
AVNSINEMSD RVLQSPIYNS LLKNLVEIDF YIIKAEVDRL VKILTKYDDI KNVYVNAILS
QNQKLNASLG KIKKFFDESD VTA