Gene Cthe_2877 details


Gene Information

Locus tag: Cthe_2877
Symbol:
ID: 4809157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3401397
End bp: 3403154
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 38%
IMG OID: 640108296
Product: S-layer-like domain-containing protein
Protein accession: YP_001039268
Protein GI: 125975358
COG category:
COG ID:
TIGRFAM ID:
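
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship in Python. It assumes the coordinates are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and exist only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


# Values copied from the record for Cthe_2877.
length_bp = gene_length(3401397, 3403154)
print(length_bp)                  # 1758
print(protein_length(length_bp))  # 585
```

Both results agree with the Gene Length (1758 bp) and Protein Length (585 aa) fields.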


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGCGAG TACTGTCAAT TATTTTGGTT CTCGGCATTA TGATGCTAAG TACAGCTGCG 
GCGGCAGCCA CAGGATACGA CACGGTTTTT TCAGACATTT CGGGGCATTG GGCAAAGGAC
ACAATTGAAA GAATGGCAAA CTTGGGAATA GTCAAAGGTG TTGGCAACGG TATGTTTTTG
CCGGATAGGG AGATAAAACG GTCCGAATTT ATTGTGGCTT TGCACAAAGC AGCGGAAATT
AAAATTAACT ACTTCAAGGC TCCGGACATC AATGAATTTT TCGATGATGT AAAAAACGAA
GACTGGTATG CTTCGACTCT GTATGATCTG GCATCACTAA ACATAGTTGA CGACAGGGAG
AAGTTTAGGC CCAATGACCT CATAACCCGT GAGGAAATGG TGCATTATCT TGTAAATACA
TATAAATACA AGCTTAATAT TGTTTTGGAT CAAATTGATG AGAAAGACAA GTGTTTTGAT
GATGAGGAGA GTATTCAGAA GCAGTATAAA GAATCTGTGA AGTATGCATT TAAACTGGGA
TTTGTCAGGG GAAGGAGCAA TGGGAAATTT GTCCCCAAAG GATACAGCAC AAGGGCTGAA
GCCATGATAG TTCTGGAAAA ATTAATGCAG GCTTTGAAGG AAAATATAAA GGCCGAGGTT
GAAGTTATTC CTTCGTTTGA AAAACATGAA GACGGATATA AAATGGCGCT CACAATTAAA
AACAACAGCA AAAAAGATGT TGTAATCCAA CACTTTTCCG GGCAAAAGTA TGATTTTGTA
CTGCTTGATG ATAAAAAAGA AGAACTGTAC AGGTGGTCTG GAGACAGAGC ATTTGTTGAG
ATTTTGACAA GTACAGAGAT TCCGGCGGGA AAGACAGTGG AATTTTCGGA AATTCTTGAT
GCAAAGACTT ACGATGAAAT CAGCGGCAAG GCATATTATT TTAAAGCCGT GATAGTAGGA
AGCAGTGAGG ATTTTGAAAT AAATGAGGAT GGATATTATT TAAGCCTCAA AGAAGAAAAA
GATAATAAGC TTGAGATAGT ACCAAGCTAT AAAAAAGGCG AAAAGACCTT TACAATGAAG
CTTTCAATAA AGAATACATC GAAAAAACCA ATAACAATCA ATCACACATC TGGGCAAAAG
TTTGACTTTA AATTGCTCGA TGAAAATAAA GAAATAATTT ACACCTGGTC TGCTGACAAG
ATATTTATAA TGATGGAAAC TCAAACGGTA ATAGATCCGG GAAAGACAGT GGAGTTTGCC
GATGAGCTGG ATATGGAAAG CTTCGGGGAT ATTGTCAAAA AGGCAAGGTA TTTGAAGGCA
TATATTGTGG GAGCAAGTGA GGATTGTGAA ATAGAAGAAG ACGGATACGA GGTAGAGATA
ACAGAGAGTA AGGAAAACAG TCTTGTAGTT GTGCCGGAAT ATGAAAAGAG CCAGAATACT
TTCACGATGA AGCTCAAGCT CAAAAATACA TCCGACGGGG ATATAACTAT TAACCACTCA
TCAGGACAAA AGTTTGATTT CAAACTGTTG GACAAAAATA AAGAAATTCT CTATACCTGG
TCCGCCGACA AGGGATTTAT AGGTGTACTG ACCGAGACGG TGATAGACGC CGGAAAGACA
GTGGAATTCG AAGAAAAACT TGATATGGAA AACTATAAAG ATGTTATTGG AAAAGCAAAA
TATTTGAAGG CATATATTGT AGGTACAAGT GAAGATTGTG ACATAGAAAA AGACGGATAC
GAGATAGAGA TAAAATAA
 
Protein sequence
MKRVLSIILV LGIMMLSTAA AAATGYDTVF SDISGHWAKD TIERMANLGI VKGVGNGMFL 
PDREIKRSEF IVALHKAAEI KINYFKAPDI NEFFDDVKNE DWYASTLYDL ASLNIVDDRE
KFRPNDLITR EEMVHYLVNT YKYKLNIVLD QIDEKDKCFD DEESIQKQYK ESVKYAFKLG
FVRGRSNGKF VPKGYSTRAE AMIVLEKLMQ ALKENIKAEV EVIPSFEKHE DGYKMALTIK
NNSKKDVVIQ HFSGQKYDFV LLDDKKEELY RWSGDRAFVE ILTSTEIPAG KTVEFSEILD
AKTYDEISGK AYYFKAVIVG SSEDFEINED GYYLSLKEEK DNKLEIVPSY KKGEKTFTMK
LSIKNTSKKP ITINHTSGQK FDFKLLDENK EIIYTWSADK IFIMMETQTV IDPGKTVEFA
DELDMESFGD IVKKARYLKA YIVGASEDCE IEEDGYEVEI TESKENSLVV VPEYEKSQNT
FTMKLKLKNT SDGDITINHS SGQKFDFKLL DKNKEILYTW SADKGFIGVL TETVIDAGKT
VEFEEKLDME NYKDVIGKAK YLKAYIVGTS EDCDIEKDGY EIEIK
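
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal, and plant plastid code), GTG can serve as an alternative initiation codon and is then read as methionine. The following is a minimal Python sketch of that behavior, not the pipeline used to produce this record. It assumes that table 11 shares its amino acid assignments with the standard code, and `START_CODONS` lists only three common bacterial start codons as an illustrative subset. The helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard amino acid assignments, with codons ordered by base T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: an illustrative subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized start codon as methionine."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the gene sequence above.
first_line = "GTGAAGCGAG TACTGTCAAT TATTTTGGTT CTCGGCATTA TGATGCTAAG TACAGCTGCG"
print(translate_cds(first_line))  # MKRVLSIILVLGIMMLSTAA
```

The output matches the first 20 residues of the protein sequence listed above.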