Gene Cthe_2151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2151 
Symbol 
ID4811199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2557230 
End bp2559425 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content37% 
IMG OID640107555 
Producthypothetical protein 
Protein accessionYP_001038547 
Protein GI125974637 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000594133 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATGGA CCACATATAT AAGTGCCTAT TTGTTCAGAA CGACCGGCGA ACCAATGTTG 
TATGGCAATA TGACTTCCTT CCTGATTTTT TCATTTATTA TTATCTTTTT CATTGCAAAA
GACGAAACGG ATATTCCAGC TTTATTTAAA TCGTTAAAAT CTGTAGTTTC AAAGATAAAA
GACAATTCGT TTTTGAAAAG CAACGTTATT GAAATCGCTT ATGTTTTGGC TTTTTTGAGC
ATATGGACAT TTTTGGTGAT TGGGTCTTTT AACATAAAAG ACGGTGTAAT GAGAATTGGA
TGGTCAGTCT CAAGTGATTT TTCAACCCAT CTTGCGGTGA TTAGGTCTTT TTCCTATGGT
TCTAACTTTC CCTCGGGATA CCCTCACTTT GCAGACGGTA ATATGAGATA TCATTTTATG
TTCATGTTTT TGGCGGGAAA TCTGGAGTTT TTAGGCTTAA GACTTGACTG GGCGTTTAAT
CTGCCGTCAA TTTTGTCGAT CGTTTCATTT TTGATGCTGC TTTATTCATT TGCCGTGCTG
CTGCTGGGCG AAAAAATTAT CGGAGTTGTG ACGGGAATAT TGTTCTTTTT CAGAAGTTCC
TTTGCTTTCT TTACTTTTAT CACGGGAAAA CCTTCGATAA AAGAAGCTTT GAAAGAATTA
AAAAATTTAA AAGAGCATAT TGGTAATACG TTGAATGAAG AATGGGGACT TTATGCGCAG
AAGGTTTATG TAAATCAGCG GCATTTGCCT TTTGTATTGG GAATTATGAT GCTTGTTCTG
ATAGTGATTC TTCCTTTGTT TATTAAGATG ATGACAAGCA TTGGCGAGCT TTATCAGGAA
AGAGTGAAAA AATCCGATGA AGAACAAATG CCCGATGAAA ACAAAGACTC TGAAAGTTTT
TGGAAATCAT ATGTTAAAGA ATTTATTTTC AGTAAAAACG CGTGGATTCC GGAAAGTATA
TTTACATCTG TGGTGCTGGG GGTTATTCTT GGGCTTTCAA GTTTTTGGAA TGGAGCGGTG
GTTATAGCGG CGCTTTTAGT GCTTTTTGTT ATGGCTGTTT TCTCGAAACA CAGACTTCAG
TATCTGATTA TGGCGTCAAT TACTGTCGTG TTGGCTTATT GCCAGACTCA ATTTTTTGTA
AAAAGCGGCG GTTCTGTGGT TTCACCCAGC ATATATATTG GTTTTCTGGC AAACACCAAC
GGGCTGGAAC AGGATTTGTC CAGATATTTT GCGAATAATG GTTTGTGGGC TACACTCGAG
CATTTCTTTA AACTGATTCC TTACGTTACG GCTTTTTATA TTGAACTTCT CGGACTTCTT
CCTTTTATTG TGGCCATAAA TTTGCTCTCA AGAAACAGCA AATACAAATA TCATATAAGC
TTGTTGTTTA TAGCTGTGAC ACAAGTTATT GGCTACTTCT TCATAAAGTA CAAACTGGAC
AGTGACGACA AAATTATAAA CAAGCAGCTT TTTACTGGTT CAATGCTGTT TGCTTTGCTG
TTGGTGGTGT TTGCATGCAT GGCATATATG TTTTACGAAA ATTCTCCCGT GCCCCGGGGG
ACAAGGGCGT TAATACTGGC ATTTAGCACT CCGATTATCT TTGCAAGCTC TGTCAAGCTG
ACATTAGGCG TTGATATAAA CCATAAATAT GTGATTATTG GTTCCATACT TGTAAATATT
TTTGTGGCAT CTTTTATATT CTTCCTTTTC AAAGTAAAAA AACCTTTTGC CACCCTTGTG
GCTGTACTTG TTTCCGTGAT GATTACAATT ACAGGTTTTG CGGATCTGAA AGTGCTTTAC
AATCTTAACA ATTCTTATGT TACAATCAAT TGTGATGATC CTTTGTTGGT GAAAGTGAAA
AATGATACCG GCAAAGATGA AATTTTCCTG ACGGATAATT ACCATCTTCA TCCTCTGCTT
CTGTCAGGTA GAAAGATATT CTGCGGTTGG CCGTATTTTG TGGCGTCTGC AGGTTACGAC
TGGGATCTTC GGAATGACAT AAGGAGAAGA ATTCTTACTG CCACTGACCA AAACACACTT
AAAAAACTTG TGGAGGAAAA CAATATCAGT TACATAGTTA TTGACAACGG GCTGAGAAAT
TCTCAAGGGT TTACCGTGAA TGAAGAGCTT ATCAGAAATA CTTTCAGTGT TTTCTACGAT
GACGGAGTCG ACGTTGTTAT TTACAAAACA CATTAG
 
Protein sequence
MTWTTYISAY LFRTTGEPML YGNMTSFLIF SFIIIFFIAK DETDIPALFK SLKSVVSKIK 
DNSFLKSNVI EIAYVLAFLS IWTFLVIGSF NIKDGVMRIG WSVSSDFSTH LAVIRSFSYG
SNFPSGYPHF ADGNMRYHFM FMFLAGNLEF LGLRLDWAFN LPSILSIVSF LMLLYSFAVL
LLGEKIIGVV TGILFFFRSS FAFFTFITGK PSIKEALKEL KNLKEHIGNT LNEEWGLYAQ
KVYVNQRHLP FVLGIMMLVL IVILPLFIKM MTSIGELYQE RVKKSDEEQM PDENKDSESF
WKSYVKEFIF SKNAWIPESI FTSVVLGVIL GLSSFWNGAV VIAALLVLFV MAVFSKHRLQ
YLIMASITVV LAYCQTQFFV KSGGSVVSPS IYIGFLANTN GLEQDLSRYF ANNGLWATLE
HFFKLIPYVT AFYIELLGLL PFIVAINLLS RNSKYKYHIS LLFIAVTQVI GYFFIKYKLD
SDDKIINKQL FTGSMLFALL LVVFACMAYM FYENSPVPRG TRALILAFST PIIFASSVKL
TLGVDINHKY VIIGSILVNI FVASFIFFLF KVKKPFATLV AVLVSVMITI TGFADLKVLY
NLNNSYVTIN CDDPLLVKVK NDTGKDEIFL TDNYHLHPLL LSGRKIFCGW PYFVASAGYD
WDLRNDIRRR ILTATDQNTL KKLVEENNIS YIVIDNGLRN SQGFTVNEEL IRNTFSVFYD
DGVDVVIYKT H