Gene Cthe_2235 details


Gene Information

Locus tag: Cthe_2235
Symbol:
ID: 4809973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2662160
End bp: 2664106
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 36%
IMG OID: 640107641
Product: hypothetical protein
Protein accession: YP_001038630
Protein GI: 125974720
COG category: [S] Function unknown
COG ID: [COG2604] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
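
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2662160, 2664106)
print(length, protein_length(length))  # prints "1947 648"
```

Under these assumptions the result agrees with the reported gene length of 1947 bp and protein length of 648 aa.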


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCGGCCT TTTTTAAAAC ATTTGACAGG CAGGAGGATT TTATGGTTGA CGTATTCAGA 
CTTAATATTG ATGCTTTAAA GGAAAATTAT CTTCCTTTGG CGCGGTTTTT CGAAAATTTG
AATGAAAATA CCGGAAATGA AAGTAGTGTT ATCCTTGAGA CCTCAAAAAG CGGGATGCCG
AATTTCAGGG TAAAAAAAGG AGAACATACT TTTTTTGTTC ATAGCCCGTA TGATCCCAGA
ACTGAAGCGA TAAGATGGGC AGAGAAAATT GATTTAAAAG GTTTTGATAC AATTGCGGTT
TTGGGTATTG GATGTGGATA TCATATAGAA GAGCTTGAAA AAAAGTATCC TGATAAAAAC
AAAATTGTGA TTGAACCGGA CAGAAATGTG TTTTTAAAAC TTCTTAACAC AAGGGACATT
ACACATTTAA TATCCAGTAA AAACATATTA TTTATAATCA GTGACAATAC CGAGGAAATC
GCAAAGGTTT TCTTGTTGCT CAGAGAAGAA GGAGAAATAG ACAGTGTGGA GTTTAATGAA
TTGTTAAGTT ACAGAAAAGT TTATGAGGAT TGGTGGTTGG AATTAAAAAA GGAATATATA
AAGTTTGCAA GACTTCATCA GATAAATACC AATACGAGTG TTTTTTTTGC GGAAGCATGG
TTAACTAATT TATTTGAAGG AATGTGGCAG TTGACAAAGA GTGTGCATAT CAAGGAATAC
AAATCTGCTT TTGCCAATAT TCCGGCAATT GTTGTTTCTG CAGGCCCGGC ATTGAATAAA
AATGTACACC TTTTAAAAGA ACTGTACAAT AAGGCGGTTA TTATTTCTGC CGGTTCCGCC
CTTAACATTT TGGAAAGCAG GGGCATTACA CCCCATATTA TGGTTGGGGT GGACGGCGGA
GAAGCGGAAA GCCGAATATT TAACAATGTT AAATCCAATG AAATATATTT TGCCTATTCT
CTTTCGGTTC ATTATGACGG ATTGAAAAAT TACAGCGGTC CCAAAATATA TTTCAAGACC
AATGTGCTGG GATATGGTGA CTGGATTGAC GAAAAGTTGG GCATTGAAGG TGCGGAAAAC
CTTCGCTCAG GTTCTTCCGT GTCAAATCTG TCTTTGGACA TTGCGAGGTA TATGGGATGC
AATCCGATTA TTGTCATAGG CCAGAATTTG TCCTTTCCAA ACCTGGAATC TTATGCTGAC
GGTGCTGTGT TAAAACAGGA ACAGGACCGG CATATCCAAC AGTGCGTGGA AAATTCAAAT
AAGTATTATG TGCTTGAAAA AGATATTGAT GGTAATGATG TATATACTAC TCACAGCATG
CTTTCCATAA GATTTTATTT TGAAGAGTAT ATCAAGAACC ACCCGGACAG GTTGTATTTG
AACGGCTCTG AAGAAGGACT GCCTATAAAA GGGATGAAAA ACATGCCCTT GAAGGAGATT
GTAGAAAAGT ACTGCACAAA GGAATATGAC ATTAAAGGAA TTTTGGATAA AAAATTCAAA
GAAGAATTTG AGGCAGAAAA CGTAAAAGCA AAAGAAATTA AAATTAGAAG AATTTTGGAA
GATATTCACA AAGAAAGCAC TGAAATAAGA CAAAAAGCTA TAAAGAGAAT AGATTTAATT
CTCGATATAT TAAGTAATAT CAGAGGCAGC CATAATGACA AGTGGGAGGA AATTGACAGG
TTGACAGATG AAATTGAAAG CAGTGATTTG TACAAATATT TCGTTGAACC TTTGAGCAAA
TACTTTATTC AGGCTGTTAA GAATGAAAGG GAAAGAAAGA TGGAAAGCAT TCCTGATATA
CAAGAGAGAT TGAAATATCT GTATGAGGGA TTGCTTATAC AGTATGTTGA GGTAAAGGAC
AAAATTGTCC TGATAGACGA TTTGTCCCAA AAAATAATAG AGAATATTGA TAAAAAGGAG
GCAACAAAAT GTCTAAGTAT GGTTTGA
 
Protein sequence
MPAFFKTFDR QEDFMVDVFR LNIDALKENY LPLARFFENL NENTGNESSV ILETSKSGMP 
NFRVKKGEHT FFVHSPYDPR TEAIRWAEKI DLKGFDTIAV LGIGCGYHIE ELEKKYPDKN
KIVIEPDRNV FLKLLNTRDI THLISSKNIL FIISDNTEEI AKVFLLLREE GEIDSVEFNE
LLSYRKVYED WWLELKKEYI KFARLHQINT NTSVFFAEAW LTNLFEGMWQ LTKSVHIKEY
KSAFANIPAI VVSAGPALNK NVHLLKELYN KAVIISAGSA LNILESRGIT PHIMVGVDGG
EAESRIFNNV KSNEIYFAYS LSVHYDGLKN YSGPKIYFKT NVLGYGDWID EKLGIEGAEN
LRSGSSVSNL SLDIARYMGC NPIIVIGQNL SFPNLESYAD GAVLKQEQDR HIQQCVENSN
KYYVLEKDID GNDVYTTHSM LSIRFYFEEY IKNHPDRLYL NGSEEGLPIK GMKNMPLKEI
VEKYCTKEYD IKGILDKKFK EEFEAENVKA KEIKIRRILE DIHKESTEIR QKAIKRIDLI
LDILSNIRGS HNDKWEEIDR LTDEIESSDL YKYFVEPLSK YFIQAVKNER ERKMESIPDI
QERLKYLYEG LLIQYVEVKD KIVLIDDLSQ KIIENIDKKE ATKCLSMV
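
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the permitted initiation codons and is read as methionine at the start of a CDS. The sketch below translates the first 60 nucleotides of the gene sequence to show this. The function name and the hand-built codon table are illustrative; this is not necessarily how the database derived its protein sequence.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; these assignments are shared
# by NCBI translation tables 1 and 11.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))

# Initiation codons allowed by translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(cds: str) -> str:
    """Translate a CDS fragment, reading a valid start codon as M."""
    cds = cds.replace(" ", "").upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in TABLE11_STARTS:
            protein.append("M")  # initiator is read as methionine
        else:
            protein.append(CODON_TABLE[codon])
    return "".join(protein)


# First 60 nt of the gene sequence, copied from the record.
first_60 = ("TTGCCGGCCT TTTTTAAAAC ATTTGACAGG "
            "CAGGAGGATT TTATGGTTGA CGTATTCAGA")
print(translate_table11(first_60))  # prints "MPAFFKTFDRQEDFMVDVFR"
```

The output matches the first 20 residues of the protein sequence listed above.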