Gene Cthe_2610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2610 
Symbol 
ID4809032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3080278 
End bp3082032 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content40% 
IMG OID640108024 
Producthypothetical protein 
Protein accessionYP_001039003 
Protein GI125975093 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000187127 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGGTG CAAGGCTTTT TGACATTAAA ACTTTCAAAC AAATAACGGA ACACAGTATA 
TATATGGACG GTACAATTAA AGATAAGATA AAAAATAATT TGGAATCACT TCTTAACGTA
CGGATTATAG AAACAGACTT TGACTCCGGG GAAAATGAAA AAGTGGATGT TATCGCCTTA
GGCAGTGATA ACAATCCTTT GATTGTGTTG TTTAAGACAG CACAAAACGA AAGTGTGGCG
GCAAGAGCCG TGTTTTATCT TGACTGGCTT GTAAACAACA AAAATTTGTT TAACAAGCTC
GTCCAAAAAA CTTTCCCAGG TATGAATGTT GACGGCATAG ACTGGCAAAA GGCAGGTGTC
TGCTGTGTGG GAAGTGACTT TTCCAAGTAT GACCGCTACA TGCTGTCGCA TGTCGGGCGC
AATATTGAAT TGATTCGATG TAAGAAATAT GACAGGGATC TTATGGTTAT TGAAACAGTT
TACAAACCAA CAGGCATAAG TTTTGGAAAG CCTGTGTCTT TATCTTTGAC AGCCACTTCA
AATGCTGAAC AAAACAACAA AACAAAAAAA TATATAAAGG TAAGTCAGAC TCCAAGAATT
GGTTATTGCC AGATAGAAGA CGGAAAACAT TATGTGGTGT TCCCCGGAAA GGAAAAGGGC
GAGATACTTG ACTTGCCTGT TTCCATATAT CTTGCCCAAA ACCAGTTTGT TCTGGTTGAC
GAATACAACC GGTTCCAGTA TGCTTTCTCC TATTGGCTGA ATGACAATGA AATCTTAAGT
TCAAATATTG CAAGTTTTGC CGTGGTTGTG CTGAAAGATT CGGAGATTTT TATCGACAAA
GGCGACAATG TATTGCTTAA GCTCAACAAT ATTCCCCCAA ATGTTCAGCT TAGGGATAAA
AGCATTGTGT CTGTGGACAG TAACAACAAC TTTTTAAGAT TCTACAAGCC GGTAAAACAC
AACGCCGACA GCTTTATGAT GTCGGCAAAA GCCAAGGGGC ATACTCTGGC CTTTGTGCTC
AAAATTCTGG ATAACGGGGT TTTGCTTCGG GATATTGAAA CTGGCAGGGA ATTTTTTAAA
GAAATGGATA CAGACGGCAT AACTTTTAAG GAGCAGCAAA TCCTGTGTCT TCATGAAGGA
AACGTGGTAC ATACTCTGAC GTCGTGCAAA TTTTACACTT TGTCTTCGTA CTATGACAAG
TTTGAATACG GCACAGTGGA AATAAAGGAC GGACTCGCTT TTCTTAAAAA ACTGTCCGGT
GAGATTGTGA TAATAAATGA CGCGCCGGAC TACTTAAAGC CCGGTCAGGT GGCCTATGTG
GATGAGAATA ACAATTTCTG CGGCATTGAA GATGACGGGG AAGCTCAGGA GACTGACACC
GTAAAGAGAA ATGTTTCCAA TATCAGCACT TTTAAGAGAG TAAGCAAGAA GGAAAGAATC
GAGGTAACCA AGCAGGTTTT GATTCTCGGA AATAAAGCTT ATGAAAATTC CTACAAATTG
TGCCTTTTAA AATTCGGGTA CAAGGCAGAA GTGCTGGAAG GATTCGAACC ATGGGCAAAA
ATCAGTAATG TGCTTAGGGA CACGGATGTG GTAGTGGTGG TAACTTCGCA TATATCCCAT
GACAACATGT GGAGAGTAAA AAAGGAAATA ACGGATATTC CTGTTATCTA TTCAGAATTT
GACGGAGCAA ACAGAATATT GGAGAAGGTG ATTGCAGCGG AGAACAACTG GAAAGAAGTG
CGTACGGCAA GGTAA
 
Protein sequence
MEGARLFDIK TFKQITEHSI YMDGTIKDKI KNNLESLLNV RIIETDFDSG ENEKVDVIAL 
GSDNNPLIVL FKTAQNESVA ARAVFYLDWL VNNKNLFNKL VQKTFPGMNV DGIDWQKAGV
CCVGSDFSKY DRYMLSHVGR NIELIRCKKY DRDLMVIETV YKPTGISFGK PVSLSLTATS
NAEQNNKTKK YIKVSQTPRI GYCQIEDGKH YVVFPGKEKG EILDLPVSIY LAQNQFVLVD
EYNRFQYAFS YWLNDNEILS SNIASFAVVV LKDSEIFIDK GDNVLLKLNN IPPNVQLRDK
SIVSVDSNNN FLRFYKPVKH NADSFMMSAK AKGHTLAFVL KILDNGVLLR DIETGREFFK
EMDTDGITFK EQQILCLHEG NVVHTLTSCK FYTLSSYYDK FEYGTVEIKD GLAFLKKLSG
EIVIINDAPD YLKPGQVAYV DENNNFCGIE DDGEAQETDT VKRNVSNIST FKRVSKKERI
EVTKQVLILG NKAYENSYKL CLLKFGYKAE VLEGFEPWAK ISNVLRDTDV VVVVTSHISH
DNMWRVKKEI TDIPVIYSEF DGANRILEKV IAAENNWKEV RTAR