Gene Cthe_2495 details

Gene Information

Locus tag: Cthe_2495
Symbol:
ID: 4809433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2963546
End bp: 2964928
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 51%
IMG OID: 640107910
Product: hypothetical protein
Protein accession: YP_001038890
Protein GI: 125974980
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID:
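
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2963546, 2964928)
print(length)                     # 1383
print(protein_length_aa(length))  # 460
```

Both results agree with the Gene Length and Protein Length fields of this record.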


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCCTTTTC AGAAAAAAGT CTGGCAGTTT AACGACATAA TCACAGAAGG CGAATTGAAT 
CGCATGGAAC AAGGTATTGA AGATTCTATA ACTGCCGCGA ATCAAGCTGA AGTAAATGCA
AAGGCTTATA CTGACCAAGA AGTAGGTGAA GTTGCCCAAG AACTTGCTGC ACATAAGGCG
GAAAGTACGC AGAACGCTCA TTTGGCGAAA AACATCGGGA TTGAAGACGC TGCGGGTAAC
TTCACAGCGA CCGACGTGGA AGGGGCACTG GCCGAGCTTT TTACGTCTGT CAGTAATGGT
AAGACTCTTA TCGCTGGGGC CATTACTGAC AAAGGAGTGC CGACCAATCC CAGCGATACA
TTCCAGCAAA TGGCAACAAA TATTCAAGCA ATTCCTGTTG GAGATTATGC TGTAGGGGGT
ACAATCCGTG ATTCTGTCTT GCGTTTTTTG CCGGGCGGTA TGGGTGTAGA AATCTGGTCG
AAGACGGACG TGGCGAGAGG GCAGGGCATC GCCGTAGACA GTGCAGGAAA CGTATATGTC
GCTCACTCTG TGGGCAGCGG CGGAAAAGCC GTACGAAAGT TGGATTCAGC AGGAAACGAA
ATCTGGTCGA AGACGGACGT GGCGTATGGG CAGGGCATCG CCGTAGACAG TGTAGGAAAC
GTATATGTCA CTCATTTTGT GAGCAGCAGC GAAAAAGCCG TACGGAAGCT GGACCCGAAC
GGAAACGAGA TCTGGTCGAA GACGGACGTG GCGTATGGGT GGGGCATTGC CGTAGACAGT
GCAGGAAACG TATATGTCGC TCACTCTGTG GGCAGCGGCG GAAAAGCCGT ACGAAAGTTG
GATTCAGCAG GAAACGAAAT CTGGTCGAAG ACGGACGTGG CGAATGGGCG GTACATCGCC
GTAGACAGTG CAGGAAACGT ATATGTCGCT CACAATGTGA GCAGCGGAAA AACCGTACGA
AAGTTGGATT CAGCAGGAAA CGAAATCTGG TCGAAGACGG ACGTGGCGTA TGGGTGGGGC
ATTGCCGTAG ACAGTGCAGG AAACGTATAT GTCGCTCACA ATGTGAGCAG CGGAAAAACC
GTACGAAAGT TGGATTCAGC AGGAAACGAA ATCTGGTCGA AGACGGACGT GGCGTATGGG
CAGGGCATCG CCGTAGACAG TGTAGGAAAC GTATATGTCA CTCATTTTGT GAGCAGCAGC
GAAAAAGCCG TACGGAAGCT GGACCCGAAC GGAAACGAGA TCTGGTCGAA GACGGACGTG
GCGAGAGGGC AGGGCATCGC CGTAGACAGT GTAGGAAACG TATATGTCAC TCACGATGTG
AGCAGCGGCG AAAAAGCCGT ACGAAAGCTG GATGGGAACA GATATTTTCA AATAGTGGGG
TGA
 
Protein sequence
MPFQKKVWQF NDIITEGELN RMEQGIEDSI TAANQAEVNA KAYTDQEVGE VAQELAAHKA 
ESTQNAHLAK NIGIEDAAGN FTATDVEGAL AELFTSVSNG KTLIAGAITD KGVPTNPSDT
FQQMATNIQA IPVGDYAVGG TIRDSVLRFL PGGMGVEIWS KTDVARGQGI AVDSAGNVYV
AHSVGSGGKA VRKLDSAGNE IWSKTDVAYG QGIAVDSVGN VYVTHFVSSS EKAVRKLDPN
GNEIWSKTDV AYGWGIAVDS AGNVYVAHSV GSGGKAVRKL DSAGNEIWSK TDVANGRYIA
VDSAGNVYVA HNVSSGKTVR KLDSAGNEIW SKTDVAYGWG IAVDSAGNVY VAHNVSSGKT
VRKLDSAGNE IWSKTDVAYG QGIAVDSVGN VYVTHFVSSS EKAVRKLDPN GNEIWSKTDV
ARGQGIAVDS VGNVYVTHDV SSGEKAVRKL DGNRYFQIVG
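
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of such a translation in plain Python. The names `CODON_TABLE`, `START_CODONS_TABLE_11`, and `translate_cds` are hypothetical, and the code assumes an unambiguous DNA sequence (only A, C, G, T) on the coding strand.

```python
# Standard codon assignments in TCAG order; table 11 uses the same
# assignments as the standard code and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a table 11 start codon in position 1 as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        if index == 0 and codon in START_CODONS_TABLE_11:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCCTTTTC AGAAAAAAGT CTGGCAGTTT"))  # MPFQKKVWQF
```

The printed residues match the first ten residues of the protein sequence listed above. For real work, an established library is a safer choice than a hand-written table.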