Gene Cthe_2429 details


Gene Information

Locus tag: Cthe_2429
Symbol:
ID: 4808145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2901970
End bp: 2903040
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 38%
IMG OID: 640107843
Product: hypothetical protein
Protein accession: YP_001038824
Protein GI: 125974914
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID: [TIGR02872] sporulation integral membrane protein YtvI
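
The coordinate and length fields above are mutually consistent, and a minimal sketch can check this. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that does not contribute a residue. The function names `gene_length` and `protein_length` are illustrative only and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(2901970, 2903040)
print(length_bp)                  # 1071
print(protein_length(length_bp))  # 356
```

Both results match the Gene Length and Protein Length fields of the record.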


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTTT TATTACGGAA AAAGCAGAAA AAGGCAATAA TCAGCCTCAT CCTTGCATTT 
ACATTACTGT TTGCTATTTA TATAATCATG AACTATTTCC TGGCTCTCAT TCTGCCTTTT
TTAATTGCCG TAATAATCTC TTCCGTCAAC GAACCTGTTA TCAGTTACAT GGAAACAAAG
CTAAGGCTTA ACAGAAAAAT TGCCTCAGTT ATATCCATCA TTATGACGGT AAGCATAATT
ATCATCTTAA TTTCTTTATG CATATTTAAA GTATACTATG AACTGGTAAA GCTTAACTCC
AATCTCCCCT ACTACATGGA ATCTTTTTCA GCCACCGCAT CGGCTTGTTA CGACCGCATG
AGCGTTTTTT ATTATCATCT TCCGAAGGGT TTGGCCGATA TTTTGGAAAA CAACTTTAAA
TCATTGCTAC CGAAACTTGA GACCATAACC GGCAAAATTG CAGAATCAAT TATAAGCAGC
ATTGCATCAA TACCCAAAGC TGCGGTCTTT ACCGCTGTAA CTCTCCTGTC TTCCTATTTT
ATAAGCAGTG ACCGAAAAAA GATCAGAAAC TTTATTTACA GGCAGCTTCC CGTAAATCTA
AAACAAGGTT TTATCGGGAT TAAAAGCGAT GCCATTTCAA CAATAGCCGG ATACATAAAG
GCACAGCTCA TCCTTATGTC CATCACCTTC ATTGAAACAA CTTTGGGTCT TATCGTCATA
AAATGTGAAT ATGCAGTGCT AATCGGGTTT ATCGCCGCAA TTGCCGATGC ACTGCCCATC
GTGGGAACGG GCATTGTTCT GTTTCCGCTT ATTGGCTGGA ACATTATTAC AGGAAACATT
CAAATAGCTT TAGGCATAAC CGCCGTATAC CTTTTAGGAG TGATTTTAAG GCAAATAATC
GAGCCGAAAA TAGTTTCAAG CCAGACAGGA ATTCATCCCT TTGCCACTCT TGTATCCATG
TATTTGGGAA TGACACTTTT TGGTTTTCCG GGACTTTTCA TAGGCCCCAT ATTTGTAACA
ATTTTAAAAA GCCTCCACAA GTCCGGCCTT ATAAGCGTAT GGGATGATTA A
 
Protein sequence
MNFLLRKKQK KAIISLILAF TLLFAIYIIM NYFLALILPF LIAVIISSVN EPVISYMETK 
LRLNRKIASV ISIIMTVSII IILISLCIFK VYYELVKLNS NLPYYMESFS ATASACYDRM
SVFYYHLPKG LADILENNFK SLLPKLETIT GKIAESIISS IASIPKAAVF TAVTLLSSYF
ISSDRKKIRN FIYRQLPVNL KQGFIGIKSD AISTIAGYIK AQLILMSITF IETTLGLIVI
KCEYAVLIGF IAAIADALPI VGTGIVLFPL IGWNIITGNI QIALGITAVY LLGVILRQII
EPKIVSSQTG IHPFATLVSM YLGMTLFGFP GLFIGPIFVT ILKSLHKSGL ISVWDD
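
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below illustrates this without external libraries. It builds a codon table by hand from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in their permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are illustrative assumptions, not an API of the source database. Only the first 60 bases of the gene are used here to keep the example short.

```python
# Codon table in T, C, A, G order for each codon position.
# Amino acid assignments are those of the standard code, which
# translation table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGAATTTTT TATTACGGAA AAAGCAGAAA AAGGCAATAA TCAGCCTCAT CCTTGCATTT"
print(translate(first_row))  # MNFLLRKKQKKAIISLILAF
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field of the record.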