Gene Cthe_1340 details


Gene Information

Locus tag: Cthe_1340
Symbol:
ID: 4809480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1630913
End bp: 1632040
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 40%
IMG OID: 640106764
Product: hypothetical protein
Protein accession: YP_001037765
Protein GI: 125973855
COG category:
COG ID:
TIGRFAM ID:
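
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1630913
end_bp = 1632040
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, expected_protein_aa)  # prints: 1128 375
```

Under these assumptions the computed span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.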


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGGTAA TTCATCTTGT AATCGCTGAC AAGGATAGGG CTTATCTTGA CAGTTTGGTT 
GACTTCATTT ATTCAAAATA TAACAACAGA TTTTATGTTC AGGCTTTTTC CAACGAGGAT
ACTTTTAACG ACTTTTTTAA TAAAACGGAC AAAATTGACA TACTTCTTAT AAGTCCGGAC
TTTTACAGTG ATGAGCTTGA TTTGGAGAAG GTAGTTGCAC CCATTGTGTT GTCGGCCGGA
ATTCTCACGA AAGATATAAA AAACTGTGAG ATAATCAGTA AATATCAAAT GGGCGACAAG
CTTGTCGGCA ATATATTAAA TATTTTTTCC GAGAAAAGCA ATTGCGAGTT TATAACCGGT
GACGGAAAAA AGAAGACTCG TTTTGTCACT TTTTATTCTC CATGCGGAGG GGCGGGTACA
TCCACCTTGG CCGCAGGTGT GAGCGTCAAA TGTGTACAGA GCGGATTGAA CGCTTTTTAT
CTTAATTTCG AAAAAATTGC CGCTACTACC GCTTATTTTG ATGCCCATGG CAGTGGAGAA
AATCTTTCGA ATGTTTTGTT TTTCCTCAAG GAGAATAATA AAAACCTGGC GCTTAAAATA
GAAGGAAGCA GATCCATAGA CAGCACAACG GGAGTTCATT ATTTTTTACC CCCGGAGAAC
GTTTTTGACC TTGATGAGTT GACATCCGAT GAGATAAAGA GGCTTATAGG ACAGTTTAAG
GCGATGGAGA GCTATGATGT GGTTATAGCT GATACAGGTT CGGAGTTAAA CAATGTCAGT
ATATCGCTTC TGGAAAGCAG TGATTTGGTG TTCTGTGTTT TGCCTTGTGA TACTACGGCA
AAGATTAAGC TGGCAACACT CCATAAAGCC TTTGATATTC TTAACAAGAG AAAAGGCTTG
AACTTTGAGG ACAAGATGGA GCTTATACTG AACAAATGCC TGAACTTGGG ATCTTCTGAT
GTTGAAAGTC TTACTTTGAA CGGAAAACCT GCTTCTGTCA GGATACCTTA CATAAAAGGA
CTGGATGCAA GCTATGGCAT AGAGCACCTG ACAGAAGATT CCAACCCTCT TGGACAGGCT
GTAAGGCAAA TAATTAGCAT ATTGCAGGGA AGTACGGGTG GTTGCTGA
 
Protein sequence
MAVIHLVIAD KDRAYLDSLV DFIYSKYNNR FYVQAFSNED TFNDFFNKTD KIDILLISPD 
FYSDELDLEK VVAPIVLSAG ILTKDIKNCE IISKYQMGDK LVGNILNIFS EKSNCEFITG
DGKKKTRFVT FYSPCGGAGT STLAAGVSVK CVQSGLNAFY LNFEKIAATT AYFDAHGSGE
NLSNVLFFLK ENNKNLALKI EGSRSIDSTT GVHYFLPPEN VFDLDELTSD EIKRLIGQFK
AMESYDVVIA DTGSELNNVS ISLLESSDLV FCVLPCDTTA KIKLATLHKA FDILNKRKGL
NFEDKMELIL NKCLNLGSSD VESLTLNGKP ASVRIPYIKG LDASYGIEHL TEDSNPLGQA
VRQIISILQG STGGC