Gene Cthe_0668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0668 
Symbol 
ID4810285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp823636 
End bp824745 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content35% 
IMG OID640106084 
Productspore germination protein 
Protein accessionYP_001037096 
Protein GI125973186 
COG category 
COG ID 
TIGRFAM ID[TIGR00912] spore germination protein (amino acid permease) 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAGGTA AGATAATTTT TGGAAAGAGA GAAGCAATAT CACTTCTGAT AATACTTATA 
TGCAATCAGT TAATTTTAGG ATTTCCAAGT ATTATGTCGA ATAGTGTGGG AAGTGCAGGA
TGGATTTTGT CAATCTATGT ATCCATACTT GCATTATGCC TTTTTCTGAT AATATCAAAA
CTTTATTCCG CTTTTGAAGG AAAAGATTTA TTGGATATAA GTGAATTCGC CGGCGGAAAT
ATTGCAAGAA TTATCGTCGG CTTGATAGTT GTAATAGATT CGGTTCTTAT AATTTCAGTC
AAATTAAGAG AGTATACCGA ACATATAAAA ATAATAAGCT TTACCCAATC TCCTGTCAGT
TTTATAATGC TGTTTTTTGC TTTAGGAATG ATTATCAGTG TCCATTTTGG CATAGAACCT
TTGGTAAGAA GTACGACAAT TGTTCTTCCG ATTGTGGCAA TCGGAGTTGT AATAGTCGTT
GCAGGTTCTG TCAAAAATTT CGAACTTTCA AATATAATGC CGATTCTTGG CACAGGGCCT
TATGATATTT TTGTAGGAGG CCTGCCAAGA TTGTCAATAT TTTCAGGGAT TATTTCGCTT
TTTTTTATAC CTCCTTTCAT GGGAGGTTAC AAAAATATAA AAAAAATCGG CGTGTTGGTA
ATTACCATAT CCGGCATAGT TTTAACCGTG GGAGTCCTTG CTTATTTGCT TGTATTCCCA
TACCCTGTTT CTTCAAATAA TGTTCTTGCC TTTTTTGAAC TGTCAAGAGT TTTGGAATAT
GGCAGATTTT TTCAAAGGAT TGAGTCAGTT TTTCTTCTTA CGTGGTCATT GGCAGGCCTG
TTGTATCTTA GCTCGGGATT GTATTTTGTA ATATATGTAT TTTCAAAAAC CTTCAAGCTT
AAATACTACA GACCGCTTAT AATTCCTTTT ACCTTGATAA TATTTTCTTT AAGCCTCATA
CCTGAAAGTC TGATGGAGAT AATGTATCTT GACAACAAAG TAATCAGGTA TTATGCCTGG
ATAGTTGCTT TTGGTTTGCC GTTTGTCCTT TTGTCAATTG CGAGGCTTGT TAAAAGAAAA
AGGAGGGGTA TGGCAAAAAA TGGGAAGTAA
 
Protein sequence
MEGKIIFGKR EAISLLIILI CNQLILGFPS IMSNSVGSAG WILSIYVSIL ALCLFLIISK 
LYSAFEGKDL LDISEFAGGN IARIIVGLIV VIDSVLIISV KLREYTEHIK IISFTQSPVS
FIMLFFALGM IISVHFGIEP LVRSTTIVLP IVAIGVVIVV AGSVKNFELS NIMPILGTGP
YDIFVGGLPR LSIFSGIISL FFIPPFMGGY KNIKKIGVLV ITISGIVLTV GVLAYLLVFP
YPVSSNNVLA FFELSRVLEY GRFFQRIESV FLLTWSLAGL LYLSSGLYFV IYVFSKTFKL
KYYRPLIIPF TLIIFSLSLI PESLMEIMYL DNKVIRYYAW IVAFGLPFVL LSIARLVKRK
RRGMAKNGK