Gene Cthe_2063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2063 
Symbol 
ID4810661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2454455 
End bp2455576 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content38% 
IMG OID640107470 
Productspore germination protein 
Protein accessionYP_001038463 
Protein GI125974553 
COG category 
COG ID 
TIGRFAM ID[TIGR00912] spore germination protein (amino acid permease) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.722464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATG ACAAGCTTCT GCCGGTACAA ATAATGCCAA TAGTCTCTTC CACCATGATG 
GGTGTAAGTA TACTTACCAT ACAGCGAAGT TTATCCTCCA TTGCAAAGGG AGATGCCTGG
ATTTCAATGA TACTTGGTGT AATACTTGGA GTATTTTCCG CAATTTTTTT GTATAATCTG
CTAAGGCTCA ATCCCGGCCT GGACTTGGCG GAAATAATAG TTTGTCAGGC CGGCAACTGG
GTCGGACGGC TTTTTCTTTT GTCCACTACA ATTTATATTC TCATTGACAT CGGACTGTCA
CTGAAAGTTT TCTCCTTCGC TTTGAAAAAT TTTCTACTGG ATTATACTCC CATATCCGTA
GTGTCTTTTT TACTGATAAT AGTTATCGTG TCTGTGGTGG TAAAGGGAAT TACCGTAATT
GCAGGTGTCA CCGACATACT CTACCCCTTT TTCGTCACAA GCCTTGTTGT CCTCATTGCC
ATGTCCACCG TAGAATTTCA GAAAGCAAAT ATCATGCCGA TAATTTACGG CAACATTCAA
AACACTTTCA AAGGCAGTCT GCCCGCTTTT GGTGCAATCT CCGGCTATGG TGCTTCTTCA
TATGTAATGA AATATGTAAC TGAACCCAAA AAAGCATTTA AATGGTTTTT TATGGGTTTT
GGAATTTCTT CAATTTTATA TATACTTCTC ACTCTTGCAA CAACCCTGGT TTTTGTCCCG
GAATTCCTGC AAAAACTTAC ATTTCCCACT TTGTTTCTGT CCAATGCAAT AGAATTTGGA
ACAGGTTTCT TTGAAGGTTT CTTTGAAAGA CTTGAGGCTT TCATGGTGCT AATCTGGATA
CCTGCAGTGT TTACATCCGT CGGAGTTTAC ACTTTTGCAT CCGTAAGAAA TTTTTCGGTA
CTTTTTAATA TAAAACCTAA ATTTCAAAAA TATGTGGCTT ATGCTCACAT ACCTTTACTG
TTTGCCATTA CTCATTATAT TAAAAGTCAA ATTGTGGCTA CAAATCTCAT GGATTTGTTT
GATTCACTTT CAATTGTATT AGGTTTCGGT CTTACGCCTT TATTGCTCGT ACTTACTTTA
ATAAACAGAA GGAGAAGGGC GAAAAATGAG GTTAAAAAAT AA
 
Protein sequence
MENDKLLPVQ IMPIVSSTMM GVSILTIQRS LSSIAKGDAW ISMILGVILG VFSAIFLYNL 
LRLNPGLDLA EIIVCQAGNW VGRLFLLSTT IYILIDIGLS LKVFSFALKN FLLDYTPISV
VSFLLIIVIV SVVVKGITVI AGVTDILYPF FVTSLVVLIA MSTVEFQKAN IMPIIYGNIQ
NTFKGSLPAF GAISGYGASS YVMKYVTEPK KAFKWFFMGF GISSILYILL TLATTLVFVP
EFLQKLTFPT LFLSNAIEFG TGFFEGFFER LEAFMVLIWI PAVFTSVGVY TFASVRNFSV
LFNIKPKFQK YVAYAHIPLL FAITHYIKSQ IVATNLMDLF DSLSIVLGFG LTPLLLVLTL
INRRRRAKNE VKK