Gene Information |
Locus tag | Cthe_2063 |
Symbol | |
ID | 4810661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2454455 |
End bp | 2455576 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107470 |
Product | spore germination protein |
Protein accession | YP_001038463 |
Protein GI | 125974553 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00912] spore germination protein (amino acid permease) |
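The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the gene length includes one terminal stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2454455, 2455576)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1122 bp) and Protein Length (373 aa) fields.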
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.722464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAATG ACAAGCTTCT GCCGGTACAA ATAATGCCAA TAGTCTCTTC CACCATGATG GGTGTAAGTA TACTTACCAT ACAGCGAAGT TTATCCTCCA TTGCAAAGGG AGATGCCTGG ATTTCAATGA TACTTGGTGT AATACTTGGA GTATTTTCCG CAATTTTTTT GTATAATCTG CTAAGGCTCA ATCCCGGCCT GGACTTGGCG GAAATAATAG TTTGTCAGGC CGGCAACTGG GTCGGACGGC TTTTTCTTTT GTCCACTACA ATTTATATTC TCATTGACAT CGGACTGTCA CTGAAAGTTT TCTCCTTCGC TTTGAAAAAT TTTCTACTGG ATTATACTCC CATATCCGTA GTGTCTTTTT TACTGATAAT AGTTATCGTG TCTGTGGTGG TAAAGGGAAT TACCGTAATT GCAGGTGTCA CCGACATACT CTACCCCTTT TTCGTCACAA GCCTTGTTGT CCTCATTGCC ATGTCCACCG TAGAATTTCA GAAAGCAAAT ATCATGCCGA TAATTTACGG CAACATTCAA AACACTTTCA AAGGCAGTCT GCCCGCTTTT GGTGCAATCT CCGGCTATGG TGCTTCTTCA TATGTAATGA AATATGTAAC TGAACCCAAA AAAGCATTTA AATGGTTTTT TATGGGTTTT GGAATTTCTT CAATTTTATA TATACTTCTC ACTCTTGCAA CAACCCTGGT TTTTGTCCCG GAATTCCTGC AAAAACTTAC ATTTCCCACT TTGTTTCTGT CCAATGCAAT AGAATTTGGA ACAGGTTTCT TTGAAGGTTT CTTTGAAAGA CTTGAGGCTT TCATGGTGCT AATCTGGATA CCTGCAGTGT TTACATCCGT CGGAGTTTAC ACTTTTGCAT CCGTAAGAAA TTTTTCGGTA CTTTTTAATA TAAAACCTAA ATTTCAAAAA TATGTGGCTT ATGCTCACAT ACCTTTACTG TTTGCCATTA CTCATTATAT TAAAAGTCAA ATTGTGGCTA CAAATCTCAT GGATTTGTTT GATTCACTTT CAATTGTATT AGGTTTCGGT CTTACGCCTT TATTGCTCGT ACTTACTTTA ATAAACAGAA GGAGAAGGGC GAAAAATGAG GTTAAAAAAT AA
|
Protein sequence | MENDKLLPVQ IMPIVSSTMM GVSILTIQRS LSSIAKGDAW ISMILGVILG VFSAIFLYNL LRLNPGLDLA EIIVCQAGNW VGRLFLLSTT IYILIDIGLS LKVFSFALKN FLLDYTPISV VSFLLIIVIV SVVVKGITVI AGVTDILYPF FVTSLVVLIA MSTVEFQKAN IMPIIYGNIQ NTFKGSLPAF GAISGYGASS YVMKYVTEPK KAFKWFFMGF GISSILYILL TLATTLVFVP EFLQKLTFPT LFLSNAIEFG TGFFEGFFER LEAFMVLIWI PAVFTSVGVY TFASVRNFSV LFNIKPKFQK YVAYAHIPLL FAITHYIKSQ IVATNLMDLF DSLSIVLGFG LTPLLLVLTL INRRRRAKNE VKK
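The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch in plain Python (no external libraries; the helper names are hypothetical), the block spacing can be stripped before further use, and the GC percentage of the cleaned gene sequence can be compared against the GC content field.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the display spacing between blocks and normalize to uppercase."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, used here only as a short demo;
# paste the full Gene sequence field to check the reported GC content.
first_blocks = "ATGGAAAATG ACAAGCTTCT GCCGGTACAA"
print(len(clean_sequence(first_blocks)), round(gc_percent(first_blocks), 1))
```

The record reports GC content as a whole-number percentage, so a value computed from the full sequence should be rounded before comparison.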