Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1509 |
Symbol | |
ID | 4810547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1833063 |
End bp | 1834298 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106929 |
Product | hypothetical protein |
Protein accession | YP_001037930 |
Protein GI | 125974020 |
COG category | [S] Function unknown |
COG ID | [COG2461] Uncharacterized conserved protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAAA TAATTAATAA CCGCGAATAC AGGCAGAAAG TCTTGAAAGA GTTGATTAGG GAGCTGCATG ACGGAAAGAG TGTCGAAGAA ATAAAACCAA GGTTTGAAGA ATTAATAAAA GGGATTTCTC CTGCTGAAAT TTCTGAAATG GAACAGGCTT TGATTATGGA AGGCATGCCC GTTGAAGAAA TACAAAGACT TTGCGATGTA CATGCCGCTG TTTTCAAAGG CTCCATTGAA GAAATACACA GACCGCAGAA ACCTGAAGAA GTGCCGGGGC ATCCTATCCA TACATTTAAA CTCGAAAATG CCGAAATAAG GAAACTTGTC GATAATGAAA TCAGACCGCA GTTGGAATTG TATAAAAATG GAGACACAGC TGAGAGTTTA AAGAAATTAA GAGAAGCGTT TCAAAAGCTT TGGGAGATAG ACAAGCATTA TTCAAGAAAA GAAAATCTAT TATTCCCGTA CCTGGAGAAA TACGGCATAA CTGCACCTCC CAAGGTAATG TGGGGTGTGG ATGATGAAAT CAGGGCAGAT ATAAAAGAGA TAAACAACAA GCTTTCAGCG AATGCTCAAA ATCAAAATAT ACCTTTGGAG AAAGCGGAAG AAGCGGTAAA CAGGGTTATT GAAATGATTT TTAAAGAAGA AAATATTCTT TTCCCAATGG CTTTGGAGAC TTTAACCGAG GATGAATGGG CTGAGATTGC CAGGGCGAGC GATGAAATAG GGTATTGCAT GATTACGCCT GAAGCGGAAT GGAAACCTGC CCGGGTGGAT GTTGTGGAAA AAACACAAAA AGAGGGGGTT AAGTCTCAAG AAAACCAGGG CTTTGTTGAA TTTGACGCGG GATGTCTTAC AACGGAAGAA ATAAATGCAA TGCTTAATAC TTTACCCATT GATATTACTT TTGTGGATAA AAACGATACT GTAAAATACT TTACCCAGGG AAAGGAAAGG ATTTTTGCCC GTCCTAAAAC AATTATCGGA AGAAAGGTAC AAAACTGCCA TCCTCCGGCC AGTGTACACA TTGTGGAAAA GATTATAGAG GATTTGAAAT CGGGAAAAAA GGACCATGAG GATTTCTGGA TTAAAATGGG TGAAAAGTAT GTTTATATCA GGTATTTTGC TGTAAGAAAC GAAAAAGGCG AATACCTGGG AACAATAGAA GTGACACAGG ATATTGCTCC TATACAAAAA ATTACAGGTG AGAAACGATT GCTTTCTGAT GCTTAG
|
Protein sequence | MSEIINNREY RQKVLKELIR ELHDGKSVEE IKPRFEELIK GISPAEISEM EQALIMEGMP VEEIQRLCDV HAAVFKGSIE EIHRPQKPEE VPGHPIHTFK LENAEIRKLV DNEIRPQLEL YKNGDTAESL KKLREAFQKL WEIDKHYSRK ENLLFPYLEK YGITAPPKVM WGVDDEIRAD IKEINNKLSA NAQNQNIPLE KAEEAVNRVI EMIFKEENIL FPMALETLTE DEWAEIARAS DEIGYCMITP EAEWKPARVD VVEKTQKEGV KSQENQGFVE FDAGCLTTEE INAMLNTLPI DITFVDKNDT VKYFTQGKER IFARPKTIIG RKVQNCHPPA SVHIVEKIIE DLKSGKKDHE DFWIKMGEKY VYIRYFAVRN EKGEYLGTIE VTQDIAPIQK ITGEKRLLSD A
|
| |