Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1274 |
Symbol | |
ID | 4809779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1550424 |
End bp | 1551731 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106697 |
Product | nucleoside recognition |
Protein accession | YP_001037699 |
Protein GI | 125973789 |
COG category | [S] Function unknown |
COG ID | [COG3314] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02871] sporulation integral membrane protein YlbJ |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.174951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCTTT ACAGGCTGAC AATAATCATA CTTATCGCAT TGGTATTGGC CATTAATATA AAATCCATAA AAACCATAAA GGTTATTTAC CTTAAATCCC TCGTGCTTCC TTTAGTGTGC ATAACTTTCA TCCTTATGCT CATTATTTTT TCCGACACCG CGGTAAAATC CGCCGGCAGC GGGCTTAACC TGTGGTTTAA TGTTGTATTT CCTTCCCTCT TCCCCTTTTT TGTTGCATCC GAAATCCTTT ACAGGACAGG GTTTATTAAA GCCATAGGAA TACTTTTGGA ACCCATAATG CGTCCTCTTT TCAATGTGCC CGGCTGCGGC TCCTTTGCTT TTGCCATGGG AATAACCAGC GGTTATCCCG TCGGTGCCAA AATCACCGCA AGCATGAGGG AAGAAAAACT CCTTAGCAAA ACAGAATCCG AAAGGCTTTT GTCTTTCACC AACAACTCAG GCCCCCTCTT TATTATCGGC GCCGTTGCCG TAGGCATGTT CAAAATGCCT GAGCTTGGAC TTCTGCTTTT AGCCTGTCAC ATCCTTGCAA GCATCACCGT GGGAATTCTT TTTCGCTTCT ATGGCAGAAA CAATAAGAAA ATCAAGATGA AAGACGACAA AAATCTCTGG AGAAGATTTA AAAAAGAATT GATTTATACC TGCAAACAAG AATTAAACCC CGGAACAATG CTGGGAGAAG CCATAAGAAA CTCCGTTAAC GTGCTGCTTT CCATTGGAGG ATTCATTACT CTTTTTTCAG TTATTATTAA TATTCTGATT GAAATCGGGT TTATATCCTG CCTGGCGTCT TTTATTTCGC CGTTTCTGTC ACCCTTTGGA ATAAGCAGAG AAATAGTTTT GGCGGTATTA AGTGGTTTTT TTGAAATGAC AACAGGAACA AACATGGCAA GCAAAGCGGC AAACGCAACC CTCCAGGGAC AACTTGCAGC GGTGAGCCTG TTGCTCGGCT GGGCTGGCCT TTCTGTGCAT TTTCAGGTTT ACAGCATTAT AAGCCACACC GATATAAGCA TAAAGCCTTA TTTATTTGGT AAAATGCTTC AGGGAGTGTT TGCAGCAATT TATATATCAA TAGCAATGAA ATTACCGTTT ACGGCTTCTT TGACAGCAAA AAGCGTTCTT AGTGTTATAA CACCTTTTTC AGACTTTACA TGGTACAATG CCTTCATATA TTCGGCTCAG AATGTGTTTA TTTCATTTTT GATCCTTTTG ATTTTGACGG CAATATCACT TATATTTCAT TTTATAAAAC ACGTATGCAA GACTCTTTTG AAACGTTCCG TATTTTAA
|
Protein sequence | MNLYRLTIII LIALVLAINI KSIKTIKVIY LKSLVLPLVC ITFILMLIIF SDTAVKSAGS GLNLWFNVVF PSLFPFFVAS EILYRTGFIK AIGILLEPIM RPLFNVPGCG SFAFAMGITS GYPVGAKITA SMREEKLLSK TESERLLSFT NNSGPLFIIG AVAVGMFKMP ELGLLLLACH ILASITVGIL FRFYGRNNKK IKMKDDKNLW RRFKKELIYT CKQELNPGTM LGEAIRNSVN VLLSIGGFIT LFSVIINILI EIGFISCLAS FISPFLSPFG ISREIVLAVL SGFFEMTTGT NMASKAANAT LQGQLAAVSL LLGWAGLSVH FQVYSIISHT DISIKPYLFG KMLQGVFAAI YISIAMKLPF TASLTAKSVL SVITPFSDFT WYNAFIYSAQ NVFISFLILL ILTAISLIFH FIKHVCKTLL KRSVF
|
| |