Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1165 |
Symbol | |
ID | 4810833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1389543 |
End bp | 1390754 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106587 |
Product | YbbR-like protein |
Protein accession | YP_001037590 |
Protein GI | 125973680 |
COG category | [S] Function unknown |
COG ID | [COG4856] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0430232 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAGT TACTGAAGAA GGATTTAACT TTAAAAATAA TCTCTGTCTT TTTCGCCATA TTTCTCTGGT TTATTGTTTT GGACAGCTCT AATCCGGTAA CCTGGGTTGA ATTGAATGTG CCTTTGAAAG TTGAAAATGA AAGTTCACTT AAAGAAAAGG GAATAATGCT TAAGAATGAG AACTTTCCGA GAAATGTTTC CGTCAGTCTA AAGGGAAGAA AAAGCGCTTT TAACAATATA GGTTTAAATG ACATTGAGGC AATTGTTGAC CTTTCAAAGG TGGAGGATGT TAATACTCAG TTTTTATATG TCAATGTTTA TACAAATAAA AAAGGTGTGT CTTTTCAGGG AGTAACACCG AGAGTTGTGG AAATAGAACT GGAAAAACTG GGTGAAAATC CTTTTCCTGT TAATGTAGTT ATCACAGGAA AACCGAAGGA AGGCTACACA GTGGTAAAGG CAAATGCAAT ACCGACAACG GTTTCAATTG AAGCGCCGGA CGAAATAATA AATTCCATCG GTGAAGTCAG GGCTTATGTT GATGTTGACA ATCTCAGTAA CGATATTATT GTAAACAAGG AATGTGTGGT TTACAACAAA GAAGGAGAAA AAATAGTTGA GCTGGATAAA AAAATAAGTG TTGACATCAA TATTGAAATC GCGAAAGAAG TGCCTATAGT ACCGGCCGTA AGGGGGAGAC CGGCAAAAAA TTACACCGAC GGCATACACA GGGTTGTGCC GGAAAAGGCG TGGATTTCGG GACCTTCTGA CGTCATTGAC CTTATTGACA ACTTGAAAAC CGAACCTATT GATATTGAAA ATATGTCGCA GAGCATGACC AAAATTGTAA ATCTCGTTCT GCCGGATGGG GTTCGCCTTG TTGACACTCC AAGAAGTGTT TATGTGGATG TGGTTATTGA GGAACTGGCA GAAAGGGAAT TTGTCTTTAA CAAGGAAAGC ATTGCGTTTG ACAATGCAGT AAAAAATAAT TCACTTAAGT ATGAAATTTT GGATGATGAG ATAAAAATAA CTTTGACCGG TACCAGACAG GAGTTGAACA AGATTTCGCC TGAGAGTCTC AAGCTTAGCG TTGATGTAGG CGGGCTTTCG GAAGGGGAGT ATAAGAGGCC CCTTAACGTG GTTATCCCTG ATACTGTGAA TCTTTCCGGA AGCTATGATG TTAAAATCAG TGTGAAAAAA ACCGGAAGTT AA
|
Protein sequence | MNELLKKDLT LKIISVFFAI FLWFIVLDSS NPVTWVELNV PLKVENESSL KEKGIMLKNE NFPRNVSVSL KGRKSAFNNI GLNDIEAIVD LSKVEDVNTQ FLYVNVYTNK KGVSFQGVTP RVVEIELEKL GENPFPVNVV ITGKPKEGYT VVKANAIPTT VSIEAPDEII NSIGEVRAYV DVDNLSNDII VNKECVVYNK EGEKIVELDK KISVDINIEI AKEVPIVPAV RGRPAKNYTD GIHRVVPEKA WISGPSDVID LIDNLKTEPI DIENMSQSMT KIVNLVLPDG VRLVDTPRSV YVDVVIEELA EREFVFNKES IAFDNAVKNN SLKYEILDDE IKITLTGTRQ ELNKISPESL KLSVDVGGLS EGEYKRPLNV VIPDTVNLSG SYDVKISVKK TGS
|
| |