Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1873 |
Symbol | |
ID | 4809204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2224043 |
End bp | 2225386 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107292 |
Product | HMG-I and HMG-Y, DNA-binding |
Protein accession | YP_001038287 |
Protein GI | 125974377 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.701975 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGGGAA GTTCTAGTAC TGATGCTATA GGACCTGGCT CAGTATTCCA AATTGATGCA ACTGTGGCTG ATGTATATTT GGTTTCCCGT TTCAACAGAA CACATATTAT AGGCAGACCT GTTTTATATA TTGTCCAGGA CTGCTTTTCC AAACTTATTG TGGGGCTTTA TGTAGGACTT GAAGGGCCGT CATGGATTGG AGCAGCAATG GCTCTTGCAA ACACTGCTGG AAACAAAGTT TCTTTTTGTA GTCAGTATGG TATAGATATT CAGGAAGAAG AATGGCCTGT ACATCATCTC CCTCAGGCAA TTTTGGCAGA CCGAGGGGAA ATGCTGTCAG ACAATGCAGA GAGTTTGATT ATGAATCTTG GAATTACGGT AAAGAATACC CCGCCTTTCA GGGCTGACTG GAAGCCGTTA GTAGAAAGAT ATTTTAAATT GACCAATGAG CGTACAAAAT CATTACTTCC TGGAGCGGTA AATACAGATT TTATGCAGCG AGGCGGGAGA GATTACAGGC TTGATGCGAA ACTTGATTTA ATGCAATTTA CTGCCATTAT TATAAAATGT GCGTTATTCC ACAACAACCA TTATCGTATT GACAATTACA ACAAAGATGA AATGATGGTG GCAGACGAAG TGGAACCTAT TCCAAGGGAA ATCTGGAACT GGGGTATCGC TAACCGAATG GGCAAACTTC GGCACGTAGA TGAGGAAGTA GTGAAACTTA ACCTGATGCC GTCGGATAAT GGGGTGGTTA CGGCAAAAGG GATAAGGTTT AAAGGGCTGT TCTACAGTTC TAAATCAAGC ATGAAAGAGC AGTGGTTTGT AAAAGCCCGT AGCAGTGGAA GTTGGAAAGT GCCTGTATCC TATGACCCAA GAAACATGAA TTACATATAC ATTAAGAAAT CTGCCACCGA GTTTGAGAAA TGCTACCTGC TGGAATATCA GACGGCATTT AAGGATAAGT ACATTGAAGA AATTGAATAC CTGATGGAGT GGGAAAAGAT GCAAAAGGCT AAAAGTCTTG ATGAGGGATT GCAGGCCAAG GCAGATTTAA TAACAGAAAT AGAAACAATA GTTGAAGGGG CAAAAAGCAA GACAAATAAA GAACTTTCAC TATCAACAGA AAGCGATGCA CAAAGGAAGA AAAACATACG GCAAAACAGG CAGGTTGAAA AGGAGATAAA TCGGGAGATA GAGGCTTTTG AATTGGATAG GCAACCTAAT AATAAGAATG CAGAGATAAT CTCTCTTAAT GAACTGGAAG AAGAGTTACC ATCCAATCCT CTGGATTTGT TGAGAAGAAA ACAGAGGGAG ATGCTTGGGA AGATTAACGA ATAA
|
Protein sequence | MLGSSSTDAI GPGSVFQIDA TVADVYLVSR FNRTHIIGRP VLYIVQDCFS KLIVGLYVGL EGPSWIGAAM ALANTAGNKV SFCSQYGIDI QEEEWPVHHL PQAILADRGE MLSDNAESLI MNLGITVKNT PPFRADWKPL VERYFKLTNE RTKSLLPGAV NTDFMQRGGR DYRLDAKLDL MQFTAIIIKC ALFHNNHYRI DNYNKDEMMV ADEVEPIPRE IWNWGIANRM GKLRHVDEEV VKLNLMPSDN GVVTAKGIRF KGLFYSSKSS MKEQWFVKAR SSGSWKVPVS YDPRNMNYIY IKKSATEFEK CYLLEYQTAF KDKYIEEIEY LMEWEKMQKA KSLDEGLQAK ADLITEIETI VEGAKSKTNK ELSLSTESDA QRKKNIRQNR QVEKEINREI EAFELDRQPN NKNAEIISLN ELEEELPSNP LDLLRRKQRE MLGKINE
|
| |