Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1138 |
Symbol | |
ID | 4810806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1351046 |
End bp | 1352731 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640106560 |
Product | hypothetical protein |
Protein accession | YP_001037563 |
Protein GI | 125973653 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATAAAT ACATAGGCAA ACTTATCGGC AATACGGGCA ATCCTAATGA TTTAAAGATT GCTCTCGAAA ACAGTTTTTC TGCTAAAAGA GGAGAGTTTG TAAAAATCAA GCATAGAGAA TCGGAAGAAG ATGGAGATAC ATATGTTCTA GGGAGAATTG TATCCATATC CAGAAGCAAT ATTCTATATA ACTCCAATAT GGGTGAGGGG CTGTCATCTC TTGAGATACT ACCAGGTGCC CAAGTTACTG GGGAGACTTT ATTTGGGACC ATAGAACTGG TGGGGTACAG GGATAACTAT GGACAGATAA AAATACCCAG GCGACCCCTT AACCCAGGGG AAAAGGTATA TGGTGTTGAC TATGAGTTTT TGTCAAAGTT TTATAAGTTT GATGAGAATA CAAGCATTAA CATAGGTAAT TTGATTGGAT ACGACAAGGG AAGTAATATA GTCCCTGTTT ATCTCGATGT AAACAAACTG GTTACAGAAC ATTTGGCTGT GCTGGCAATG ACAGGTTCAG GGAAATCATA TACTGTGGGC AGAATTATTG AGAGACTTGT AGCAGAAATG AATGGAACAG TAGTAGTTTT TGACGTTCAT GGAGAATATG GAAAAGCATT TGAAAAGGGA GAGATACATT TTAATAATAA TCTTGATTTT ATTGAGGATG AGAGGGAAAA GAAGAGTATT CAAAGGATTC AGGAAAATTT AATAAAAATG CAGAATGCAG GTGGTGGAAT AAAAGTTTAT ACTCCCCAGA TTGATTCCTT TGATTATAAG TATAGTGGAA AAAACCATCA CTTAGCCCTG CAATTCGATA GATTCGACAT GGATGATTTA TCTTCCATTC TTCCCGGTTT GACAGAAGCC CAGGAAAGAG TATTGGATGT TGCAATCAGG TATTGGAAAG CGAAATATAA TCATCCACCA AGAGATATTC AGGATTTAAC ATATCTACTT TCTGATGAAC AGGGGCTTGA GGAACTAAAG AATTGGGACA ATTTAACTGA AGGTGAAGCC AAAGCACTCA ATAATAGAAG TGCAGCAGTG GCTTCTATGA AATTAACCCG AGTAATAAAT GAAGCAAAAA GTTTTTACAC AAGGGCTATA GGTGAGCCTA CAGATATTTA TGATATGATT GGCGAAAAGG GAAATAGCGT GGGAAGGCTT GTAATAATAG ACTTACAAGG CCTATCCGAT GATGCTAAAC AAATTATAAC AGCATTGATA TCCAGTGAAA TTATGAGGGC AGCATCAGAT AAAAAAAGGC AAATAAGACC ATGTTTCCTT GTTTATGAAG AAGGACACAA TTTTGCACCG GCAGGCATTC CGAGCATTTC TAAGAAAATT ATTAAGAAGA TTGCGGCGGA AGGAAGAAAG TTTGGTGTTG GTTTTGCGAT TATTTCACAA AGACCGTCAA AACTTGACCC AGATGTAACC TCACAGTGCA ATACAATTAT TACAATGCGG TTAAAGAATC CAGATGACCA GCGGTTTATA GCAAAAACGT CAGATATGTT TTCATCATCT GATATTGAAG AATTGCCATC TTTATCAACG GGAGAAGCAT TGATAAATGG CAGGTCAATT CCTGCACCAC TGTTAGTAAA AGTTGGAACA AAGGCCTTAA TACATGGTGG AGAGTCTCCT GAAGTAATCA AGGAATGGGG CGTATTCAAT GGATAA
|
Protein sequence | MDKYIGKLIG NTGNPNDLKI ALENSFSAKR GEFVKIKHRE SEEDGDTYVL GRIVSISRSN ILYNSNMGEG LSSLEILPGA QVTGETLFGT IELVGYRDNY GQIKIPRRPL NPGEKVYGVD YEFLSKFYKF DENTSINIGN LIGYDKGSNI VPVYLDVNKL VTEHLAVLAM TGSGKSYTVG RIIERLVAEM NGTVVVFDVH GEYGKAFEKG EIHFNNNLDF IEDEREKKSI QRIQENLIKM QNAGGGIKVY TPQIDSFDYK YSGKNHHLAL QFDRFDMDDL SSILPGLTEA QERVLDVAIR YWKAKYNHPP RDIQDLTYLL SDEQGLEELK NWDNLTEGEA KALNNRSAAV ASMKLTRVIN EAKSFYTRAI GEPTDIYDMI GEKGNSVGRL VIIDLQGLSD DAKQIITALI SSEIMRAASD KKRQIRPCFL VYEEGHNFAP AGIPSISKKI IKKIAAEGRK FGVGFAIISQ RPSKLDPDVT SQCNTIITMR LKNPDDQRFI AKTSDMFSSS DIEELPSLST GEALINGRSI PAPLLVKVGT KALIHGGESP EVIKEWGVFN G
|
| |