Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1196 |
Symbol | |
ID | 4810148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1425450 |
End bp | 1426688 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640106618 |
Product | hypothetical protein |
Protein accession | YP_001037621 |
Protein GI | 125973711 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000988852 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGATC CGTTATGCAG TGAAAGTTAT TTATTAGAAA CAATAGAATA TGACAAAGAA GGAATTTGTA AAAGTAAAAA AAAGATTGTT ATTCTGAAAG ATGATATGGA AAAAGGTATA CAAAGATATC CAAGGGATAA TCAAAGCATA ATTTATGCTA CGTTTTTACA TATGTTTATG TATAACACGG AAATGCTTAC AGCCAAATAC TCTTTAGGTA GTCATCCGGA TGAAATGATT GAAGATTATT TAAACGGTAT AGAGTATTTG GAAAATGTCG GTGAAGAAAA AGTATGGTAT ATTGACCTTT TGTGGATGCT ATCGTTAGGT ATACTTTTAG AGGTAGACAA ACAGGATTTA AAAAGGCTTG CTTGTGTGAT AGAGAAGCAA AAAAAAGAAG ACGCACTGAT GGATTTTCTT TTAAAAGCTT GTGATATAGG ATGGAATCAT AATACAAGTG AATATGAGAG AAAAAATCCA TATGCAAAGA CGGCTGAAAT TATACAAATG GCATTGCATG ATAAAGACAG GGAAAAAGCT TCGAAAAGGC TACAACAATA TATAGAGAAA GAATGGATTA AGGGACATAA TGATCTGGAC TTCAAAAATG CGCATAAAGA ACCCGGCTAC GTTGGCTTGT GGAGTTTTGA GGCTGCAGCA TTGGCAAAGA TACTGGGATT GGACGACAGC GCACTGAAAG ATAACAACCA TTACCCTTAT GATTTGGCGC ATTATAAAAA TGGAATGAGT TTTGATTTAA GCTGGTATGG TGTGCCAGTT GAAGAGGAAG CCAAGGAAGA AGAGGCAATA GTATATGGAA TACCGAATAA CCCGGAGTTG GAGCAAATAA TACCTGCAAA GTTCCACAGT TTTGTGAATG AAGTGATAGG AGACTACAAT ACATTGAGCG ACGAAGAGTT TTGGAAGAAG TATAATTTGA GAGAAATCTG GTTTGATGTT AAGGAGTACG AGGAAGATAA TAAAGCCAAA AATATGTTGG GAACGATTAT AGTGTTTTTG CTTGTAGAGA AGGAGTATAT TTTGCAGTTG GATTATAAGG AAGATTTGGT AGATTACATA GAAGATATAG ATAATTATTG GGGCAAAGAG GAAGTAAAGT TGATAAGCTT TGAAGTGGAC AATGACCAGC AGTATTATGC ATACGTACCG AAAACCGCAG CAATAGATTC ATTGTACGAG GTAAAATTGA CAGAAGTGGA GAAGATAGAG GAAGTTTAG
|
Protein sequence | MRDPLCSESY LLETIEYDKE GICKSKKKIV ILKDDMEKGI QRYPRDNQSI IYATFLHMFM YNTEMLTAKY SLGSHPDEMI EDYLNGIEYL ENVGEEKVWY IDLLWMLSLG ILLEVDKQDL KRLACVIEKQ KKEDALMDFL LKACDIGWNH NTSEYERKNP YAKTAEIIQM ALHDKDREKA SKRLQQYIEK EWIKGHNDLD FKNAHKEPGY VGLWSFEAAA LAKILGLDDS ALKDNNHYPY DLAHYKNGMS FDLSWYGVPV EEEAKEEEAI VYGIPNNPEL EQIIPAKFHS FVNEVIGDYN TLSDEEFWKK YNLREIWFDV KEYEEDNKAK NMLGTIIVFL LVEKEYILQL DYKEDLVDYI EDIDNYWGKE EVKLISFEVD NDQQYYAYVP KTAAIDSLYE VKLTEVEKIE EV
|
| |