Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1182 |
Symbol | |
ID | 4810134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1410808 |
End bp | 1412076 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106604 |
Product | hypothetical protein |
Protein accession | YP_001037607 |
Protein GI | 125973697 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.114028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTATA TAGTATATCT TTTTATGCTG GTATCCATGT TTTTATATAC ATACATATTT GGCGATGAAA CAAGTATGTT GATGCTCTAC ATGCTGATTC TTTCCCCTGT TTTGTCTTTG CTTCTGTCTT ATGCATCGCT TAAAAGTCTT GAATTTTCAA TTGATGAGAA GGTTCATGCC TCACAGGTTG AAAAAGACGG TGTTGTGGGA GTAACGGTAT TACTTCAAAA CAAATCTTTT GTGCCGATAC CGATTATTGA TATATCGTTT GCTGTTCCGC AAAACTTGAT TCCTCTGGAC AATCCCAGGC CTATTGTGTC TTTGGGACCG TATAAAACTC AGATAATCCA TTTGCAGTAC AAAGCAAAGT ACCGTGGAGT GGCGGAAATT GGAGTCAGGG ATATTAAAAT AAGAGACTTT CTGGGGTTTT TTAACTTTTC TTTGCTAAAG AAACAGAATA AAGTGGAGAG TACCAGAGAA ATAACGGTGT TAAACAAGAT TTCCAGGCTT AAGATGAACA GTGTTTTACT GCTTGAATCA ATTCTGGCTG CCAATGAAGA AACAGGCGCC GCTACGAACG ATTTTAATTT TTTAAGCTGC TTGAATGGAG AGCCGGGGTA TGAATTTCGT GAATATCAGC CCGGAGATCC CCTTCACAAG ATTCACTGGA AACTTTCGGC AAAAACAGAC GTGTTTATGG TGAGAAAAGA TGAAGGACGG GGTATTCCTA GAAAAAAGCT GGTACTTGAT CCTGTTGCCG TAAAGGGTCC AAAATCAAAA GCCGGAAGTG TCGTTGAAAT AGAGGATAAA ATTTTAGATG CCCTTATATC AGTTGTTGAC ATGTTGGTTA GAGCGGGAAG AGATGTGGAA GTGTGGCTTT TGGAACATGG AGAATGGATG AGCCATTTAG TCAAGGACAG GGATGAGATT GTAGAAATGC AGCACAGACT TGCATCATAC AAATTTCTGC ATTCAAGAGA CGAACTTGAA AATGAACGTC TTCCTGTGTC CACCATTACA TTGCAGGACA GTAGCGGCAG GATTTTTGCC GGAGGAGATG CCATGATTTT TACGGCTTCC CTTGACAAGG AACTTTCTGA AATAATAGAG GGAATGCAGG AGTTGAAAAT GACGGTGGAT TTGGTTGCAA TTAAGAATGA AAGAGATGTT GAAAAGAGCG AAGGTTTCGA AAAGAGAGAG CACAAAAGCA CCAAAATGAA CCTGTGGACG ATAGGACTGA CGGACGATAT TTCTGAAGTT TTGGCATAG
|
Protein sequence | MSYIVYLFML VSMFLYTYIF GDETSMLMLY MLILSPVLSL LLSYASLKSL EFSIDEKVHA SQVEKDGVVG VTVLLQNKSF VPIPIIDISF AVPQNLIPLD NPRPIVSLGP YKTQIIHLQY KAKYRGVAEI GVRDIKIRDF LGFFNFSLLK KQNKVESTRE ITVLNKISRL KMNSVLLLES ILAANEETGA ATNDFNFLSC LNGEPGYEFR EYQPGDPLHK IHWKLSAKTD VFMVRKDEGR GIPRKKLVLD PVAVKGPKSK AGSVVEIEDK ILDALISVVD MLVRAGRDVE VWLLEHGEWM SHLVKDRDEI VEMQHRLASY KFLHSRDELE NERLPVSTIT LQDSSGRIFA GGDAMIFTAS LDKELSEIIE GMQELKMTVD LVAIKNERDV EKSEGFEKRE HKSTKMNLWT IGLTDDISEV LA
|
| |