Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3123 |
Symbol | |
ID | 4809686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3686834 |
End bp | 3690736 |
Gene Length | 3903 bp |
Protein Length | 1300 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640108556 |
Product | von Willebrand factor, type A |
Protein accession | YP_001039511 |
Protein GI | 125975601 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGATTTTTT ATTATGATGA GGAAAATGAC ACTATTAAGT TTTTAGAAAC GGAAGTGGAT GAGGAAACAA ATACGATAAA AACAACGGTT GATCATTTTA GTATTTATGG TGTTATAGAT ATAGTAAGGT TTGCACAGAG TTGGGGTATA AAAGACATTT TGGATAAACT TTTAAACCCT GGTGGAGAAA CCATACCGCC TGTTGCTGAA ATAGGACAGG CAGATATAGT TTTTGTTATT GATACGACCG GTTCGATGGG CTCAGTAATT AATAATGTGA AAAACAATAT TACAAACTTT GCCAACACAT TAATGGAAAA TAATGTAGAT GTGAGATTGG GGTTAATAGA TTACAAGGAT TTAGAAGAAG ATGGCATGGA TTCTACTAAA AACCTTGGAT GGTTTGACAA TGTAAGTGAT TTTATTGCAA GTGTAAACAA TATGAGGGCA ACCGGAGGAG GAGATGCTCC TGAATCTACC GTGGATGCAT TGGAAGAAGC AAGAAGAATG GATTTCAGGC CCGGTGTAAA TAAATTTATT ATGCTTTTTA CTGACGTGTC ATATAAAGAA TCTACAAGAT TTGAAGATGT TCAGTCAATG AAGACAGTGA TAGAAAAGTT AAAAGAAGAT AAAATTGTCG TATCAGCAAT TGTACCTTCC GGTTATGAGT CATTGTATAG GAATTTGTAT ACTGAAACAG GCGGTGTTTA CGCAAATATA ACACAAGCTT TTTCTTCCGC GCTCCAATCG CTTATTAGCA ACATAGCCAG TGTAACCCAT GGTGACGGTG TCTGGATAAG GTTATCCAAC GGAATGTTAA AAAAACTGGA AGGAGTTTCG AGCAAAGAAG AATTGCTTGA GATGGCATCG GGAGGATACC TTTCTGTCGG CGATTTGTTT TTTGAAGAAG TTACGGTAGA TTTTTTCGGA TTGAAAGTAA AAACTTTCAG GGAAGCTTTT AACAAAAATG CCGTGGATGT CAGAATAACT CCCACAAGTG GATATTCAGG GGGTATGTAT ACAGTTGATG CGGTAGCTAC AATTGTAACA AAACAGCAGG TGGATAAAGC GGTATTGTTA TTCAGGCTGT TAAACGGAAA CTGGATGCAA TATAATGCCG GTCTTGATTC GAAATTTATT ATGAGTAAGT ATAATTTTAA ATTGCCTGAA GAATATCCGG AACGCTTGAA TGATTTGTTT ATTGGATACA GAACCAGTTT AAATATTCTG GATTCGGGAT TAAGGGAGTT TAAAGTTGTT TTGTTTGACA AAGAAGGCAG GATAATTGTT GAGTCAGCAG TAAATAAAGT AAAGGTGGAA AGTTCAATCA AGTGGACAGG GGATTTATCA CCGCTGGAAA CAGTAATCGG CACCAAAAAC TTTTCCATAG CTCAAAATGC CAATCTTCTG GCCGGTGAAA AAATTGCTTT GTTTTTACGT AAAGATGGCG AACAGTTTCC GGTAATGGTT GATTCGTCCG AACAAAAATA CAAACGCGAG CAAAATAACT ATAATTTCAG TTTCGTATCA AATGATTGGG ATGCTGTAAA TAAGAGGCCT AAATATACTA ATGGTGTATA TTGGATATCT GTACATATAA CGGATGGGAG TGGCAAATCT CTTATTAAGT CTGATGAGAG AAAGATAAAA ATAAATAACA TTTGGATTTC TGAATACTAC AATAGCGGAC AAAACAAGGA TATTTATGAT ATTAAAAAGA GATTGAATGA ATTGGGATAT TTTGGCCAGA ATAATTTGCA GTTGACTGTC AATACAGTAT TTGATTATAA TACTTCTTTT GCTGTCAGTG AATTTAAGAG AGTTAATCGT ATAAATGACA ACGGAATATT TAAAGGTGTT GTATGTGACC AAACATGGAA TGTACTGTTT TCAGATTCTG CATTAAGGAA TGATAAGTCG CCTTCTGCAG GTTCCGGTGA ATGGAATTTC CAGAGCCTGA AAGAAAGACT GAGATTTTTG TCCGCGGCAG AATTGGAAAA GGAAATTTCA GATGCCAGCA AGGAAAAAAG GTCCTTGTCT CCAAGTGCTG AGCTGACTAA AAGATTGAAT CTCATTAATT CAAGAATAGC GAATGACTGG GTATCCAATG GCGTAACCAA AACAAGATTT GCCAAAAGGT ATCTTGACCA GTGTGCAAAA AACGGCATAA CACCACCTGC CGAGTATAAC ACTGAGGCTA AGGTTATTAA AGCATTGACA TGGAATGATG AGAAAATTGA GAAGCTGTAT AGAAATTCAT TGCAATTCCA AAATCTGGAT ATTGATCCAA GGCTGCTTCT GGCGATAATA ATTCAGGAGG GTACAGGAAG TTTTAATACA AATCCGGAGG TAAAGGACAG TAATGGCGGA CATTATATCC AGCCGGATTT TGAAAAAGAT TTGAAAGCTG CATTGGACAA TCAATTTTTG AGAAAAGCAA ATGCATATAG GTACTATGGG GAGCAGTTCT CTGAATTTGT AAGTGGTTTA AAGGTTTCAC AGCCTAATTT AACCAGCGGA AAAGGTAATT TGTATCAATT CCTAAATTAT GCTACAATGG CGGCAACGGT TGATCTCAAT ACTAATGAAA TAATTGAGCT TAAACCCCAT GGAGTATATG CAACTCATGA TGGCTGGTGG AAAAATGTGG AAGCTGTTTT CAATTCATTG GTGGACAACA ATGGAGAGAA ACCTGCTGAA AAGTATTCGA ATTTGTTTAA GAGCTCAAAA AAGGTACCTT TAAAAAGTGG CTATTCAAAA CCGACAGTTG TGTTTAAATT GGAGTGGATA AATGAAGTAA TATACAATAG CAAGACCGAG AAATATGACC CGGGTTATAC AATTAAAGCG ACACTGGCTT CACAGACTAC ACCGGAGGAA CCAAAGCCGA CGGAGAATGT ATTCCCTCTG AAATATGGTG ATACCAGTGC CGTAAAAGGT AGCTACATCA AAGAAGTTCA TGATTTACTT AGCAAAAACC TTGGAAGCGG TCTGGAATAT CTTAAAATTG CAGATGGTGA TGCGGGTTAT GGAAAAAACT ATGGACCAAA AACTACAGCT TTGGTGAAAT TATATCAGAG TCAAAACGGT GCCGCAGTTA ACGGGCAAAT AGATGAAGCT ACTCTGAATA AGCTGAGGAA AGGTGAATGG AAAGTTGTAG CACCCAGTGA AGGAAAATAT GTAATAAAGG GTTCTTATCA GGATTCTTAC TTTGAACTTG TTGTTGATAA AGCTCCTGAT ATTTCTGATA CTGGAGACTT TAGCTTAAAG CTGAAAAGAC TGGTGGAAGA ATATACCGAT TTTAATATTA ACGGGGTTAA AGTTAAATTG CCTTACTATC AGACTGTGGG CACACGTTAT GGAGGTAAAT CAACACCGGA ACAGATAAGA AATTTCATAC TTGGAAAAAC AACAGATCCG TCCAAATTCC AGAGTGTTGC AGATGACCCT GAAAACAGGC ATAAAGTTGG AGTTGACTGT TCGGGACTGG TAGCTTATGT GCTTAATGAA GCAACAGAAG GAGCGATTCA CAAAACCCAC GGGCAAACCG GTTATGCAAA TGGAATTAGT GCGGCAGCCC TCACAAACAC TAAACTTGGA CAGAAAATAA CCAGGGCGAA AGATATTGTT CCGGGAGCTA TAATGAATAC TGATGACGGC GGTCATGTTA TCGTAATATA TGAAGTTGTA AAAACAAACG GAAAAGTTAC TCAAATAAAA TATGCACATT CTAATGGCAA ACACGGGCCT CATAAAGGAT ACATTGACAT AGGAGATGAA AATCAGGACT TGGACGGAAG TGCACAAACG TGGCATGATA TTTCATATAC GGACCAAAAA GCTAAAGAGC TTTATACCTA TACCATTTTG CGTAATGAAG TCATAGAATA TTTGAAAAAA TAA
|
Protein sequence | MIFYYDEEND TIKFLETEVD EETNTIKTTV DHFSIYGVID IVRFAQSWGI KDILDKLLNP GGETIPPVAE IGQADIVFVI DTTGSMGSVI NNVKNNITNF ANTLMENNVD VRLGLIDYKD LEEDGMDSTK NLGWFDNVSD FIASVNNMRA TGGGDAPEST VDALEEARRM DFRPGVNKFI MLFTDVSYKE STRFEDVQSM KTVIEKLKED KIVVSAIVPS GYESLYRNLY TETGGVYANI TQAFSSALQS LISNIASVTH GDGVWIRLSN GMLKKLEGVS SKEELLEMAS GGYLSVGDLF FEEVTVDFFG LKVKTFREAF NKNAVDVRIT PTSGYSGGMY TVDAVATIVT KQQVDKAVLL FRLLNGNWMQ YNAGLDSKFI MSKYNFKLPE EYPERLNDLF IGYRTSLNIL DSGLREFKVV LFDKEGRIIV ESAVNKVKVE SSIKWTGDLS PLETVIGTKN FSIAQNANLL AGEKIALFLR KDGEQFPVMV DSSEQKYKRE QNNYNFSFVS NDWDAVNKRP KYTNGVYWIS VHITDGSGKS LIKSDERKIK INNIWISEYY NSGQNKDIYD IKKRLNELGY FGQNNLQLTV NTVFDYNTSF AVSEFKRVNR INDNGIFKGV VCDQTWNVLF SDSALRNDKS PSAGSGEWNF QSLKERLRFL SAAELEKEIS DASKEKRSLS PSAELTKRLN LINSRIANDW VSNGVTKTRF AKRYLDQCAK NGITPPAEYN TEAKVIKALT WNDEKIEKLY RNSLQFQNLD IDPRLLLAII IQEGTGSFNT NPEVKDSNGG HYIQPDFEKD LKAALDNQFL RKANAYRYYG EQFSEFVSGL KVSQPNLTSG KGNLYQFLNY ATMAATVDLN TNEIIELKPH GVYATHDGWW KNVEAVFNSL VDNNGEKPAE KYSNLFKSSK KVPLKSGYSK PTVVFKLEWI NEVIYNSKTE KYDPGYTIKA TLASQTTPEE PKPTENVFPL KYGDTSAVKG SYIKEVHDLL SKNLGSGLEY LKIADGDAGY GKNYGPKTTA LVKLYQSQNG AAVNGQIDEA TLNKLRKGEW KVVAPSEGKY VIKGSYQDSY FELVVDKAPD ISDTGDFSLK LKRLVEEYTD FNINGVKVKL PYYQTVGTRY GGKSTPEQIR NFILGKTTDP SKFQSVADDP ENRHKVGVDC SGLVAYVLNE ATEGAIHKTH GQTGYANGIS AAALTNTKLG QKITRAKDIV PGAIMNTDDG GHVIVIYEVV KTNGKVTQIK YAHSNGKHGP HKGYIDIGDE NQDLDGSAQT WHDISYTDQK AKELYTYTIL RNEVIEYLKK
|
| |