Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3075 |
Symbol | |
ID | 4809949 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3615489 |
End bp | 3617099 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640108499 |
Product | von Willebrand factor, type A |
Protein accession | YP_001039464 |
Protein GI | 125975554 |
COG category | [R] General function prediction only |
COG ID | [COG2304] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATATTG TTTCAAAATA CAGAAGTAAT GTATTATGGC TTTTGATTCC GTTTTTTTTA ATGGCGGTGT TGTTTTCTTC CTGCAGCAGT TCTGTCAACA AAGGTTTAAA TTCATCATTA ATGGTTGAGA ATTCACATTT GGAAAAAGGT ACCGGATATT ATGACATAAT TGACATGGAT ATTAATATTG GTGACGATGA TACCTATAGT TTGGAATATA ATACCGAAGA CTATAATGTT ATTGAGGAAA ATATATTTCT TGATACCTAC AACAATCCCC TTTCAACATT TTCTGTGGAC GTGGATACGG CATCCTACAG CAATATAAGG CGTTTTCTCA ATTCTTCGCA AAAACCTCCT GTGGATGCCG TAAGAATTGA AGAAATGATT AACTATTTTA CATATGACTA TCCTGATGCC GACGGCGATG AACCGTTCAG CATAACCACC GAGATTGGAC AGTGCCCGTG GAATCCGGAA AACAAGCTTA TGCTTGTAGG ACTTCAGACA AAAAAGCTCT CCACGGAACA GCTGCCTCCA AGCAACCTGG TTTTTCTGAT AGATGTTTCG GGTTCAATGG ATGAACCTAA CAAACTTCCC CTTTTAAAAT CAGCTTTTAA GCTTCTTGTG GACGAACTGG ACGAAGATGA CAGAGTGTCT ATTGTAGTTT ATGCGGGTGC GGCAGGTTTG GTTTTGGATT CGACTCCGGG AAATGAAAAG GACAAAATTC TTGACGCTCT GATGAATCTT GAAGCCGGAG GCTCGACGGC AGGAGCCGAA GGAATAAAAC TTGCCTATGA CGTTGCTAAG AAAAACTTTA TTAAGTCAGG CAACAATCGT GTTATTCTGG CAACGGACGG AGATTTTAAT GTTGGTATAA GCAGCGAGGC TGAGCTTGTC AGGCTGATTG AGAAAAAAAG AGATGAAGGT ATATTCCTTA CAGTGCTTGG TTTTGGAACG GGCAATTACA AGGATTCAAA GATGGAAAGC CTTGCAGACA AAGGAAACGG AAACTATGCC TACATAGACA ATATTGCGGA AGCCAGGAAA GTTCTTGTAA ATGAAATGGG TGCAACTTTA AACACGGTTG CAAAGGATGT AAAAATTCAG GTTGAATTCA ATCCCGCAAA GGTAAAGGCT TACAGGCTGA TTGGGTATGA AAACAGGCTT CTTAGAAATG AGGATTTTAA CAACGATTCG GTGGACGCGG GAGAAATAGG TGCGGGACAC TCGGTAACCG CACTTTATGA GATTGTTTCT GCGAATTTGG ATTTTGAGGT TTCGAAAGTT GACGAGCTCA AGTATCAAAA ATCTCAGCTT GTAGAAAGCG ATGAAATTGC CACTGTGAAG GTAAGATATA AAAAGCCTGA CTCGGATACA AGTGAACTGC TTTCAGAAAC TGTGTTGGAA AAAGCAGAAG AAAATACATC AAAAAACCTG GAATTTGCGG CAGCTGTGGC AGAATTCGGC ATGTTGCTCA GAGAATCAAA GTATAAAGGC AATTCTTCTT ATGATCATGT TTTAGAGGTG GCAAAGCAAT ATGCTGACGG TGACGACGGA TACAGGAGTG AATTTGTAAA GCTGGTGGAA AAAGCGGAAA GTATTAATTG A
|
Protein sequence | MNIVSKYRSN VLWLLIPFFL MAVLFSSCSS SVNKGLNSSL MVENSHLEKG TGYYDIIDMD INIGDDDTYS LEYNTEDYNV IEENIFLDTY NNPLSTFSVD VDTASYSNIR RFLNSSQKPP VDAVRIEEMI NYFTYDYPDA DGDEPFSITT EIGQCPWNPE NKLMLVGLQT KKLSTEQLPP SNLVFLIDVS GSMDEPNKLP LLKSAFKLLV DELDEDDRVS IVVYAGAAGL VLDSTPGNEK DKILDALMNL EAGGSTAGAE GIKLAYDVAK KNFIKSGNNR VILATDGDFN VGISSEAELV RLIEKKRDEG IFLTVLGFGT GNYKDSKMES LADKGNGNYA YIDNIAEARK VLVNEMGATL NTVAKDVKIQ VEFNPAKVKA YRLIGYENRL LRNEDFNNDS VDAGEIGAGH SVTALYEIVS ANLDFEVSKV DELKYQKSQL VESDEIATVK VRYKKPDSDT SELLSETVLE KAEENTSKNL EFAAAVAEFG MLLRESKYKG NSSYDHVLEV AKQYADGDDG YRSEFVKLVE KAESIN
|
| |