Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3131 |
Symbol | |
ID | 4809694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3700138 |
End bp | 3701835 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108564 |
Product | von Willebrand factor, type A |
Protein accession | YP_001039519 |
Protein GI | 125975609 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGAAA GCAAAAAGAA TTTAAAAGTT ATAGTAATTT TATTGGGAAT AGCTTTAGTT GTGTTCGGGC TTATTTATGG TGGCATTTCT TTGACGCAAA ACTTTGGAAA AAGTCAAAAA GTAATTTCAA TGGAAAGTGC CGAAAAAAAA CTGGATAAAC TTTATAAAGA TATAACTGTA AATATTATTG AGCCAAAAAA AGGGCAGGTG GACATTGATC CTCCGGACCT TAAAGAGTCT TTGCCTGATA TTTCCAAATA TCCGCCTCAG GTGGAAGAAA CTACAGGCAC TTTTATAGAA ATCTTTTCTT CCACTGAAAA GTCAGGAGAG AAAAAAGACG GATGGCTTCT TGATGTGGCA AGGGAATTTA ACAGAGCCAA CATACAGGTA AACGGTAAAC CTGTTTCTGT CAGAATCAGA GGTATTGCCT CGGGTTTGGC GACAGACTAC ATAATTTCCG GTAAATATCT TCCTGATGCC TTTACACCTT CCAACGAGCT TTGGGGGGAA ATGATCAGGG CAAGCGGGGT GGATATATCC CTTGTTGAAA AAAGACTTGC CGGTAATGTT GCAGGAGTGC TTCTTTCAAA AGCAAAGTAT GAGGAATTGC TTCAGAAATA TGGTTCAATA AATTTAAAAA ATATTACTGA GGCGGTTGCG GCAAACGAAA TAGCAATGGG TTATACCAAT CCTTTTGCAA GTTCAACGGG AATGAACTTT CTTGTTTCAA CATTGAGTAC TTTTGACAGT AAAAATATAT TGAGCGAAAA AGCTATAGAA GGTTTTGAAA AATTTCAGAC CAATATTCCT TTTGTGGCTT ATACTACTTT ACAGATGAGG GAGTCTGCAA AATCCGGCGT TCTCGACGGC TTTATACTGG AGTACCAGAC CTATGAAAAT ACTCCTGAAC TGAAAAAGGA CTATGTTTTC ACTCCTTTTG GTGTAAGACA TGACAGTCCG ATGTATGCCA TCGGAAATCT AACTCAGGAG AAAAAAGAAA TACTCAATAA ATTCGTTGAG TTTTGCAAAA GCAGCAAATC ACAGGAGCTT GCAACAGAAT ACGGTTTCAA CAGGCTTGAC GATTATTTGC CTGAAATATC GAATTTTGAC GGAGAGGCTA TAATGAAAGC CCAGAAGCTT TGGAAAGAAA AGAAGGATGT TAACAATGAC ATTGTAGCCG TTTTTGTTGC CGATGTGTCG GGAAGTATGG CAGGTGAACC GCTCAACAGA TTGAAGCAAT CTCTTATAAA TGGTTCTAAA TATATAAGTT CAGATGTTTC CATCGGGTTG GTGTCTTATT CCACGGATGT GAATATAAAT CTTCCGATTG CCAAATTTGA CTTAAACCAA AGGTCTTTGT TTGTAGGTGC GGTTGAAAGC CTGGCTGCGG GCGGCAATAC AGCAACGTTT GACGCGATAA TTGTGGCAAC GAAAATGCTT AAGGAAGAAA AAGCAAAGAA TCCTAATGCC AAATTGATGC TGTTTGTGTT AAGTGACGGT GTGACAAATT ACGGCCACTC GCTAAACGAT ATTAAAGATA TGATGAAGAC TTTCGGAATT CCAATTTATA CTATAGGATA TAACGCAAAT ATAAAGGCAT TGGAGACTTT ATCACAAATA AACGAAGCGG CAAATATAAA TGCTGATACG GAAGATGTTG TATATCAGTT GGGAAGTTTG TTCAACGCCC AGATGTAA
|
Protein sequence | MPESKKNLKV IVILLGIALV VFGLIYGGIS LTQNFGKSQK VISMESAEKK LDKLYKDITV NIIEPKKGQV DIDPPDLKES LPDISKYPPQ VEETTGTFIE IFSSTEKSGE KKDGWLLDVA REFNRANIQV NGKPVSVRIR GIASGLATDY IISGKYLPDA FTPSNELWGE MIRASGVDIS LVEKRLAGNV AGVLLSKAKY EELLQKYGSI NLKNITEAVA ANEIAMGYTN PFASSTGMNF LVSTLSTFDS KNILSEKAIE GFEKFQTNIP FVAYTTLQMR ESAKSGVLDG FILEYQTYEN TPELKKDYVF TPFGVRHDSP MYAIGNLTQE KKEILNKFVE FCKSSKSQEL ATEYGFNRLD DYLPEISNFD GEAIMKAQKL WKEKKDVNND IVAVFVADVS GSMAGEPLNR LKQSLINGSK YISSDVSIGL VSYSTDVNIN LPIAKFDLNQ RSLFVGAVES LAAGGNTATF DAIIVATKML KEEKAKNPNA KLMLFVLSDG VTNYGHSLND IKDMMKTFGI PIYTIGYNAN IKALETLSQI NEAANINADT EDVVYQLGSL FNAQM
|
| |