Gene Information |
Locus tag | Acel_1893 |
Symbol | |
ID | 4486150 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 2140552 |
End bp | 2142540 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639730683 |
Product | von Willebrand factor, type A |
Protein accession | YP_873651 |
Protein GI | 117929100 |
COG category | [R] General function prediction only |
COG ID | [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
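The coordinates and lengths in the table above are mutually consistent if the start and end positions are read as 1-based and inclusive. That convention is an assumption here; the record does not state it. A minimal Python sketch of the check, with illustrative function names that do not come from the source database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
length = gene_length(2140552, 2142540)
print(length, protein_length(length))  # prints "1989 662"
```

Both results match the "Gene Length" and "Protein Length" fields listed for Acel_1893.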
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.376882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAT CAGCCTACCG GTACGGACCG TTTCATGACG GGCCGGATCC GTTGGCGCCG CCGTACGACG TCGCACGGGC GCTGGACGAG CTCGGGGACG ACGTTCTTTC TGGCGCGAGT CCCGCAGACG CTCTGAGAAA GCTGCTGCGC CACGGTGCAC CGGGACTGCG TGGCACCGAC GACCTCCTGC GCCAGGTCCG GGAACGCCGG CGTGCCCTTC GGGAGAGCGG CCGGTTCGGC GGGACACTCG AGCAGGCCCG TGCTTTGCTG GACAAGGCCA TCGGCCAGGA ACGCGCCGCC CTTTTCCCCG ATCCCAGCGA TGATGCGCGG TTGCGGGAGG CCGAACTCGA CGCCCTCCCT GCGGACACCG CACGAGCCAT CCGAGCGCTC GCCGACTACG ATTGGCGAAG CCCGCAGGCT CGGCGAACCT ATGAGGAATT GAAGAATCTG CTGCGTGATG AGGTGCTCGA CACCCAGTTT CGGGGGATGC GGGAGGCGCT CCGCCAGATG CGGGACGCCT CGTCCAGCGC GACCGCTGCC GCCGTCAAGG ACATGCTCGC CGACCTCAAT GACATGCTCG CGGCCGACGA GCGCGGCGAG CACACGCAGG AGAAATTCGA CGACTTCATG GCCCGCCACG GCCACTTTTT TCCCGATAAT CCCAGGAATC TCGACGAATT GGTCGACTCG CTGGCGCGGC GGGCGGCCGC CATGGAACGC ATGCTCGCGT CCATGAGCCG GGAACAGCGG GAAGAACTCG CGGCGTTGAT GGCTCAGGTC ATGGCCGACC TCGGCCTGGC GGCTGAACTG GCCCGTCTCA ATGACGCGCT GCGCCGCCGA CGCCCGGATC TCGACTGGTC CGGCCGGACC CGGCTCCGCG GCGACGAACC ACTGTCGGCC CCGGATGCGA CGTCGGTTCT CGAGGAGCTC GCGGATCTCG AAGAAGTCGC CGCCACGCTT GCGCAGGATT ATCCCGGCGC CCGCCTCGAC GACATTGATG AGGAAGCGGT CCGGCGCGCA CTCGGCCGCA GTGCAGTAGA CGATCTGCGC CGGTTGCGGG ACATCGAACG CGAATTGGAA CGGCAGGGGT ACATCCGCCG CGAGGCCGGC CGGCTGGAGT TGACGCCGAA AGCGGTCCGC CGCCTCGGCG CGACCGCACT CCGGCGGATT TTCGCCTCGC TGGAAGGAGC GCGATCCGGC GGCCACGATA CCCCCGATGC CGGGACCGCC GGTGAATTGA CGGGCTCGTC GCGACCATGG GAATTCGGCG ACGAGCAGCC CCTCGACGTC GTCCGCAGCC TGCGCAACGC GATCCGGAAC GGCCGTGTCC GGCGGGAACC CGACGGCCGC CCGGCACTGC GCCTCGCCGT CGAGGATTTC GAGGTCTTCG AAACCGAACG GCGGACCGCC GCCGCCGTCT GCCTGCTCGT CGACCTCTCC TGGTCGATGA CCCTGCGCGG CACGTGGGGC GCCGCCAAGG CAACCGCACT GGCGTTGCAC TCCCTGGTCA CGACGCAATT CCCGCAGGAC GCCCTGCAAA TCATCGGTTT TTCGAATTAC GGCCGAGTAC TTCAGCCCAC CGAGCTCGCC GGCCTGGACG CCGAAATGGT GCAGGGCACC AATTTGCAGC ACGCCCTCCT CATCGCCGGC CGCTTTCTCG ACCGCCATCC CGAATACGAA CCCATCGTCA TGATTGTCAC GGACGGCGAA CCGACCGCTC ACCTCCTGCC GGATGGCGAC TACGCCTTCG ACTGGCCACC GTCCCGGCAG ACGATCACAC TCACACTGGC CGAAGTCGAC 
AAGATGACCC GGCGCGGCGC CGCCTTGAAT GTCTTCATGC TGGCGGACGA TCCGGGGTTG GTGGACTTCG TCGAACTCAT GGCAAAACGC AACGGTGGCA GGGTCTTTTC ACCGTCCAAG GAGAGACTCG GCAGCTACGT GGTCAGCGAC TATTTACGAT CGAGGCGTGG ACGACGTCGG GCGGGCTGA
|
Protein sequence | MSASAYRYGP FHDGPDPLAP PYDVARALDE LGDDVLSGAS PADALRKLLR HGAPGLRGTD DLLRQVRERR RALRESGRFG GTLEQARALL DKAIGQERAA LFPDPSDDAR LREAELDALP ADTARAIRAL ADYDWRSPQA RRTYEELKNL LRDEVLDTQF RGMREALRQM RDASSSATAA AVKDMLADLN DMLAADERGE HTQEKFDDFM ARHGHFFPDN PRNLDELVDS LARRAAAMER MLASMSREQR EELAALMAQV MADLGLAAEL ARLNDALRRR RPDLDWSGRT RLRGDEPLSA PDATSVLEEL ADLEEVAATL AQDYPGARLD DIDEEAVRRA LGRSAVDDLR RLRDIERELE RQGYIRREAG RLELTPKAVR RLGATALRRI FASLEGARSG GHDTPDAGTA GELTGSSRPW EFGDEQPLDV VRSLRNAIRN GRVRREPDGR PALRLAVEDF EVFETERRTA AAVCLLVDLS WSMTLRGTWG AAKATALALH SLVTTQFPQD ALQIIGFSNY GRVLQPTELA GLDAEMVQGT NLQHALLIAG RFLDRHPEYE PIVMIVTDGE PTAHLLPDGD YAFDWPPSRQ TITLTLAEVD KMTRRGAALN VFMLADDPGL VDFVELMAKR NGGRVFSPSK ERLGSYVVSD YLRSRRGRRR AG
|
| |
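The sequences above are shown in blocks of ten characters, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute GC content. The helper names are hypothetical, and the short example string is only the first two blocks of the gene sequence, so its GC value is not expected to match the 67% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGAGCGCAT CAGCCTACCG")
print(len(fragment), gc_percent(fragment))  # prints "20 60.0"
```

Applying `clean_sequence` to the complete "Gene sequence" field (both lines joined) should give a string whose length equals the listed gene length, which is a quick check that nothing was lost in copying.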