Gene Information |
Locus tag | Hoch_3260 |
Symbol | |
ID | 8545648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 4493189 |
End bp | 4495900 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646387927 |
Product | von Willebrand factor type A |
Protein accession | YP_003267655 |
Protein GI | 262196446 |
COG category | [R] General function prediction only |
COG ID | [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
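The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4493189, 4495900)
print(length)                  # 2712
print(protein_length(length))  # 903
```

Under these assumptions the computed values agree with the Gene Length (2712 bp) and Protein Length (903 aa) fields.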
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0740764 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGACG TTTCCAAACG CCTCCTCCTC GGCGTCGTCC TCGGCCTGCT CTGCGCCGCC GCCATCGCCT TCGGCATCGA TCTCTGGCTG GCGCCGCGCG CCGGCGCGGT GGTGCTGAAC TGGGGCGAGC GGCCGATCGA GCTGCTGGAG CCGAGCTGGC TCTACCTCAT CGCGCTCGCG CCCTTCTTCT TCGTCGTCCG CGGCTACTCG CTCACCGATC TATCGACCGC GCAGCAGCTC GTGCAATCGG CGCTGCGCGC GCTGCTGCTG GTCGGCATCG CGGCCGCCCT GGCGCGGCCG GCGTGGACCA CCGAGGACGA CAAGGTGGCC ACGGTGGTGC TGGTCGATGT CAGCGACTCG GTGAGCGACG CCCAGCTCGA CGCCGCCCGC GCCTACGTGG GCGAGCTGTC CGCGGCCAAG CGCGACGATG ATGCTCTGTA CGTGGTCAGC TTTGCCCAGC GGCCGCTGCG GGTACCGGCC ACGGCCGAGG GCGGCTTCGC CATCGAGCGC CACCCGGACG AGGGCGCGGG CACCGACATC CAGGCCGCGG TGCAGCTCGC CTACGGCCTG TACCCGGCCG GCTACGTGCC GCACCTGGTG GTGGTGAGCG ACGGCAACCA GACCGGCGGC GACCTGCTCA GCGAGGCCTA CCGGGCCGAG GAGCTGGGCG TGCGGCTGTC GTGGCAGAGC TTCCCCGAGC AGCGGGTGCA GGAGATCCGC GTGGTCGGCC TGCGCATCCC GGACGAGGTC AAGGTCGGCG CGCCCTTCGA GGTCACGGCC GAGGTGTGGT CCACGCACGA GGAGGAGGTG ACGCTCACCC TGGCCCAGGA CGCGTTCCCC AACCCGCTCG AGCCGCGCAA GCGCGTGACC CTGCGCGAGG GCGTCAACCG CATCCCGTTC AAGTCGCAGG CGACCCGGGC CGGCTTCACC AGCTACCGCC TGCGCCTGGT GGACGCCGCC GGCGACACCG AGAAAGAGAA CAACGAGGCG CTGATGACCA CGCCGGTCAA GGGCCGGCCC AGCGTGCTCT ACGTCGAGGG CAGCGCCATG AGCGACCCGT CGACCGCGCG CTACTTCGAG CGCGCCCTCG AGGGCGAGAA CATCGACGTC GAGGTGCGCG GCCCGCGCGG GCTGCCGTCC TCGGCCAAGG AGCTCGAGCG CTTCGACCTG GTGCTGGTCT CGGACGTGCC GGCGCAGTTT TTGGGCATGG GCCAGATGGC CGCGCTCGAG AGCTACGTGC GCGACCTCGG CGGCGGCTTG ATCATGGCCG GCGGCGAGGA CTCCTTCGGC TCGGGCGGCT ACCAGGGCAC GCGCATCGAG AAGATCATGC CGGTGCGCTT CGACTCGGAG AAACAGCGCG AGCAGCCGCA CGTGGCCATC GCGCTGGTGG TCGATCGCTC GGGCTCGATG TCGGGGCTCA AGATCGAGGC GGCCAAGGAG TCGGCGCGGG CCACGGCCGA GGTGCTCTCG CCCTCGGACC TCATCACCGT GGTCGCCTTC GACAACCAGC CGACCACCAT CGTGCGCCTG CAGCGGGCCT CGAACCGCAT GCGCATCGCC ACCGATATCG CCCGGCTACA GGCCGGCGGC GGCACCAACA TCTACCCGGC GCTGCGCGAG GCCTACGAGA TCCTCCAGGG CGCCAACGCC AAGGTCAAAC ACGTCATCGT GCTCAGCGAC GGCCAGGCGC CCTACGACGG CATCGCCGAC CTGTGCCAGG AGATGCGCAG CGCGCGCATC ACGGTCTCGG CCGTGGGCAT CGGCGACGCC GATCGCAACC TGCTCAATCT CATCACCGAC 
AACGGCGACG GCCGCCTGTA CATGACCGAC GACCTGGCCG CGCTGCCGCG CATCTTCATG AAGGAGACCA CCGAGGCCCA GCGCTCGGCC CTGGTCGAGT CGCCGGTGCG CGCGCACGTG CTCAAGCGGG TGGAGATGAT CGAGGGCACC GGGGTCGAGA ACGCGCCGCT GCTGCGCGGC TACGTGACCA CCAAGCCCAA ACCGCGCAGC GAGGTCATCC TGGTGAGCGA CCTGGGCGAG CCCATCCTGG CCCGCTGGCG CGTGGGCACG GGCACCAGCG TGGCCTGGAC CAGCGACGTC AAGAACCGCT GGAGCGTGGA CTGGATCCGC TGGAACGGCT TTCCCAAGTT CTGGGCCCAG GTGGTGCGCA CGAGCATGCG CAGCAAGGTG CACGAGAGCT ACGATCTGCG CGCCGCGGTC GACAGCGGGC GCGCCGAGGT GGTGGTCGAC GCCATCGACG TCGACGACCA GTTCGTCGAC CAGCTCGACA CCACGCTCGA GGTCATCGAC CCGCGCGACT CGCAGGTGGT GCGCACGCTG CCCATGGAGC AGACCGCGGC CGGGCGCTAC GCGGCCGACT TCACCATGGA CCGCTACGGC AGCTACATGC TCAAGGCCGT GCACCGCCGC GAGGGCCGCG TGGTCGCCGA GTCGATGGGC GCGGTGGCGC TGTCGTATCC GCTCGAGTAT CTCAAGACCA CGCCCGACAC CACCGCGCTG GCGCAGGCGG CCGCGGTCAC CGGCGGTCTC GATCAGCCGC CGCCGACGCA GGTGTTCGAC GCCGCCGGCC AGACCATCGA GTACACCGAG GATCTGTGGC CCTGGGTGCT GCTCTTGGTC GCCTGCCTGC TCATCCTCGA TCTGTACTTC AAGCGCCTGC GCATGTTCGG CTACCGCACG CTCAAGCTGT AG
Protein sequence | MRDVSKRLLL GVVLGLLCAA AIAFGIDLWL APRAGAVVLN WGERPIELLE PSWLYLIALA PFFFVVRGYS LTDLSTAQQL VQSALRALLL VGIAAALARP AWTTEDDKVA TVVLVDVSDS VSDAQLDAAR AYVGELSAAK RDDDALYVVS FAQRPLRVPA TAEGGFAIER HPDEGAGTDI QAAVQLAYGL YPAGYVPHLV VVSDGNQTGG DLLSEAYRAE ELGVRLSWQS FPEQRVQEIR VVGLRIPDEV KVGAPFEVTA EVWSTHEEEV TLTLAQDAFP NPLEPRKRVT LREGVNRIPF KSQATRAGFT SYRLRLVDAA GDTEKENNEA LMTTPVKGRP SVLYVEGSAM SDPSTARYFE RALEGENIDV EVRGPRGLPS SAKELERFDL VLVSDVPAQF LGMGQMAALE SYVRDLGGGL IMAGGEDSFG SGGYQGTRIE KIMPVRFDSE KQREQPHVAI ALVVDRSGSM SGLKIEAAKE SARATAEVLS PSDLITVVAF DNQPTTIVRL QRASNRMRIA TDIARLQAGG GTNIYPALRE AYEILQGANA KVKHVIVLSD GQAPYDGIAD LCQEMRSARI TVSAVGIGDA DRNLLNLITD NGDGRLYMTD DLAALPRIFM KETTEAQRSA LVESPVRAHV LKRVEMIEGT GVENAPLLRG YVTTKPKPRS EVILVSDLGE PILARWRVGT GTSVAWTSDV KNRWSVDWIR WNGFPKFWAQ VVRTSMRSKV HESYDLRAAV DSGRAEVVVD AIDVDDQFVD QLDTTLEVID PRDSQVVRTL PMEQTAAGRY AADFTMDRYG SYMLKAVHRR EGRVVAESMG AVALSYPLEY LKTTPDTTAL AQAAAVTGGL DQPPPTQVFD AAGQTIEYTE DLWPWVLLLV ACLLILDLYF KRLRMFGYRT LKL
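The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch for removing that grouping and computing the GC fraction of a nucleotide string, so that the result can be compared with the GC content field. The helper names are hypothetical, and the short input is a toy example rather than the gene sequence from this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Toy input in the same block layout as the record (not the real gene).
toy = "ATGC GGCC"
print(clean_sequence(toy))  # ATGCGGCC
print(gc_fraction(toy))     # 0.75
```

To apply it to this record, paste the full Gene sequence field into `gc_fraction`. The length of the cleaned string can also be compared with the Gene Length field.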