Gene Information |
Locus tag | Mmcs_4313 |
Symbol | |
ID | 4113143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 4582697 |
End bp | 4584694 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638033459 |
Product | von Willebrand factor, type A |
Protein accession | YP_641474 |
Protein GI | 108801277 |
COG category | [R] General function prediction only |
COG ID | [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
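The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Mmcs_4313.
length = gene_length(4582697, 4584694)
print(length)                  # 1998
print(protein_length(length))  # 665
```

The strand field does not enter this calculation, since the length is the same whichever strand the gene is on.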
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.844585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGATG CCCGCGCGCA CAAGGGACAC GGCCGGTCAT CCCGGTACTC CCGCTATACC GGCGGGCCGG ATCCGCTTGC CCCGCCGGTG GATCTGCGCG AGGCGCTCGA GCAGATCGGT GAGGACGTGA TGGAGGGCAG CTCGCCGCGG CGGGCGCTGT CCGAACTGCT GCGGCGCGGC ACCAAGAACA TGCGCGGGGC CGACCGGCTG GCCGCCGAGG CCAACCGGCG GCGCCGGGAA CTGTTGAAGC GCAACAACCT CGACGGCACC CTGCAGGAGA TCAAGAAGCT GCTCGACGAG GCGGTGCTCG CCGAACGCAA GGAACTCGCC CGCGCCCTCG ACGACGACGC GCGGTTCTCC GAGATGCAGA TCGAGGCGCT GTCCCCGTCG CCGGCCAAGG CCGTCCAGGA ATTGTCGGAC TACCAGTGGC GCAGCCCCGA GGCCCGCGAG AAGTACGACC AGATCAAGGA TCTGCTCGGC CGCGAGATGC TCGACCAGCG GTTCGCCGGC ATGAAGGAGG CGCTGGAGAA CGCCACCGAC GAGGACCGCC AGCGCGTCAA CGACATGCTC GACGACCTCA ACGAGCTGTT GGACAAGCAT GCGAACGGCC AGGATTCGCA ACAGGATTAC GACGATTTCA TGGCCAAGCA CGGCGAGTTC TTCCCGGAGA ATCCGCGCAA CGTCGACGAA CTCCTCGACT CGCTGGCCAA ACGCGCCGCG GCCGCACAGC GCTTCCGCAA CAGCCTCTCC CCGGACCAGC GCGCCGAGCT GGATGCGCTG GCGCAGCAGG CATTCGGCTC GCCGTCGCTG ATGAACGCGC TCAACAAACT CGACTCCCAT CTACAGGCGG CGCGCCCAGG TGAGGACTGG TCGGGGTCCT CGGAGTTCTC CGGCGACAAC CCACTGGGGA TGGGGGAGGG CGCGCAGGCG CTTGCCGACA TCGGTGAGCT CGAACAGCTC GCCGAGCAGC TGTCGCAGAG CTACGCGGGC GCCACGATGG ACGACGTCGA CCTCGACGCG CTGGCCCGCC AGCTCGGTGA CCAGGCCGCC GTCGACGCGC GGACGCTGGC CGAACTCGAA CGCGCCCTGA TGAACCAGGG CTTCCTCGAC CGCGGGTCCG ACGGGAAATG GCGGCTGTCG CCGAAGGCCA TGCGTCAGCT CGGGCAGGCC GCGCTACGCG ATGTGGCGCA ACAGCTTTCG GGCCGCCACG GTGAACGCGA CACCCGCAGG GCGGGCGCCG CCGGCGAGCT GACGGGAGCC ACCCGGCCCT GGCAGTTCGG CGACACCGAA CCGTGGAACG TCACCCGCAC GCTCACCAAC GCCGTTCTGC GCCAAGCGGG TTCGAGCGTA CGCGAGATCC CGGTGAGCAT CACCGTCGAC GACGTCGAGA TCTCCGAGAC CGAGACCAGG ACGCAGGCCG CGGTGGCGCT GCTCGTCGAC ACCTCGTTCT CGATGGTGAT GGAGAACCGG TGGCTGCCCA TGAAGCGGAC CGCGCTGGCG CTCAACCATC TGGTGAGCAC CCGGTTCCGT TCGGACGCAC TGCAGATCGT CGCGTTCGGC CGGTACGCCA GGACGGTGAC CGCGGCCGAA CTGACCGGGC TCGAGGGCGT CTACGAACAG GGCACCAACC TGCACCACGC GCTGGCGCTG GCCACCCGGC ATCTGCGCCG GCACCCCAAC GCCCAGCCGG TCATCCTCGT GGTCACCGAC GGGGAGCCGA CCGCCCACCT CGAGGACTTC GGCGGACGCG ACGGCGCACA GGTGTTTTTT GATTACCCGC CGCATCCGCG GACCATCGCC 
CACACCGTGC GCGGCTTCGA CGAGGTCGCC CGCCTCGGCG CCCAGGTGAC GATCTTCCGG TTGGGCACCG ACCCCGGCCT CGCGCGGTTC ATCGACCAGG TCGCCCGCCG CGTGGGCGGC CGGGTGGTGG TGCCCGACCT CGACGGACTC GGCGCCGCTG TCGTCGGCGA CTACCTGACC TCACGCCGCC GACGGTAA
|
Protein sequence | MADARAHKGH GRSSRYSRYT GGPDPLAPPV DLREALEQIG EDVMEGSSPR RALSELLRRG TKNMRGADRL AAEANRRRRE LLKRNNLDGT LQEIKKLLDE AVLAERKELA RALDDDARFS EMQIEALSPS PAKAVQELSD YQWRSPEARE KYDQIKDLLG REMLDQRFAG MKEALENATD EDRQRVNDML DDLNELLDKH ANGQDSQQDY DDFMAKHGEF FPENPRNVDE LLDSLAKRAA AAQRFRNSLS PDQRAELDAL AQQAFGSPSL MNALNKLDSH LQAARPGEDW SGSSEFSGDN PLGMGEGAQA LADIGELEQL AEQLSQSYAG ATMDDVDLDA LARQLGDQAA VDARTLAELE RALMNQGFLD RGSDGKWRLS PKAMRQLGQA ALRDVAQQLS GRHGERDTRR AGAAGELTGA TRPWQFGDTE PWNVTRTLTN AVLRQAGSSV REIPVSITVD DVEISETETR TQAAVALLVD TSFSMVMENR WLPMKRTALA LNHLVSTRFR SDALQIVAFG RYARTVTAAE LTGLEGVYEQ GTNLHHALAL ATRHLRRHPN AQPVILVVTD GEPTAHLEDF GGRDGAQVFF DYPPHPRTIA HTVRGFDEVA RLGAQVTIFR LGTDPGLARF IDQVARRVGG RVVVPDLDGL GAAVVGDYLT SRRRR