Gene Information |
Locus tag | Francci3_3906 |
Symbol | |
ID | 3906674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 4674267 |
End bp | 4676261 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637881232 |
Product | von Willebrand factor, type A |
Protein accession | YP_482985 |
Protein GI | 86742585 |
COG category | [R] General function prediction only |
COG ID | [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
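The coordinate and length fields in this record are mutually consistent, as the short sketch below checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4674267
end_bp = 4676261
gene_length_bp = 1995
protein_length_aa = 664

# Assumption: coordinates are inclusive at both ends, hence the +1.
span_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
total_codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # coordinate span vs. Gene Length
print(remainder == 0)                      # length is a multiple of 3
print(coding_codons == protein_length_aa)  # codons vs. Protein Length
```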
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0145904 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAATTCCG GAACATTCCG GTATGGCCCG TGGGACGGCG GACCCGACCC ACTCGCCTCT CCCTTCACCG CGTCCGACGC CCTGGATGAG ATGTCCCGCC AGATTCTGGA GGGACGCACC CCGCGGGAGG CTCTGGAGTC GCTGCTGCGC CGGGGCATGC CGGGCCGGCG TGGTCTGGAC GACATGCGCC GGGCCGTCGA GCGGCACCGG CGGGAGGCCC GATCCCACGG CCGGATGGAC GGGACACTCC AAGAGGTCCG CCGGCTGCTC GACACCGCCG TGGGCCAGGA ACGGGCGGCG CTGTTCCCCG ACCCCTCCGA CGACGCCCGG CTGCGCGAAG CCGAGCTGGA CGCGCTGCCC GCGGACCCCG CCCGTGCGGT GCGCTCGCTC GCCGAGTATG ACTGGCGGTC CCCGCCGGCC CGCGAGACGT ACGAGAAGAT CTCCGAGCTG CTGCGTCGCG AGGTCCTCGA TTCCCAGTTC AAGGGGATGA AGCAGGCCCT GGAGGGGGCC ACGCCCGAGG ACTTCGCCCG GATCCGGGAG ATGGTCGGCG CGCTCAACGC CCTGCTGGAG GCGGACGCCC GCGGTGAGGA CACCGACGAG GCGTTCGCCC GGTTCATGCG GGAGTTCGGC GACTTCTTTC CGGAGAACCC CGGCTCGCTG GAGGAACTGG TTGACGCGCT GGCCCGCCGG GCCGCCGCCG CCGCGCGGTT GCTGGCCGGG CTCACCCCGC AGCAGCGCTC CGAACTCGCC GATCTGATGT CCACGGCGAT GGAGGACCTG GGGCTCGCCG CGGAGATGTC ACGACTGGCG CAGGCGCTGC GCGCGTCCCG ACCTGACCTC AACTGGGGTG GACGGCTGCG GGGGCGGGCC GGCCGGTTGG GCGACGGCCT GAGCGGCGAG GAACCGCTCG GTCTCGGCGA CGCCACCACC GCGCTGGAGG AGCTCGCGGA GCTGGACGAA CTGGCCGCCG CGCTGGGGCA GGACTACGCC GGGGCCTCCC TGGAAGACGT TGACCCCGAG GCGGTGGCCC GGGCGCTGGG CCGGTCCGCC GTCGACGATC TGCGAGGGCT GCAGGAGATC GAACGGGAGC TGGAGCGGCA GGGATTCCTC ACCCGCCGGG CCGGGAGCCT CGAGCTCACC CCGCGGGCGG TGCGCCGGAT CGGGCAGGCC GCACTGGCCC GGATCTTCCG GCAGGTCTCC GCCCGGGGCC GCGGGGATCA CAGCGTGACC GACGCGGGTT CCGCCGGGGA TCTGCTGGGC ACCTCCCGGG CCTGGCAGTT CGGCGACACC CAGCCGATCG ACGTCGTCCG CACGGTCCGC AACGCGGTGC TGCGCGGCGG TCCGCCCGGC CGGGGCCACC CGATCCGCCT GGCGGTGGCC GACTTCGAGG TCGCCGAGAC GGAGCGCCGC GCCACGGCGG CGGTGTGCCT GCTCGTCGAC CTGTCCTACT CGATGGCGCT GCGGGGCACC TGGGGCATCG CGAAGTCGAC GGCACTCGCG CTGCACACGC TCGTCAGCAC GTCGTTCCCG CAAGACAAGA TCCACATTGT CGGCTTCTCG GACTACGCCC GGGAGCTGCG GCCGGTCGAG CTCGCCGGGC TCGACTCCGA GATGGTGCAG GGCACGAACC TGCAGCACGC CCTGCTCATC GCTGGCCGGT TGCTGAGCCG CTACCCGCAG TCGGAGCCGG TCATCATGGT GGTCACCGAC GGTGAGCCCA CCGCGCACCT GCTGCGCGAC GGTACTCCCT CGTTCTCCTG GCCGCCGATG CCCGAGACCC TGGAGCTGAC CCTGGCCGAG 
GTCGACCGGC TCACCCGGCG CGGCGTGACC GTCAACGTGT TCATGCTCGA CGACGAGCCA CGGCTGGTGC AGTTCGTCGA GGAGATAGCC AGACGCAACG GCGGGCGGGT TCTCTCACCC GATCCCGCCG CCCTCGGCAA CTACGTCATC CGGGACTACC TGCGGGCCCG GGGCAGGCAC CGCACCGCCC GCTGA
|
Protein sequence | MNSGTFRYGP WDGGPDPLAS PFTASDALDE MSRQILEGRT PREALESLLR RGMPGRRGLD DMRRAVERHR REARSHGRMD GTLQEVRRLL DTAVGQERAA LFPDPSDDAR LREAELDALP ADPARAVRSL AEYDWRSPPA RETYEKISEL LRREVLDSQF KGMKQALEGA TPEDFARIRE MVGALNALLE ADARGEDTDE AFARFMREFG DFFPENPGSL EELVDALARR AAAAARLLAG LTPQQRSELA DLMSTAMEDL GLAAEMSRLA QALRASRPDL NWGGRLRGRA GRLGDGLSGE EPLGLGDATT ALEELAELDE LAAALGQDYA GASLEDVDPE AVARALGRSA VDDLRGLQEI ERELERQGFL TRRAGSLELT PRAVRRIGQA ALARIFRQVS ARGRGDHSVT DAGSAGDLLG TSRAWQFGDT QPIDVVRTVR NAVLRGGPPG RGHPIRLAVA DFEVAETERR ATAAVCLLVD LSYSMALRGT WGIAKSTALA LHTLVSTSFP QDKIHIVGFS DYARELRPVE LAGLDSEMVQ GTNLQHALLI AGRLLSRYPQ SEPVIMVVTD GEPTAHLLRD GTPSFSWPPM PETLELTLAE VDRLTRRGVT VNVFMLDDEP RLVQFVEEIA RRNGGRVLSP DPAALGNYVI RDYLRARGRH RTAR
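The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG can serve as an initiation codon and is then read as methionine. The sketch below translates the first 30 bases of the gene sequence to show this. The helper name `translate` and the start-codon set are assumptions made for illustration; the set lists only three commonly used bacterial start codons, not the full table 11 definition.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading an initiating start codon as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGAATTCCG GAACATTCCG GTATGGCCCG"
print(translate(prefix))
```

The result can be compared with the first block of the protein sequence in this record.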