Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0822 |
Symbol | |
ID | 5669238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 960226 |
End bp | 962223 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641239751 |
Product | von Willebrand factor type A |
Protein accession | YP_001505186 |
Protein GI | 158312678 |
COG category | [R] General function prediction only |
COG ID | [COG4867] Uncharacterized protein with a von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.649548 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTTCCG GACAGTTCCG CTACGGCCCA TGGGACGGCG GGCCGGACCC GCTGGCCTCC CCGTTCTCCG CGACCGACGC CATGGACGAG ATGTCCCGGC AGATCCTCGA GGGGCGCACG CCCCGGGAGG CCCTGGAAAG CCTGCTGCGC CGGGGCATGC CGAACCGCCG CGGCCTGGAC GACCTGCGCC GCGCGATCGA GCGCCGGCGC CGCGAGGCCC GCTCGCGCGG CCGGCTGGAC GGCACCCTCG AGGAGGTCCG CCGGCTGCTC GACACCGCGG TCGGGCAGGA GCGGGCCGCG CTGTTCCCCG ACCCGGCGGA CGAGGCGCGG ATGCGCGAGG CGGAGCTGGA CGCACTGCCG TCCGACCCGG CCCGGGCCGT GCGCGAGCTG GCCGACTACG ACTGGCGCTC GCCGCAGGCC CGCGAGACCT ACGAGCAGAT CTCCGACCTG CTGCGCCGCG AGGTCCTCGA CGCGCAGTTC CGCGGCATGA AGCAGGCGCT GCAGGGCGCC TCGCCCGAGG ACATGGCCCG CGTGCGCGAG ATGGTCCGCG AGCTCAACGA GCTGCTGGAG GCGGACGCCC GGGGTGAGGA CACCCAGGGC CGCTTCGACG AGTTCATGCG TCGCAACGGC GAGTTCTTCC CGGAGAACCC GCGCAACCTC GACGAGCTCG TCGACGTGCT CGCCCGCCGG GCCGCGGCGG CGGCGCGCCT GCTCGCCGGA CTGACCCCCC AGCAGCGCCA GGAGCTCGCC GACCTGATGT CCACCGCGAT GGAGGACATG GGCCTGGCCA CCGAGATGTC CCGGCTGTCG CAGGCGCTGC GGTCGGCTCG GCCGGGCCTG GACTGGGGCG GACGCTCGCG CGGGCGCGGC CGGGGTCGGT TCTCCGACGC GATGACCGGC GAGGAGCCGC TCGGCCTCGG TGACGCGACC AGCGCGCTCG AGGAGCTGGC GGAGCTCGAC GACCTCGCGG CCTCACTCAG CCAGGACTAC CCGGGCGCGT CCCTGGACGA CGTCGACCCG GAGGCCGTCG AGCGCGCGCT CGGCCGTTCG GCCGTCGACG ACCTGCGTAA CCTGCAGCAG ATCGAGCGGG AGCTCGAGCG GCAGGGGTTC GTCACCCGCC GCGCCGGAGC GCTGGAGCTC ACGCCGCGGG CCGTCCGGCG CATCGGGGAG TCGGCGCTCG CGCGGATCTT CCGCGAGGTG GCGGCGCGCG GGCGCGGCGA CCACAGCCTC ACCGACGCCG GCTCGGCCGG TGACCTGCTC GGGACGTCCC GGCGCTGGCA GTTCGGTGAC ACCCAGCCCA TCGACGTCGT CCGCACGGTG CGCAACGCGG TGCTGCGCGG CGGGCCGCCG GGGCGGGGCC AGCGGATCCG GCTGGCGGTC GACGACTTCG AGGTCGCCGA GACGGAGCGG CGCACCAGCG CGGCCGTCTG CCTGCTCGTG GACCTCTCCT ACTCGATGGC GCTGCGCGGC ACCTGGGGGG TGGCGAAGTC GACCGCGCTC GCACTGCACA CCCTGGTGGC CACCCGGTTC CCGCAGGACA AGGTGCACAT CGTCGGCTTC TCCGACTACG CCCGCGAGCT GCGCCCGGTC GAGCTCGCCG GGCTCGACTC GGAGATGGTC CAGGGCACGA ACCTGCAGCA CGCGCTGCTC ATCGCCGGAC GGCTGCTGCG CCGCTACCCG GACTCCGAGC CGGTGATCAT GGTGGTGACG GACGGCGAGC CCACAGCCCA CCTGCAGCGC AACGGCACGC CGTCCTTCTC CTGGCCGCCG CTGCCCGAGA CACTGGAGCT GACGCTCGCC GAGGTCGACC GGCTGACCCG CCGCGGCGTC ACGATCAACG TCTTCATGCT GGACGACGAG CCCCGCCTGG TGCGCTTCGT CGAGGAGATG GCCCGCCGCA ACGGCGGGCG GGTGCTCTCC CCGGACCCGT CCGCGCTGGG CAGCTACGTC ATCCGGGACT ACCTGCGTTC CCGCAGCTCG CGGCGCGTCG CCCGCTGA
|
Protein sequence | MSSGQFRYGP WDGGPDPLAS PFSATDAMDE MSRQILEGRT PREALESLLR RGMPNRRGLD DLRRAIERRR REARSRGRLD GTLEEVRRLL DTAVGQERAA LFPDPADEAR MREAELDALP SDPARAVREL ADYDWRSPQA RETYEQISDL LRREVLDAQF RGMKQALQGA SPEDMARVRE MVRELNELLE ADARGEDTQG RFDEFMRRNG EFFPENPRNL DELVDVLARR AAAAARLLAG LTPQQRQELA DLMSTAMEDM GLATEMSRLS QALRSARPGL DWGGRSRGRG RGRFSDAMTG EEPLGLGDAT SALEELAELD DLAASLSQDY PGASLDDVDP EAVERALGRS AVDDLRNLQQ IERELERQGF VTRRAGALEL TPRAVRRIGE SALARIFREV AARGRGDHSL TDAGSAGDLL GTSRRWQFGD TQPIDVVRTV RNAVLRGGPP GRGQRIRLAV DDFEVAETER RTSAAVCLLV DLSYSMALRG TWGVAKSTAL ALHTLVATRF PQDKVHIVGF SDYARELRPV ELAGLDSEMV QGTNLQHALL IAGRLLRRYP DSEPVIMVVT DGEPTAHLQR NGTPSFSWPP LPETLELTLA EVDRLTRRGV TINVFMLDDE PRLVRFVEEM ARRNGGRVLS PDPSALGSYV IRDYLRSRSS RRVAR
|
| |