Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4946 |
Symbol | |
ID | 5673285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5938085 |
End bp | 5939380 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243800 |
Product | von Willebrand factor type A |
Protein accession | YP_001509216 |
Protein GI | 158316708 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0264631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.726246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGACT TCAGTCTCGA GGTCAGCCAG AACAAGTACC TGCCCGAGGG GTCGGGCGAG GTGCACGCCG TCATCACCGT CACCGCGCAC GACGTCCGCC CTGGTACTCC CGGCGCCCCC GGGACGGCCG GGACCGCTAC CACGGCGGGC GCCGCCGAGG TCATCCTGCT GGACTGCTCG GGGTCGATGG ACTACCCCCA CTCGAAAATC ATCGAAGCCC GCCGGGCGGC CCAGGCCGCC ATCGACACGC TGCACGACGG TGTCGCGTTC GCCGTGGTGG CAGGCACCGG GCAGGCCGAA ATGGTGTACC CGACGCGCCA GGAGCTCGTC GAGGCGTCGC CGCGGACCCG CGAGGCCGCG AAGGCCGCGG TGAAACGGCT GCAGCCCCAC GGCGGCACCG CCATGGGCCG GTGGCTGCTG CTCGCCCGCG ACCTGATGGC CACCCGCCCG GACGCCATCC ACCACGCGAT CCTGCTGACC GACGGCCAGA ACGGCGAGAG CGAGGCCGTC TTCGCCGCCG CACTGGCCGC GTGTGAGGGT CGGTTCCAGT GCGACTGCCG CGGCGTCGGT GCCGACTGGA AGGTGGCGGA GCTGCGCCGC GTCGCCTCCA CCCTGCTGGG CGGCGTCGCC CTGCTGCGCG AGCCGGCGGA GATGGCGGAG GACTTCCGCT CGCTGATCGA GCGGGCACAG GCCCGCGGGA TCGACCGGGT CGGCCTGCGG GTGTGGACGC CCAAGGGGGC GACGATCCGG TTCCTGCGCC AGGTGTCGCC CGAGCTCGAG GACCTCACCG CGCGGGCCGT CGAGGTCAAT CCGCTCACCC GCGACCATCC GACGGGCGCC TGGGCCACCG GGACACGGGA GTACCACCTG TGCGTCGACG TCCCCCCGGC GCCGGTGGGG AACGAACGGC TCGCCGCCCG CGTCAGCGTG ATCGCCGGCG GGGACGAGCT CTCCCGGACG GCGGTGCTCG CCGCGTGGTC CGAGGATGAC GAGCTGTCGA CCCGCATCGA CGAGGTCGTC GCGCACTACA CCGGCCAGAC GGAGCTGGCG CGCGCCGTGC AGGACGGGCT CGCGGCACGC CGCGACGGGG ACGAGGTCAG CGCGGTCACC CTGCTCGGCA GGGCCGCGCG GATCGCGGCC GCCGCCGACG ACGGCGCGAC CCTCGAACGG CTGGCGAAGG TCGTCGACAT CGACGATGCG GCCACCGGAG CGGTGCGGCT GCGTCCCCAG GTCGACACCC TCGACGAGAT GGACCTGGAC GCGGGCTCGA CCGTCACCGT GCCCGCCCGG CGGTGA
|
Protein sequence | MVDFSLEVSQ NKYLPEGSGE VHAVITVTAH DVRPGTPGAP GTAGTATTAG AAEVILLDCS GSMDYPHSKI IEARRAAQAA IDTLHDGVAF AVVAGTGQAE MVYPTRQELV EASPRTREAA KAAVKRLQPH GGTAMGRWLL LARDLMATRP DAIHHAILLT DGQNGESEAV FAAALAACEG RFQCDCRGVG ADWKVAELRR VASTLLGGVA LLREPAEMAE DFRSLIERAQ ARGIDRVGLR VWTPKGATIR FLRQVSPELE DLTARAVEVN PLTRDHPTGA WATGTREYHL CVDVPPAPVG NERLAARVSV IAGGDELSRT AVLAAWSEDD ELSTRIDEVV AHYTGQTELA RAVQDGLAAR RDGDEVSAVT LLGRAARIAA AADDGATLER LAKVVDIDDA ATGAVRLRPQ VDTLDEMDLD AGSTVTVPAR R
|
| |