Gene Information |
Locus tag | Franean1_7319 |
Symbol | |
ID | 5675620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8946847 |
End bp | 8948673 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641246156 |
Product | von Willebrand factor type A |
Protein accession | YP_001511544 |
Protein GI | 158319036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
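The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions match the values in this record but are not stated by the source. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length counts the terminal stop codon,
    which is not translated into an amino acid.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(8946847, 8948673)   # Start bp, End bp from the record
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the record's values this reproduces the listed gene length of 1827 bp and protein length of 608 aa.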
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.505027 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGGTG CCCGCCACCG CCACCGCCGT TCGGGCCCCT CGTTACGGGG GGTCGCGGCG GTGGCTGCGG TACCCCTCAT GGTCGGCGCC CTGACCTGCG GATGGCTCGT CCTGCGCGGA GGGGCCGGCC CCGTTCGCTG CGATCGCACG ATCACCCTCG GGGTCACCAC CTCGCCGAGC CTGGCCACGG CGTTGAGCGA GGCCGCCGCG GCCTACGGCA GCGGGAAGCC GACGGTGTCC GGGTACTGCG TGTCCGTCCG GGTCGACACG GCGGGCGGCG GCCAGGTCGC CTCCTACATG AGAGGCGGCT GGACGGACCC GACGGCCGGT CCGATCCCGG ACGTCTGGGT GCCGGACTCC ACGGACTGGC TCACGCTGGC GCGGACCACC GAGCCGGCCA ACCGGCTGCT GGTGGACACC GGCACCGTTA TCGCCACCTC ACCGGTCGTC ATCGCGATGC CCCGCCCGAT GGCCGAGGTG TTCGGCTGGC CGCGTCGCGA GCTCTCCTGG GCCGACCTGC GCAAGCTCGG TGGCGACGAG GGCTACTGGG GGTCACGGGG ACGGCCGGCC TGGGGCGGTT TCACCGTCGG GCTGCCCGAC CCCCGGGTGT CGGCTGCCGG GATGACCGCG CTGGCCGACG CCGTGGCCGC CGCGCTGAAG ACCCCGGTCG AACGGCTCAC CGAGGACATG TTCACCGACG GCCTCGCGGC CAAGGGTGCC CTCCTGGATC TGGAGCGCTC GTCGGCGCTG GTAGCCGCCT CCGACACCGA CCTGCTCACG GCCGTCCGCG CGGCGGACCT CGAGGACCCC GCGGCGACCA GGCTCACCGC GTTCCCGCTT CAGGAGAGTC TCGTCTACCA GTACAACCGC CGGGTGGGCA TCGGCGCCGC GCTGCCGGAC GGCCGCGGAC CGGAGCTGGC CGCCTTCTAC CCGCGGGACG GCACCGAGCT CGACGAGATC CGGTACACCG TGCTGAGCCG GGCGTCGGAC GACCCGGTGA AGGCCGAGGT GGCGCGGGAC TTCCTGCGGA CGCTGACGTC CGGGCCGGGG CGGGTCGCCC TGCTCGGGAA CGGCCTGCGC CCCCCGGACG GCATCGCCGA CTCGTTCACA GCCCGGACGG GGCTCACCCC GCGGCCGCGG ATGACGCCCG AACGCACCCT GGACGCGACG GTGCTGACCG CGCTCCAGGG GAGCTTCGCA GGCGTCCATC AGCGCGGGAA CACCCTCGCC GTGCTGGACA CCTCGGGCTC CATGAACGAG GAGGTGCCGG GCAGCGCGGG CCGCAGCCGG CTGTCGGTGG CGCTCGACGC GGCGAAGTCC GCCATCCCGC TGTTCGCCGA AGACAGCGAT CTCGGGCTGT GGCAGTTCTC CACCCGGCTG CGCGGCGACC AGGACTGGGA GGAGCTCGTG CCGCTCGGGC CGATGGGCGA GCGGCTGGGC GCCGGCACGC GCTCGCAGGC GGTGATGGAC GCGGTGAACC GGATCGAGCC GCGCGGCGAC ACCGGGCTCT ACGACACGGC CCTCGCCGCG TTCCGCTACA TGAACCAGCA CTATGTGCCG GGCCGGCCCA ACCAGGTCGT GCTGCTGACC GACGGGAAGA ACTCCGATCC CGGCAGCATC GCGCTCGACG AGCTGGTGCG GATCCTGCGC CGGGAGTACT CGCCCCAGCG GCCCGTCCAG GTGATCACGA TCGGCTATGG CGCCGACACG GATCTCGCCG CGCTGTCGCG GATCTCGGCC GCGACCGGAG CCGAGACGTA TCCCGCGCTG GACCCGAACA CCATCTTCGA GGTCCTCGTC 
GACGCGCTGA CCGAGGTTCC CGGCTGA
|
Protein sequence | MPGARHRHRR SGPSLRGVAA VAAVPLMVGA LTCGWLVLRG GAGPVRCDRT ITLGVTTSPS LATALSEAAA AYGSGKPTVS GYCVSVRVDT AGGGQVASYM RGGWTDPTAG PIPDVWVPDS TDWLTLARTT EPANRLLVDT GTVIATSPVV IAMPRPMAEV FGWPRRELSW ADLRKLGGDE GYWGSRGRPA WGGFTVGLPD PRVSAAGMTA LADAVAAALK TPVERLTEDM FTDGLAAKGA LLDLERSSAL VAASDTDLLT AVRAADLEDP AATRLTAFPL QESLVYQYNR RVGIGAALPD GRGPELAAFY PRDGTELDEI RYTVLSRASD DPVKAEVARD FLRTLTSGPG RVALLGNGLR PPDGIADSFT ARTGLTPRPR MTPERTLDAT VLTALQGSFA GVHQRGNTLA VLDTSGSMNE EVPGSAGRSR LSVALDAAKS AIPLFAEDSD LGLWQFSTRL RGDQDWEELV PLGPMGERLG AGTRSQAVMD AVNRIEPRGD TGLYDTALAA FRYMNQHYVP GRPNQVVLLT DGKNSDPGSI ALDELVRILR REYSPQRPVQ VITIGYGADT DLAALSRISA ATGAETYPAL DPNTIFEVLV DALTEVPG
|
| |
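The sequence fields can be sanity-checked in a similar way. The following is a minimal sketch, not the database's own method: it strips the whitespace used to group bases in blocks of ten, computes GC content as a percentage, and tests whether a sequence has the basic shape of a coding sequence. The helper names are hypothetical, and requiring an ATG start is a simplification, since translation table 11 also permits alternative start codons.

```python
def clean_sequence(raw: str) -> str:
    """Drop the whitespace that groups bases into blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(raw: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Rough shape check: length divisible by 3, ATG start, recognised stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    seq = clean_sequence(raw)
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence in this record.
    print(gc_percent("ATGCCTGGTG CCCGCCACCG"))
    # A toy 9-base sequence, not taken from the record.
    print(looks_like_cds("ATGCCTTGA"))
```

Passing the full gene sequence from the record to `gc_percent` should give a value close to the listed 73%, and `looks_like_cds` should accept it, since the sequence begins with ATG and ends with TGA.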