Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6197 |
Symbol | |
ID | 5674518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7529571 |
End bp | 7530728 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641245049 |
Product | VWA containing CoxE family protein |
Protein accession | YP_001510447 |
Protein GI | 158317939 |
COG category | [R] General function prediction only |
COG ID | [COG3552] Protein containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.787747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0617522 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGCGC GGCCGTCCGC CGCGGGTCCG CTCGCCGAAC CGGACCCGCT GGTCGCCCTG ACCGGGTTCG CCCGCGCGCT GCGCGGCGCC GGCGCGGCGG CGGACGCGAG CCGGGTCGCG ACGGCGGTGC AGGCACTCAC CCACCTCGAT CCCACCAGCG CCGCCGACGT GTACTGGGCC GGTCGCCTGG CGCTGTGCGC CGAACCGGAC GACCTGCCGC GCTACGACGC GCTGTTCGAC GAGTGGTTCC GCGGGCGCCT CGACGGGCTG CCCGGGCAGG CCGCCGCCGG GGCCGTCGCG CCGTCGTCGC GGGCGGTGCG GGTCTGGCCG TCCAGCGGCA CCGGCACCGC GCGCGACGAC GGTGACGACA GCCCGGCCGA CCTGCTCCCC GTCGGCGCGA GCGACGTCGA GCTGCTGCGT CGCCGCGACG TCGCGGACCT CAGCCCGGCG GAGCGGGCCG AGATCGACCG GCTCGTCGGC CTGCTCGCCC CCCGGGTCGG GTCCCGTCCC AGCCGCCGGC GCCGGCCCGG CGGGAACCGC GGGCTCGACC CGCGCCGCAC CGTGCGCGCC ATGCTGCGCG ACGGCGGCGA GCCCGGTGAG CTCGTCCGCG CGCGCCCGCG GGTACGGCCC CGGCGGCTGG TGTTCCTCGT CGACGTCAGC GGTTCCATGA GCCCCTACGC CGACGTGATG CTGCGCTTCG CGCACGCCGC CGTCCGGGTC GCCCCGTTCG CCACCGAGGT GTTCACCTGC GGGACCAGGC TGACGCGACT CACCCGTCCG CTGCGGCTGC GGGACGCGGG GGAGGCCCTG AGGGCGGCCG GCGAGGCGAT TCCCGACTGG AGCGGCGGCA CCCGCCTCGG CGAGTCGCTG CGCGCCTTCC TCGACCTGTG GGGGCAGCGG GGCACCGCCC GCCAGGCGGT CGTGGTGATC GTGTCCGACG GGTGGGAGCG CGGCGACGTC ACCCTGCTCG CCGAGCAGAT GGCCCGGCTG GCCCGGCTCG CGCACCGGGT CCTCTGGGTG AACCCGCACA CCGGCCGGGA CGGGTTCACG CCGACCGCCG CCGGCATGTC CGCGGCGCTT CCCCACGTGG ACGACCTGTT GGCCGGGCAT ACGTTCCAGG CACTGCGAGG ACTTGCCGAG GTGATCTCCG ATGCGTGA
|
Protein sequence | MNARPSAAGP LAEPDPLVAL TGFARALRGA GAAADASRVA TAVQALTHLD PTSAADVYWA GRLALCAEPD DLPRYDALFD EWFRGRLDGL PGQAAAGAVA PSSRAVRVWP SSGTGTARDD GDDSPADLLP VGASDVELLR RRDVADLSPA ERAEIDRLVG LLAPRVGSRP SRRRRPGGNR GLDPRRTVRA MLRDGGEPGE LVRARPRVRP RRLVFLVDVS GSMSPYADVM LRFAHAAVRV APFATEVFTC GTRLTRLTRP LRLRDAGEAL RAAGEAIPDW SGGTRLGESL RAFLDLWGQR GTARQAVVVI VSDGWERGDV TLLAEQMARL ARLAHRVLWV NPHTGRDGFT PTAAGMSAAL PHVDDLLAGH TFQALRGLAE VISDA
|
| |