Gene Information |
Locus tag | Franean1_0669 |
Symbol | |
ID | 5669086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 789736 |
End bp | 792663 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641239596 |
Product | WD-40 repeat-containing serine/threonine protein kinase |
Protein accession | YP_001505034 |
Protein GI | 158312526 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
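The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminating stop codon; neither convention is stated in the record, but both match the listed values. The function names are hypothetical and chosen only for illustration.

```python
def feature_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values taken from the record for Franean1_0669.
gene_len = feature_length_bp(789736, 792663)
print(gene_len, protein_length_aa(gene_len))
```

With the record's values this reproduces the listed gene length of 2928 bp and protein length of 975 aa.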
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0315888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.915288 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGTCATTG CCCGGCGACA CCTCGAACGT TCGCGCGGCA ATCAACCGGT TTCGATCGGT CCGTATGTTG TCGAACGTCA GCTCCTCGAG GCCGGAACCG GGCCGGTCTA TCTCGGGCGG GATCCCGAGG GCGGCCAGGT GGCCATCAAG GTGATCAGCG CGGCGTTCGC TCGCGACCAT GATTTCCGCC GCAGGCTGCG CGCCGATCTC GAGACCGTAC GCGCACTGGC ACCTCCGTGC CTGGCGGACA TCATCGACGC CGACACATCC GCCCATCCGC CGTATGTCGT CACCGAGTTC GTGGACGCTC CGACCCTGGC AGCGACGGTC GCGCAGGGCG GTCCGCTGGC GGTGCCCGAC GTTCGCCGGC TCGCCGTGGC GCTCGGTTCG GCGCTGACGG GGCTCCATGG TGCCGGTCTC GTCTTCGGTG ACCTTAAACC GGCGAATGTG GTGCTTTTCG AGGGTGGAAT CCGCCTCGTC GACTTCGGGT TGTCGCGAGT TTTGAACACC GTTGCCCTAC CGGGTCGCGG CGGATCAGGG CCCGGGATGG GTACCCCCGC ATTTATTACC CCGGAGCATG TCCTGAGGCA GCCGCTCACG ATGGCGTCAG ACATCTTCAC GTGGGGTGGA GCAGTCCTCT TCGCCGCGAC CGGACGACTG CCGTTCGGTA ACGGATCACC GCAGGTTCTG TTGCAACGTG CGGTCTATGC GGAACCCGAC CTCACCGGTT TGGACCCGGT ACTGCGCGAC GTAGTCAGCG CCACGATGCG TAAGGATCCC TCCCGCCGAC CCGGCGCCGC GGAGCTGCTC GAGGTGCTCG GACGACTTGT CGGTGGCCTC CCGGCACCCA CGGGCCTGGA TCCGCTCGAG ATCGCCGCCG CGGCGGCCGT AAGCGCCGCC GCCCCGCCCA CCGAGACACC CGCCGCCCCG CCTGACGAGA TGTCTGCGGT CGAGGCGCCT GCCGACACCG CTGGCGTCGT CCCGGCGGCC GCTGTGGCAG TCGCCGAGGC CGCCGAGCCC GCCCCGGTGC CGGCGAGCGC GCCGGACGCG GCTGTCGCTC CGGATCCGAC GCCCGACTCC GATGCCGAGC CCGCGCCCGA GGCCCCGGCG GAGTCTGAGC CCACGGCGGA GTCTGAGCCC ACGGCGGAGT CTGAGCCCAC GGCGGAGTCT GAGACGAAGC TTGAACTCGG GCCCGAGGCC TCGCTCGAAC CTGAGCCCAC GGCGAAGTCC GAAGCGAGGC CCGAGCCTGA GAGCGGTCCC GGGGCTGGCA CCGGGGCGGC GGCCAGCGCT GCCGCCGAGC CCGGGATCGG AAGGCCGCCG GCTCAGTCTT CGGAGGCGGC CCAGTCTTCG GAGCTGTCGG AGCCCACTGC GCTGGCTGCG CTGGCCGACG GACGTCGACG GCCTGACGGG CACTGGTTGT TTCGGCTGTT CATGGCGGGG ATCGCCAGTC TCGCCGTCCT GACCATGGCG GTCGAGATCA CGGGGGCTGT TCGGGCCCAG GCCGCGGTGC GGGCGTCCAA GGCCACCGCC GGTCAGGCGC GCGCCCTGCT GGAGCGTCAG CCGGACCTCG CCGGCCAGCT CGCCGCGGCG GCCTACGAGA TCGCTCCGAC CGCGGCGGCC GGCGAGGCGC TCATCGCCGC GGCTGTCCGT CGGAGCGGCC ACCTGCCCGG CGACGTCCGC GACCTCGCCG TGGCCCCGGA TGGCAGCAGC ATCGTCACCG CGGGCGACAC CGGCGCCGGC CTCTGGAACC TCACCGACCC GTCCGCCAAC CGTCGCATCA CCGCGTTCCC CGTCGACGGG 
CCCGCGGTCA GCGCTGTGGG CTACGTGTCG TCCCCCGGGC GGACGGCCGC CGCCGGCCAG ATCATCGTCA CCGCGGCCGG CCCGGCAGGC GCGGGCGAGA GCAAGGTCCA GCTGTGGCGG GTGACGCCGG ACGGCGCGGT CGAGCGGCTC GGCGTGCTCG CCGGGCACAC CGGCACCGTC GGCGAGATCG CGGTGAGCCG GGCCGGCGAC GCGATCGCCA CCGGCTCCTC GGACGGCATG CTCCGGCTGT GGGACGTGAC CGACCCACGC GCCCCCGCCG AGCTCGCCGT CCTGCGGACA CCCGGGCCGA TCACCGCGTT GGCGTTCTCC CCGGGCGGGG ATCAGATCCT GGTCGGCGGG GCCGCCCGCC TCTCGCTGTG GTCGCTGCAT GATCCGCGCC AGCCGCGCCG GCAGGGCCTG CTGCACGGCG AGGGCGTCGT CGCCGGCGCG TCCTACTCCC CGGACGGCCG GACGCTCGCG GTAGCCACCA CCGGCCTGCC GGACGCCGCG ACGGAGCCCG GGTCGACCGC CGGCGTTCCG ATGGCGGTCG CCTCGTCCCT GGCGCCGCCG GAGAAGTCGC GGTCCGTGGT GGAGATCTAC CAGCCGGGGG ACCCGCGGGG GCTGCACCGG CTCACCTCCT TCGCCCCCGC GAGCGGCGCG GGAATGGTGG CCTTCTCACC GGACGGGCGG GCACTCGCGG TGGCCGCCGC GGCGGGCGGC GGCGACGTCG GCGTCTGGGA CATGTCGAAC CCGGCCCGGC CGCGCCCCCG GCTCGCCCTG CCCACGCCCG CGGCACCCTC CGACTCCGCC GGACCCTCCG ACTCCGCCGG GCCCTCCAAC CTGGCGGCGC TCGCGATCGC CGGGTCGGCG CCCGCCGCAC CCGAACCGGC GGCGCTCGAA TCGGCCACGC CCCCACCGGC CAGCCTCGCG CCGACCGCGC TCGCCTTCGC CGACGGCCCC GCCCGGACAC TCGCCGTCGC CGACGGGAAC GGCGCGCGCG TGTGGGATCT CGACCCGCGG ACCGCGCGGG ACCAGGTCTG CGGCCGGGCC CAGGCCGAGA TCACGAGGCG GGACTGGCGT AGGTACATCC CCGACCGCCA CTACTCGCCG CCCTGCCCCC GGAACTGA
Protein sequence | MVIARRHLER SRGNQPVSIG PYVVERQLLE AGTGPVYLGR DPEGGQVAIK VISAAFARDH DFRRRLRADL ETVRALAPPC LADIIDADTS AHPPYVVTEF VDAPTLAATV AQGGPLAVPD VRRLAVALGS ALTGLHGAGL VFGDLKPANV VLFEGGIRLV DFGLSRVLNT VALPGRGGSG PGMGTPAFIT PEHVLRQPLT MASDIFTWGG AVLFAATGRL PFGNGSPQVL LQRAVYAEPD LTGLDPVLRD VVSATMRKDP SRRPGAAELL EVLGRLVGGL PAPTGLDPLE IAAAAAVSAA APPTETPAAP PDEMSAVEAP ADTAGVVPAA AVAVAEAAEP APVPASAPDA AVAPDPTPDS DAEPAPEAPA ESEPTAESEP TAESEPTAES ETKLELGPEA SLEPEPTAKS EARPEPESGP GAGTGAAASA AAEPGIGRPP AQSSEAAQSS ELSEPTALAA LADGRRRPDG HWLFRLFMAG IASLAVLTMA VEITGAVRAQ AAVRASKATA GQARALLERQ PDLAGQLAAA AYEIAPTAAA GEALIAAAVR RSGHLPGDVR DLAVAPDGSS IVTAGDTGAG LWNLTDPSAN RRITAFPVDG PAVSAVGYVS SPGRTAAAGQ IIVTAAGPAG AGESKVQLWR VTPDGAVERL GVLAGHTGTV GEIAVSRAGD AIATGSSDGM LRLWDVTDPR APAELAVLRT PGPITALAFS PGGDQILVGG AARLSLWSLH DPRQPRRQGL LHGEGVVAGA SYSPDGRTLA VATTGLPDAA TEPGSTAGVP MAVASSLAPP EKSRSVVEIY QPGDPRGLHR LTSFAPASGA GMVAFSPDGR ALAVAAAAGG GDVGVWDMSN PARPRPRLAL PTPAAPSDSA GPSDSAGPSN LAALAIAGSA PAAPEPAALE SATPPPASLA PTALAFADGP ARTLAVADGN GARVWDLDPR TARDQVCGRA QAEITRRDWR RYIPDRHYSP PCPRN
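The gene sequence starts with GTG while the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons and is read as methionine at the initiation position, although it encodes valine elsewhere. The following is a minimal sketch of that behaviour, not the pipeline used to produce this record. `translate_cds` is a hypothetical helper; the codon table is built from the standard NCBI ordering, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Amino acids in NCBI order, with bases cycling as T, C, A, G at each position.
# Table 11 assigns the same amino acids as the standard code; it differs only
# in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed by NCBI for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS, reading a valid first codon as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")        # initiator methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate_cds("GTGGTCATTG CCCGGCGACA CCTCGAACGT"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields MVIARRHLER, matching the first ten residues of the listed protein sequence.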