Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1947 |
Symbol | |
ID | 5670348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2341986 |
End bp | 2344784 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641240868 |
Product | WD-40 repeat-containing serine/threonin protein kinase |
Protein accession | YP_001506290 |
Protein GI | 158313782 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0876454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.992815 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGGAG CTACGCCACC ACCGGTGTCC GGGAACACCG CGCGCCCGTT GCGGTCCGAG GACCCGGTAC AGCTCGGCGC CTACCGGGTG GTGGGCCGGC TCGGCCAGGG CGGGATGGGC GCCGTCTTCC TCGGGCAGGC GCCGGACGGC ACCGCCGTCG CCATCAAGGT GATCCGCCCC GAGCTGGCCT CCCGGCCGGA GTTCCGCGCC CGCTTCGCCC GCGAGACCGA GAGCGCCCGC CGGGTCCGCC GCTTCACCAC CGCGGCCGTG CTCGACGCCG ACCCGCACGG GCCGCAGCCG TACCTGGTCA CGGAGTTCGT CGAGGGCCCG ACGCTCTCCC GCCACGTGGC CGCGCGCGGC CCGATGCGGC CGGCCGATCT CGAACAGCTC GCGGTCAGCG TCGCCACCGC GCTGTCGGCC ATCCACGCCG CCGGCATCGT GCACCGCGAC CTCACCCCGG CGAACGTGCT ACTCTCCCCG GTCGGCCCGA AGGTGATCGA CTTCGGGCTG GCGCGTGAGT ACGACACGGT CAGCGACCTG TCCCGCAACG TGAAGCAGGC GATCGGCACG CCCGGCTACA TGTCGCCGGA GCAGATCCTC GACCTGCCGA TCACCGCCGC GGTCGACATC TTCGCCTGGG GATCAATCAT GATCTTCGCG GCGACCGGGC ACCCGCCGTT CGGGCAAGGC CGGATGGAGG CCGTCCTCTA CCGCATCATC AACGAGCAGC CGCAGCTCGA CGGCGTGACG GGCGAGCTGC GCGAGCTCGT CGAGCTCGCC ATGCGTAAGG ACCCGACCAC CCGGCCGAGC GCCGAGGAGC TGCGCGCGTC GCTGATGGGC GGGGTGGCTA TCCCCGACCG GAGCGCGGCG CCCGGGCCGC CGGGCGGCGC CGAAGGCACG GCCGGGGCGC CCCGCGGCCG CCGCTGGTCC CGCGGCGGGC GCCGCGACCG CGCGCAGACA GCCTCGGGAA CCGCCGCCGG CGCAGCCGCC GGAGCGGCCG CGAGCGGGCC GGTCGGACCA GGCGACGTGT CCGGAGCAGG CGGAACGCCC GGAGCAGGTG CTGGCGGAGC AGGTGCTGGT GGAGGCGGGA GCGCTGCCGG AAGCAGGAGC GGGGGCGCTG TCGGAGGCCG TGGCGGGAGC GGAACCGCGG CGGCGGCGCT CGGGCCGCTG ACACACGCGC CGCGGGCCGG TGCCGGGCCC GCGCTGGGCA CCCCACCGCC GGCGTCGTCG TTTCCGGGTG TCTCCTCACC GGGCTCCGCC CAGCCCCGCA CACCGTCGGG ACCGATCTCC CAGCCGCCTC CATACCCGCT GTCACCCGCA CCCCAAGGCC AGGGCGGCCA ACCGGCCGGC GGCCAGGGCA CCGGCGGGCC GGGGTGGTCG GGGCCGGTGC CCGCGGGGCC GCTCGCGCGA CCAACCGAGT CGTCCGGGCC GGTGTCACCG GCGCCGGGTG CCCCGGCCCC GGCCGGCTCC GGCGAAGGAG CGGGTGCCGA TCCGTCGCAG GGCGCGGAGT CCGGCCACGG GTCCAGGCGC CGCACCATGG TCCTCGCCGG GTTCGCGGCG CTACTCGTCG CCGCCGTCGT CGTACCGATC GTGACCCTCG GTGGCGGCGG TGACGACGGG TCGAGCGCGG ACCGCGAGGC CATCGCGGCG CACCTCGCCG CGACCGCCGC GGCCAGCCGC GCGCAGGACC CGGCGCTCGC CGCTCGGCTG AGCCTCGCCG CCTACCGGAT CGCGCCCGTC CAGGCCGCCG AGGACGCCAT GGTCGCCTCG TTCGCAGGTG CGTCCGCGGT GCGCACCCCG GCCTCGGACG TCCCCTACGG GGATATCGCC ATCAACCCCG CCGGCACCGT CCTCGCCGCC ACCAGCGCGG ACGGCGTACT CCGGCTGTTT CGGCTGATCG ACGGCGGCGA ACCGGCGCTG ATCAGCGAAC GCCGCTCGGA CGACCCGTCC GACGGCATCG CGTTCACCGG CGACGGCACG CGGCTCGCGA CCGGCGGAAC GCAGAGCGCG GCCCGCCTGT GGCAGGTCAC CGATCCGGCG AACCCCCAAC AGGTCGCCCA GCTGGACGGC CTGTCCCGTC CCGTTCACGT GGCGCTGTCC GCGGACGGAT CGCTGCTGGC CGCGGCCGCC CAGGACGGCA CGTTCGGTCT GTGGAACGTC TCGAACCCGG CCGCGCCCGC GATGCTGCGC CTCCAGCTCA CCACCGCCGT GATCACGGAC ATGGCCCTGA CCCCGGACGG GAAGCTGCTG GCCACCGCGG GCATCGGTGG TGACGTCCAG CTGTGGAACA TCACGGACCC TCGTAAACCG GTCCAGGCAG GGGTGGCGTC CGGCGCGGTC GGCGCGGTGA ATGCCGTCAC CTTCAGCACA GATGGCCACC AGATGATCAC CGGTGGCGAC GATCGCACCG TCCGTGTCTG GGATGTGCGC GACCCGATGG CCGCCCATAT CACCAGTGAG CTGCACGGTC ACACGGCCCC GGTCAACGCC GTCGTGTTCG GTGCCGGCGG CCAGCCCGTC AGCGGTGACC AGGCGGGCGT CGTCGCCTAC TGGGACACCT CGAGTGCGGC GCCGATGGTC CAGGTGGGCA ATCTGAAGAG CTCAGTCCTC GCCCTGGCGA CGGACGCCGC GGATGACCGC CTCGCACTGA GCACCGAGTC CGGGCAGGTC GCGGTGTGGT CGACGGACGC CGCGAAGCTC ACGACGATCG CCTGCGCCGA CCCGGACGCC CGCATCAGCC GGGCCGAGTG GGAGCAGCGG ATCAGCGAGC TCCCGTTCCG GGACCCGTGC ACCGTCTGA
|
Protein sequence | MTGATPPPVS GNTARPLRSE DPVQLGAYRV VGRLGQGGMG AVFLGQAPDG TAVAIKVIRP ELASRPEFRA RFARETESAR RVRRFTTAAV LDADPHGPQP YLVTEFVEGP TLSRHVAARG PMRPADLEQL AVSVATALSA IHAAGIVHRD LTPANVLLSP VGPKVIDFGL AREYDTVSDL SRNVKQAIGT PGYMSPEQIL DLPITAAVDI FAWGSIMIFA ATGHPPFGQG RMEAVLYRII NEQPQLDGVT GELRELVELA MRKDPTTRPS AEELRASLMG GVAIPDRSAA PGPPGGAEGT AGAPRGRRWS RGGRRDRAQT ASGTAAGAAA GAAASGPVGP GDVSGAGGTP GAGAGGAGAG GGGSAAGSRS GGAVGGRGGS GTAAAALGPL THAPRAGAGP ALGTPPPASS FPGVSSPGSA QPRTPSGPIS QPPPYPLSPA PQGQGGQPAG GQGTGGPGWS GPVPAGPLAR PTESSGPVSP APGAPAPAGS GEGAGADPSQ GAESGHGSRR RTMVLAGFAA LLVAAVVVPI VTLGGGGDDG SSADREAIAA HLAATAAASR AQDPALAARL SLAAYRIAPV QAAEDAMVAS FAGASAVRTP ASDVPYGDIA INPAGTVLAA TSADGVLRLF RLIDGGEPAL ISERRSDDPS DGIAFTGDGT RLATGGTQSA ARLWQVTDPA NPQQVAQLDG LSRPVHVALS ADGSLLAAAA QDGTFGLWNV SNPAAPAMLR LQLTTAVITD MALTPDGKLL ATAGIGGDVQ LWNITDPRKP VQAGVASGAV GAVNAVTFST DGHQMITGGD DRTVRVWDVR DPMAAHITSE LHGHTAPVNA VVFGAGGQPV SGDQAGVVAY WDTSSAAPMV QVGNLKSSVL ALATDAADDR LALSTESGQV AVWSTDAAKL TTIACADPDA RISRAEWEQR ISELPFRDPC TV
|
| |