Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_4047 |
Symbol | |
ID | 3907008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4831006 |
End bp | 4833756 |
Gene Length | 2751 bp |
Protein Length | 916 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637881376 |
Product | WD-40 repeat-containing serine/threonin protein kinase |
Protein accession | YP_483126 |
Protein GI | 86742726 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATAT CCCGGCGCAA CCTCCCTGGG CTCACCGGGA CTGATCTCCA GACTATCGGT CCGTATACAG TTGAACGGAA GCTGGTCGAC GCCAGGACAG GACCAGTGTT TCTTGCACGT AACGGCGAGG CGCGTCCGGT CCTCGTCAAG ACGATTACTG CGCCCTTCGG GCGGGACGCA GAGTTCCGTC GGCGCCTGCG CGTCGACCTC GACAACATCC GCCGGTTGGC GCCGTCCTGC CTCGCCGCCA TCCTCGACCT CGACACGGGC GCTCGCCCCC CATACGTCGT CGCGGAGTTC ATCGACGCTC CGACTCTCGC CGCAACCGTC GCCGGCGGCG CAGCGTTGTC CGGACCAGAC ACCTACCGGC TCGCCGTCGG GCTGGCGACG GCTCTGGCCG CACTGCACGA ATTGGAAATT TTCCTCGGCG ACCTGAAGCC TATCAATGTG GTTCTTTCCG GGCAGGGAGT GCGGCTGGTT GACTTCGGGC TCTTCCGAGC GATGAATGCG GTAAGCATCA ATAATCCGGG CGGACCGCCG TCCGGTATCG GAACACTTGC GTTCATAACG CCCGAACAGG CTCTCGGGCA GACCGCCACC GTCGCATCGG ATGTCTTTAC CTGGGGCGGC ATGCTGTTGT TCGCCGCGAC CGGCCGGCCG CCATTCGGTG CCGGAACGCC CCGGGTCCTG CTGCAGCGGG CCGTCTACGC GGAACCCGAC CTGTCCGTGT TCTGCCCGGA GCTGCGAGAG CTGGTCGCGG CGGCGATGCG CAAGGACCCG AAGCGCCGGC CGGCCGCGGC CGAACTGCTC GAACAGCTGA TGGCCTATCC GACACGAAGC GAGGCGGAGC CGGCGGTCGA ACCCACCCGG CGTCTCGCCC TGCCGGCGGG TGTCATCGAG ACGCTGGTGC CGGTGCAGAC GAGGCGGACG GTCGAGTCCG AGACGAAACC GGAGGTCGTA GGCACGAGCG ACCTCGCCGC TCCTGCCATC GGTGCCATCC ACGGCCTCGA GATCGTGCTC GAAACCGTGA CCGTACTCGA AACCGTGACC GTGCTCGGGA CCGGGCCGGC CGCGGAGATC ACGGCCGTCC CGCCGGCCGC GGAGATCACG GCCGTCCCGC CGGCCGCCGC GGTCGCCCTC GGCGCCGTCC GTACCGGAGC AGAACCGCGG ATGCGTCCGT CGTCTCCCGG TCCGTCGTCT CCCGCTCCGT CGTCTCCCGG TCCGTCGGCC GCTCTGGCCG GTGTGCCCGC CCCACTGCCC ACCGCGGCCG CGAGAGCGCG CGCAAGCTAC GTCCAGGAGG GCACCGGGAA TGTCTCCGCC TTGTCATCAC CATCGTCACC GCGGGATCAC CTGGACCGCG GGTGGCTTCG ACGCGTGCTC TCGATCGGCG TGGCGGTCTC GGTGCTCGTC CTCTCGACCG TTGTGATCGT CGATACGCTG CGGGAGCGCA GCGCGGCGGC CACGTCGCGT GAGGCGGCCC GTGGCGCGAT GCGCCTTCTC GACCGGGAAC CGGATCTCGC CGGCCAACTC GCCGTCTCGG CCTACCGCAT GGCTCCGACC TCGGCTGCGG CGGAGGCGCT GGTCAATGCG AGCATCCGGC AGATCGGGCC GGCGACGGGA GCGATCCGCG ATCTGCTGAT CACCCCGGAT GGTCGATATC TGATCATCGT CGGTGACTTC GGCGGGTCGG TGTGGAACAT CATCGGCCCT GGCCGGGTGC GCTACATCAC CGACCTGCCC GCGGTGGCCG CGGCCATGGG AGATCGCAGC CCCCTCGCGG CCGGGGTCGC CCCGGCGGCC GCGGTGAAGG GCCTGCTCGC CGTCGCGCTG ATACCCGCAT CCGGTCCGGC CACGGGTGTG CGGGCGTCCA CGATCATGGT GACGGCGGGG ACGGACGGGG TGATCCGGTT ATGGCGGCTG GCCCAGCCCG GGTCCACGGA CGACGGTGAC CTTCAGTCGG CGATCAACAC CGGTCAACGG GTGAGTCTGC TCGCCGAGTT GCGGGGACAC ACGGGTGCGG TGGCGAGCGT CGCGGTGAGC GGGGATGGTC GAACCCTGGC CTCGGCGGGC GCCGATCGCG TCGTCCGTCT CTGGGACATC GGCCATCCGC AGAATCCTCG CGCCCTCGCC GAGCTGCCCC AGCCGGCGGA GGTCACCAGC CTCGCCTTCA CCCCCGACGG CGACTCGCTC GCGGTCGGGG GGGTAGGCCA TCTCTCCGTC TGGGATGTGA CCGCTGCCGG GCAACCGCGC CGCCGGGCCC AGCTGACCGC CCCCGCCACT GTCCGCAAGC TCCTCGTCAG CCCGGACGGC CGGTGGCTCG CCGTGGCGAG TACCTCCGAC GGCGGCTCGC TGACGGAGAT CTATGGGCTG GACAGCCCCC GGGGACTGCA CCGCCTCACC GCCATCGCGA GCCGGCCAGG CCAGGCCGGT TCGATCGCGC TCTCCGCCGA CGGGCGGGTC CTCGCTGTCA GTACCCCGGC CGGTCAGGTA ACGCTCTGGG ACATGCGCTC ACCATCCCGG CCGGTGCAGC GGGCCACCCT GCCGGTCGGC ACCGCGCCGA CGGCGACGGT CTTCGGACCT CTGGGACATG AGGGCGTCCT CGCAGTCGTA GCCGGTGACG CCGTCCGTCT CTGGCAGCTC GACCTGCTCG CGGCCGAGGA CGAGATCTGT GCCAGGGCCG AGGGCCGCAT CAATCGGGAG CAGTGGCGGA CCTACCTCGG TCACCGGCAC TACGACCCGC CCTGTGACTG A
|
Protein sequence | MAISRRNLPG LTGTDLQTIG PYTVERKLVD ARTGPVFLAR NGEARPVLVK TITAPFGRDA EFRRRLRVDL DNIRRLAPSC LAAILDLDTG ARPPYVVAEF IDAPTLAATV AGGAALSGPD TYRLAVGLAT ALAALHELEI FLGDLKPINV VLSGQGVRLV DFGLFRAMNA VSINNPGGPP SGIGTLAFIT PEQALGQTAT VASDVFTWGG MLLFAATGRP PFGAGTPRVL LQRAVYAEPD LSVFCPELRE LVAAAMRKDP KRRPAAAELL EQLMAYPTRS EAEPAVEPTR RLALPAGVIE TLVPVQTRRT VESETKPEVV GTSDLAAPAI GAIHGLEIVL ETVTVLETVT VLGTGPAAEI TAVPPAAEIT AVPPAAAVAL GAVRTGAEPR MRPSSPGPSS PAPSSPGPSA ALAGVPAPLP TAAARARASY VQEGTGNVSA LSSPSSPRDH LDRGWLRRVL SIGVAVSVLV LSTVVIVDTL RERSAAATSR EAARGAMRLL DREPDLAGQL AVSAYRMAPT SAAAEALVNA SIRQIGPATG AIRDLLITPD GRYLIIVGDF GGSVWNIIGP GRVRYITDLP AVAAAMGDRS PLAAGVAPAA AVKGLLAVAL IPASGPATGV RASTIMVTAG TDGVIRLWRL AQPGSTDDGD LQSAINTGQR VSLLAELRGH TGAVASVAVS GDGRTLASAG ADRVVRLWDI GHPQNPRALA ELPQPAEVTS LAFTPDGDSL AVGGVGHLSV WDVTAAGQPR RRAQLTAPAT VRKLLVSPDG RWLAVASTSD GGSLTEIYGL DSPRGLHRLT AIASRPGQAG SIALSADGRV LAVSTPAGQV TLWDMRSPSR PVQRATLPVG TAPTATVFGP LGHEGVLAVV AGDAVRLWQL DLLAAEDEIC ARAEGRINRE QWRTYLGHRH YDPPCD
|
| |