Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5419 |
Symbol | |
ID | 5673750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6540882 |
End bp | 6544865 |
Gene Length | 3984 bp |
Protein Length | 1327 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641244274 |
Product | serine/threonine protein kinase |
Protein accession | YP_001509680 |
Protein GI | 158317172 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.388014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGCGA GGATCGTCGT CCATGGTCCG TGGAGCGGGC CGGGTGAGGA GGCGACCGCC GTCTACCTGC GTGACCACCT GGACGAGCGG TGGACGGTCG TCTGCGGCCG CCAGCTCGTC GCCGGCTCGG GTACGCGTGA CTCCGACTTC ATCGTGGTCG GGCCGAACAG TGTCACCGTC ATCGAGGAGA AGTCCTGGCG CGTCGACCTG ATCGGCACCG AGGAGCGCTG GTACGAGCGT AGGACCCTGG AAGAGCGTGG TAATCCGATC AACCAGGCCG TCGGCGTCGC GCGGATCCTC GCCGGCAACC TCAACCGGTT CAACAAGCCG CTCGCGTCCG CGCTCAGGAG CGCGGCGCCG CCCGGCCAGA GGCAGCTCCA TCTCGTCCAC CCGCTGGTCG TGATGTCCTC GCCCGAGGCC ACGTTCCAGA TCGAGGACCC GCGAGCGGCC AGCCAGGTCG TGCGGCTGGC CGGCTGCGAG AAGCGCCTGA CGGAGATCGA CCGTTCGGTC GCGGCGTACT TCGACCTCGC CCCGTTCCGT GACGCGGTCC TGTCCGTCGT CACCGGCCTC AGGCCGAGGC CCGAGACGCC GGAGACGCTC GGCGCCTACA AGATCGTCGG CAACATAGAG CCGACGCCAC GTGGCCGCCG CTACGTCGGC CGGCACCCGG GCGGCTCGGT GCGGGTGCTG ATCACCTACG AGCGCCAGGG CCTCACCGAG GCCGAGCTCA AGGAGAGCGA CAGGCTGATC CTGCGCGAGT ACGACGCGCT GCAGCGCCTC GCGCACCTGC ACCGGGTGTT CCCCGTCGAC CCGTACTTCG GCGTCAACGA CGACCAGATG TGGGTCGCTC CGATGCACCC GCCGGGCCCG AAGCAGAAGA CGCTGCGCAA TCTGATCCGG GCCGGCAGGC AGCCGTCGGC GGCCACCTTC GCCACGGTCG CCGCGGACGC GTTCGCCGGG CTCGCCGACA TCCACGCGGC GGGCATCACC CACCGGGCGC TGCACCCGAG CCGGGTCTGG GTGACGCCGG AGACGAACCG GGTCGTGTTC TCCGACTTCC TGATCGCGAA GATCGCGGAC GCGCGCACGG TGCTCGACGC GGACGACGTC GATCACGACG GCGGGCCGTT CCGGGCGCCG GAATGCCGGC GTACCGCGCA CCTGGCGATC AACCCGTCGG ACGTCTACTC GCTGGCGCTG TGCCTGCTCG ACTGGTGGGA GCTGCGCCAG TCCGACCCGC CGCGCGAGCC GCAGGCGCCG CCGTCGCTGC CGGACCGGGT CCGCGAGGCG CTGAACGCGT GCTTCGCGTC GATGCCGCAG GACCGCCCGT CGGCCCGCGA GGTCGCGGAC CTGCTGGCCG CGCACCTGGC CGAGGAGCGC AAGCGCGCGG TGCGTCTCGC ACCAGGTGTG AAGATCGGCA AGTACGAGCT GGTGCGCCAG CTCGGCGACG GCGCCACCGC CACCACCTGG CTCATGATCG ACCGGGCGTT CGACATGCAC CACACGCTCA AGATCATGAA GTCGGCCGAG CTGGTGGACA ACCTGCGCGG CGAGTTCGTC CTGCTGCACA GGCTCAAGCA CGACCGGATC GTCCGGGTGC ACGACTACCT GCTCGAGCCG CCCGGCCCGG CGCTGATCTC CGAGTATGTC GAGGGCAGGA CCGTCGCCCA GCTCGCGCCG AGGCTGCGCG GCAGTCTCGG CGCCGCCCGC AGGATTCTCG CCGACATCCT CGACGCGCTC GAGTACGCCC ACAACCGCGA CGTCCTGCAC CGCGACCTGT CCCCGAACAA CATCATCGTC GATGCGGACG ACCGGGCGAC GTTGATCGAC TTCGGGGTCG CGTCCCGAGC CGAGGCCGAC GCGCACAGCC TGGTCGGCAC ACCGCCGTAC CAGGCGCCGG AGATCGCGGC CGACGGCGCC TGGTCGCCGG CGGCGGACCT GTACTCGCTG GCCGTCCTGG TCTTCGAGGC GCTGGTCGGC CGCGCGCCGT ACGCGGTCGA GGGACGGAAC CGCAACAAGT ACCTTCTCGT CCCACCGACG CCGGCGGAGG CCGAGGCGGC GGGCACCACC GGCACCATGT TCCTGGAGGT GCTGGCCCGG GCCGGCGACC CGAACCCGGC GACCCGCTTC CCTTCCGCCG CCAAGTTCCG TGACGCGATC GAGGCGGCGT TCGAGCCGCC TACGCCCTCC CTGCCGCCGG CGGGCACGGT TCCGGAAACG CCGGCCATCC CAGCGGGGGG CTCGCCAGTG ACGGTGACGC CGCCGCCCAC GACGGCGACG GTGACGCCTA CGACGGTGCC GGTGCCGCCA GCCGTCTCGG CCCAGCGCGG CCTGGTCGTC AATCCGGCCG TCGACGAGGT CCGCCAGCTG TTCCGCAACA GCAGGCTCGG TAACCGGGGC AACCGGGGCC TGGACTCGGA CTTCGCCGAC CAGACCTACA TCCCGACCCG TCTCGACGAG AAGCTGATGC CGCGGATCAT CGACGGCCGG CCACGGCTGG TCGTCTTCAG CGGCAACCCG GGCGACGGGA AGACGGCCTT CCTGGAACGG CTCGGCGTCG AGCTGCGCCG CCGCGGCGCC CGGGTCCTGG AGAGCGACGC GGCCGGCTGG CGGATGACGC TCGACGGGCA CGAGTTCGCC TCGGTCTACG ACGCCAGCGA GTCGCACGAG GGCATATCGG CGGACCAGCT GCTGCGCCGC GCGCTCGACC CGCTGAACGA CCAGTCCGGT GCTGGCCGCT ACACCGCGCT GCTTGCCGCC AACGACGGCC GCCTTCTCGA CTTCTTCGAC CGCGACGCCC ACCGTTACCC GGAGATCGCG AAGCTGCTCG GCGGCGGCGG CGGCGACGAC GGCAGCGGCA GCGGCCAGGC TGCCGACGCC GCGGGAGTGC TGCGGATCGA CCTCAAGGGC CGGTCGATGG CGTCGGTGGG CCTGGCGGCC GGCTCGCTGA CGGTGCGGAT GCTCGACGCG CTGCTCACCG ACGAGCTGTG GGCGCCGTGC GGCGGCTGCG CGGCCGAGCA CCGCTGCCCG ATCGCGGCCA ACGCCCGCGG CCTGCGCGAC GGGCGCGTCC AGGAGCGGCT GCACCGGCTG GTGCTCACCT CGCACCTGCG TCGTGGCCGG CGTCCCACGA TCCGCGACCT GCGTTCGGCG CTCGCCTACC TGGTGACCGT CGACCTCGGA TGCGCCGACA TCCACGACGA GTTCGGCGAC GGCTTCGGGA ACCCCGCCCG CCCGGAACGG CACTTCAGCG CGGCCGCGTT CGACGCCGCG GCCGGGCACG ACCGGCTGCT CGGCGAGTGG CGCAGCCTTG ACCCGGCGGC GGTGCCGGCG CCGCGGGTGG AGCGGTGGCT CGCACCCGGC ACCCGTTCCA CCGCGGACAT CGCCCGCGCC AAGCGGCTGC GCTACTTCAC CGCGACCGAC GCGGAGCTCG ACGCCGTCGA CCACCTCGGC CCGTACCGGC ACCTCGACGA GTTCCGTGCC CTGCTCACCA GCGCCGGCCC CGACCCCGGC ACCGACGGTG CGGCGCGCCG GACCGCGGTG GTGTCGAAGC TGCTGCGCGG CCTGTCGCGC TGCGTCGGGC CGGTCGGCTA CGACGGCGAC GGGGTCGCGG TCTCGGTGGG CGAGCCTTCC GAGGACGGCG GCGCGGTCGT CAAGGTACTG CCAGCCGACG AGTTCGAGGT CGTCGTGGCC GATGTGGACG ACCGGTTCGT CGAGACGGCG GCCGACATCG TCGTGCTGCG GCACCGTTCC GGCCAGGCGG ACCTGCCGGT CGACGTCGAC CTGTTCGAGC TGCTGATGCG CGCGAACGCC GGCCTCCTGC CGACCAACAC CGAGGTCGCG CCGCTGCTCG AGGAACTGGG CCTGTTCCGC GCAAGCCTCA CCCTCCAGCC CGCGAGCCGA GTGATCATCA TCGAGCCCAA CGGCCAGCGG AACGTCATCA AGGTGACCGG AACGACCATC GAACTGCTGC GGGACGCCCG ATGA
|
Protein sequence | MGARIVVHGP WSGPGEEATA VYLRDHLDER WTVVCGRQLV AGSGTRDSDF IVVGPNSVTV IEEKSWRVDL IGTEERWYER RTLEERGNPI NQAVGVARIL AGNLNRFNKP LASALRSAAP PGQRQLHLVH PLVVMSSPEA TFQIEDPRAA SQVVRLAGCE KRLTEIDRSV AAYFDLAPFR DAVLSVVTGL RPRPETPETL GAYKIVGNIE PTPRGRRYVG RHPGGSVRVL ITYERQGLTE AELKESDRLI LREYDALQRL AHLHRVFPVD PYFGVNDDQM WVAPMHPPGP KQKTLRNLIR AGRQPSAATF ATVAADAFAG LADIHAAGIT HRALHPSRVW VTPETNRVVF SDFLIAKIAD ARTVLDADDV DHDGGPFRAP ECRRTAHLAI NPSDVYSLAL CLLDWWELRQ SDPPREPQAP PSLPDRVREA LNACFASMPQ DRPSAREVAD LLAAHLAEER KRAVRLAPGV KIGKYELVRQ LGDGATATTW LMIDRAFDMH HTLKIMKSAE LVDNLRGEFV LLHRLKHDRI VRVHDYLLEP PGPALISEYV EGRTVAQLAP RLRGSLGAAR RILADILDAL EYAHNRDVLH RDLSPNNIIV DADDRATLID FGVASRAEAD AHSLVGTPPY QAPEIAADGA WSPAADLYSL AVLVFEALVG RAPYAVEGRN RNKYLLVPPT PAEAEAAGTT GTMFLEVLAR AGDPNPATRF PSAAKFRDAI EAAFEPPTPS LPPAGTVPET PAIPAGGSPV TVTPPPTTAT VTPTTVPVPP AVSAQRGLVV NPAVDEVRQL FRNSRLGNRG NRGLDSDFAD QTYIPTRLDE KLMPRIIDGR PRLVVFSGNP GDGKTAFLER LGVELRRRGA RVLESDAAGW RMTLDGHEFA SVYDASESHE GISADQLLRR ALDPLNDQSG AGRYTALLAA NDGRLLDFFD RDAHRYPEIA KLLGGGGGDD GSGSGQAADA AGVLRIDLKG RSMASVGLAA GSLTVRMLDA LLTDELWAPC GGCAAEHRCP IAANARGLRD GRVQERLHRL VLTSHLRRGR RPTIRDLRSA LAYLVTVDLG CADIHDEFGD GFGNPARPER HFSAAAFDAA AGHDRLLGEW RSLDPAAVPA PRVERWLAPG TRSTADIARA KRLRYFTATD AELDAVDHLG PYRHLDEFRA LLTSAGPDPG TDGAARRTAV VSKLLRGLSR CVGPVGYDGD GVAVSVGEPS EDGGAVVKVL PADEFEVVVA DVDDRFVETA ADIVVLRHRS GQADLPVDVD LFELLMRANA GLLPTNTEVA PLLEELGLFR ASLTLQPASR VIIIEPNGQR NVIKVTGTTI ELLRDAR
|
| |