Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5936 |
Symbol | |
ID | 5674257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7209247 |
End bp | 7212336 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244784 |
Product | serine/threonine protein kinase |
Protein accession | YP_001510186 |
Protein GI | 158317678 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0192244 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGGCG AGCGCCGGGG CGAACCCGGT CCAGCCGGTC CACGCGCTCT CGTCGGGCGT CGGTACCGGC TGGACGGGGT GATCGGCCAG GGCGGCTTCG GTGTCGTGCA CCGGGCCACC GACGAGCTGC TCGGCCGGCA GGTCGCCGTC AAGGAGGTCC GGCTCCCCAC CGACGGCAGC GAGCGGGAGC GGGAGCTCGC GCGGGAGCGG GTGCTGCGGG AGGCGCGCGC CGCCGGCCGG CTGCATCATC CCGGCGCGGT CGCGGTCCTG GACGTCATCG CCGAGGGCGA CCTGCCCTGG ATCGTCATGG AATTCGTCGA CGGGCGGTCC CTGTCGGCGA TCATCGAGGA GCGCGGGCGG CTCGGGGTCG GCGAGGTCTG CCGCATCGGC ATCAGCCTCG CCTACGCGCT GGAGGCCGCC CACCGCCTGG GCATCGTGCA CCGTGACGTG AAGCCGAGCA ATGTCCTGGT GACAGCGGAC GAACGCGCCC GGCTCACCGA CTTCGGGATC GCCGTCAGCC ATGGCGACCC TCGGCTGACC AGCACCGGGA TGGTTCTCGG CTCCCCGGCC TACCTGGCAC CCGAGCGGGC CCGCGGCGAC GCCGGAACGG CCGCCAGCGA CCGGTGGGGG CTGGGCGCCA CGCTGTTCAC CAGCGTCGAG GGGGTCTCCC CGTTCCCCGG GAACGACCCG ATCTCCATTC TCGCCGCGGT GGTGCAGAGC CGGCGCCGGC CGTTCCGCGC CGCCGGGCGG CTCGCGCCCG TGATCGACGA TCTCATGGCC ACCCACCAGG CACGCCGGCC GAGTCTGGCG ACGGTGCGCT CGCGGCTGCG CGACATCCTC GAACGCGGCG GGGACACCCG GCCCTCCCGG GCCCGCAGCC GACCATCCCG GCCCGCGGTA ACAGCGATCA CTCCGACGTC GGGCACACCC GCGGCCCCCG CACCCGTCGC CCCCGCACCC GTCGCCCCCG GGCCTCCGAC GGCCCCGATC CCGGCCGGGT TCGCGGCGGA CGGCGGTGAC ACGTTCACCG CGGGCGGAGC ATCCACGGGC GGCGCATCCC CGGCCGACCA GGACACCGTG GTCAGCCAGT ATGCCGGGCC CGACGAGACC ACCGTCCTCG GCGAGTCCGT CCTCGGTGGA CCGGCGACCG CCGGCGCGCG CGCCGGTGAG ACGCACAACG CCATGGCGGA GGACGACGAA ACGTCCGACG ACCACTCGTC CCCGGAGGAC GGCGGCGCGG ACCTGACCGC GCTGCTCACG GGCACGACGA CAGCACCGGC CGGCGGCGCA CCCGCCGGCC CACCGCCAGC CGCCCCGGGG CTGGCCGGCA CCGTGCTGAT GAACGCGGCG CCCAGCGAGG CACCACCGGA CGACACCATG CCTGGCGAGG CACTGTCAGA CGACACCGTG CCCGGCGGCC CACTGTCGGA CGACACCCTG CCCGGCGAGA CACCATCAGA CGACACCCTG CCCGGCACCC TGCCCGGCGA GGCGCCGCCG AATGACACCG TGCCCGGCGG GGTGCTGGCG GACGACACCC TGCTGGCCGA GGCATCGCCG GACGGAACCG TGCTGACCCG GACGTCGCCG GCGAGCACGG GCAACACCGG GTCCGACGCG GACAATGACA CCCGCATCGA CATCGGTACA GACAGCGCGG CGGTCATCGC CGCGGAGGAT GCCGGCTCCC CGATACCGGA CTCGGCGGCC GTCGAACCCA CGGCGAGCCG GGTCGGGGCA GGCGAGCCGG CGACGAGCGA CGCACGGCCG GCGGGAACGG CGAGACAGTC GGGAACGGCT GGTGCGCACC CGCGGTTGAC CGGTCGGCTG GAGCGGCTCC GGGCGATGGA GCGGCGGCGG GGCGCGACGG TTGAGCGTCC CCGGGGCGCT GAGCGGCTGC GACCGGTCGA CCGCATGCGG GCACTCGGTG GCATCCCGGC GCGGGCGCGG ACATCCCAGG ACTCGTTGGA TCCGATGACC GGCGGCCTGC CGATACCCGG CGCCGGTGCG GCATCGGCCG CGACGGGAAC GGTGCCGGGC CCGGCCGGTT CCGCGGGCAC CGAACCGGCC GCCACCACGG CCACCGGGAC AGCCAAGGCC ACCGGGACAG CCGGGACGAG GAGTACGCCC GACGCCACGG GAACGCCAGG CACCGCCACG GAACCGACGT CCGCTGTGGG GCGCCGGTAC TCGGCGGCCG CGGCGTTGTT CTCCCGGCCG GGCAGGTCCA CGGCCTCCGG CGATTCCGGT CCGGGGATCT CCGCGGGTCC GTCGAGCGCA TCCCGCGCCG CCGCCCATCC GCGACGGGAG CCGGAGCAGG AGCCGCCGAC GAGTGGGCAG AACCTGTTCC GCCCGCCCTC CGAACGGCAG CCGGAGGAGC GGCGGCACCC GGGTGGGGTG CTGGGGCGTC CGGACGGGGC GAGCGCCGTT ACGCCCAGCC GCGGACGTCG CCTGGCTGTG ATCCTCATCG CGTCCGTCGT CGTGATCGCG CTTGCGGTCG TCGCCGCCGT CCTGGTGTTC GGCGGGGGTG ACGACGGTGC GGAGCCGGCC GGAGCCGGGC CGCGCCCGAC CGTCCAGGCT GACCAGGAAG CCGCAGCAGT CCAGGCTGGA GACCTGGTCG CGTCGAATTC TGAGCCGGTC ACGGCGCCTC CGGGCTGGGT CTCCTACGTC GATCCGACAG GCTGGTCGAT CGCCTACCCG TCGCGCTGGC AGCGGCGGCC CGGCCCCGGG GGCGAGGGGA ATACGGACTT CGTGGATCCG GCTACTGGTA CGTTCTTGCG CATCGGGAGC ATCGCGAGTG CGAACACCTC CGCTATTGAG GACTGGCGCA CGAACGAAAT CAGCTTCCGG GATCGAGTGC GGGACTACCG GCGGGTACGG ATCGAACCAG GTGACGGGGC GGACGGCGCG ACACAGGCCG ACTGGGAGTT CACCTACGCC TCCGGTGGGG GGACGGTGCA TGTGCTCAAC CGTGGCGCGG TGCGAAACGG GCATGGATAC GCTCTGTACT GGTACACGGC CGAGGAGCTG TGGGAGGCGG ATCAGCCGCT CATGCGACAG CTCTTCGCGA CCTTCAGGCC GGCCTCATGA
|
Protein sequence | MSGERRGEPG PAGPRALVGR RYRLDGVIGQ GGFGVVHRAT DELLGRQVAV KEVRLPTDGS ERERELARER VLREARAAGR LHHPGAVAVL DVIAEGDLPW IVMEFVDGRS LSAIIEERGR LGVGEVCRIG ISLAYALEAA HRLGIVHRDV KPSNVLVTAD ERARLTDFGI AVSHGDPRLT STGMVLGSPA YLAPERARGD AGTAASDRWG LGATLFTSVE GVSPFPGNDP ISILAAVVQS RRRPFRAAGR LAPVIDDLMA THQARRPSLA TVRSRLRDIL ERGGDTRPSR ARSRPSRPAV TAITPTSGTP AAPAPVAPAP VAPGPPTAPI PAGFAADGGD TFTAGGASTG GASPADQDTV VSQYAGPDET TVLGESVLGG PATAGARAGE THNAMAEDDE TSDDHSSPED GGADLTALLT GTTTAPAGGA PAGPPPAAPG LAGTVLMNAA PSEAPPDDTM PGEALSDDTV PGGPLSDDTL PGETPSDDTL PGTLPGEAPP NDTVPGGVLA DDTLLAEASP DGTVLTRTSP ASTGNTGSDA DNDTRIDIGT DSAAVIAAED AGSPIPDSAA VEPTASRVGA GEPATSDARP AGTARQSGTA GAHPRLTGRL ERLRAMERRR GATVERPRGA ERLRPVDRMR ALGGIPARAR TSQDSLDPMT GGLPIPGAGA ASAATGTVPG PAGSAGTEPA ATTATGTAKA TGTAGTRSTP DATGTPGTAT EPTSAVGRRY SAAAALFSRP GRSTASGDSG PGISAGPSSA SRAAAHPRRE PEQEPPTSGQ NLFRPPSERQ PEERRHPGGV LGRPDGASAV TPSRGRRLAV ILIASVVVIA LAVVAAVLVF GGGDDGAEPA GAGPRPTVQA DQEAAAVQAG DLVASNSEPV TAPPGWVSYV DPTGWSIAYP SRWQRRPGPG GEGNTDFVDP ATGTFLRIGS IASANTSAIE DWRTNEISFR DRVRDYRRVR IEPGDGADGA TQADWEFTYA SGGGTVHVLN RGAVRNGHGY ALYWYTAEEL WEADQPLMRQ LFATFRPAS
|
| |