Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4642 |
Symbol | |
ID | 5672985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5535509 |
End bp | 5540398 |
Gene Length | 4890 bp |
Protein Length | 1629 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641243500 |
Product | serine/threonine protein kinase |
Protein accession | YP_001508916 |
Protein GI | 158316408 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain [COG3899] Predicted ATPase |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAGC AGACCCGGAT CACTCGGCTG GTGTTTCCGA CTGGATGTGT GATCCGAAAG GAACCGCTGG GACCAGACGC GCAGCGGCGG TTGCGGCATG AGGTCGAGAT CCTCGGACGG CTGTCGGGAG TCCAGGGGGT CGCGCACCTC GCTGCTGGGG CCGTGGAGTG CGCGGGATCG ATCCTGCTGG TCGATGTCGA TGGCATGGCG CTGTCCGAGT GGACGACTCC ACTGGATCCG GCCGAGCTGG TCGATCTCGC CGAATCGCTG GCGCGGGCCG TCGCTGGGAT GCATCACCGG GGTGTGGTAC ACCGGGACAT CTGCCCGGCG AACATCGTGG TGAGCAAGGA CTTGGGAGTT CCCTGCCTGA TCGACTTCGC GCTTGCCACA GCTCTTTCGT CGGTCCGGCC GGAGTTCATG TACCACGGGG CAACTGTCGG GACGGTGCCG TACCTGGCGC CGGAGCAGAC CGGGCGGACC GGTCGCCCGG TCGACCAGCG AGCCGACCTG TACGCGCTCG GGGCCACCCT GTACGAGCTG GCTACCGGCG CGCCGCCGTT CGGCACGGAC AACCCAGTTC GGATCATTCA TGACCATCTC GCTCGGATCC CACTCTCGCC TGTGGCGGTG AATCCGTCGG TACCGGCCGG GCTGTCGAAC ATCATCACGC ACCTGCTGGA GAAAGAACCC GACGACCGCT ACCAGAGCGC CGACGGGTTG GTGCACGATC TCACCCTGGT ACGACGCGGT GCGGCGGCGG TGCATCCTGG CGAGCATGAT TTTCTGACGC GTCCGCTGAC GCCATCCCGG CTGTCCGGGC GCGACCGGGA GGTCGACGAG CTGGACGCGG CGTTCGCCGA GGCCGTGACC GGACGCTCTC ACGGGGTCCT CGTCGGCGGC GCACCGGGCG TGGGTAAGAC ATCCCTGGTC AGTGAGCTGC GGCCGATAGC GGCCGGCGTC GACGGCTGGT TCGTGGCCGG CAAATTCGAC CAGTATCGCC GTGACCAGGA GTATGACGGG GTCCGGCAGG CGCTGCGTGC GCTGGGTCGG CTGCTGCTGG CCGAGCCGGA CGACTACCTG GCCGAGGTGC GGGACCGGAT GTTGCGGGCG TTGGGACCCA ACGCAGGCCT GGCTGCCGCC ACCGTGCCGG AACTGGCGGT GCTGCTGAAG GTTCGCCCAG AGCCAGGCGA TCCGATGACC GCACAGGTAC GAGCGTCACG TATAGCAGTC GAGACACTAC GTGCCGTTTC CTCCCGGGAG CGACCGGTGG TGTTCTTCGT CGACGACCTG CAGTGGGCCG GGCGGACGCC GCTGGGCTTC GTCGACCTGG TGTTCGCCGG TGAGGAGCAG ATCGCGGGAC TGCTGCTGGT CGGCGCGTAC CGGGAAAGTG AGATCGACGC CGCTCATCCA CTGACGCCGA TGCTCGCCCG CTGGCGTCAC CAGGTGGTCG GGCCCCATCA CCTGCGGTTG GGTAGCCTGC CGCCAGCGGG GCAGGCGGCC TTGGTGGCTG ACCTGCTGCA CCTGTCCCCG CCGCCCGCCA CCGAGCTGGC ACAGATGATT GCGCCATCCG CCCGGGGAAA TCCATACGAC ATCGTGGAGC TACTCAACGC GCTGCGTCAC GACAGGGTAC TGACCCTCGG CGGGGGTGGG TGGCGATGGG ACTGGGAGGC GCTGCGCCGC CGGCTCGACC GGGTGGATGT GACCGAGCTG CTGGCCGCGC GGGTGGCCGC GCTGCCGCCA GCCACCGCGG AGGTGCTCGC GGTGATGGCC TGCCTGGCGG GCCAGGTGGA GCTGGACCTC CTAGTTGCCG CAACTGGGCT GCCGGCGGAC GAGGTCGAGC GGCGGCTCGC CCCCGCGTTC GGCGACGGCC TACTGGTACT GGAGTCCGAC GGCCGGCAGA GCGTGCGGTT TCACCACGAT CGGACGCGGG AGTCCGTGCT GGGCGGTCGC GCCGCGCAGG ACCAGGGCGC GAGGCGGTTG CGTCTGGCAC GGCGCCTGGT CGAGCGGCCC GAGTTCTTCG CCGTGGCGGC GGAGCAGTAC CTGCCCGTAG CTGATGCTGT GCATGGCACC GAGGAACGGA AGCGGGTGGC CGGACTCTTC CGGCAAGCCG CTGAGGAAGC GAAACTGCTG AGCAATTACT CACTGGTCGA GCGGCTCCTG ACCGCGGCGG TCGAGGTCAT CGATCCGACT GACACCGACC AGTTGATCGC AGTCCATACC GACCGGCACG CCGCCTTGTA CAGCCTCGGC AGGCTGGAGG AGGCGGATGA CACGTACCAG ACCATCGGCC GGTTGTGCAC CCACCCGGTT CAACTCACGG CCGCCACAGT GGTGCAGGTC AGCAGTCTCA CCAACCGGGG ACGCGGTGGC GAGGCGATGC GGCTCGGCCT CGACCAGCTG CGGCACCTCG GCATATCCGT CCCGGACGAG AACGATCTCG ATGCGGAGAT CGACCGCGGA CTCGACGCGC TCTACCGATG GATCGACACG ACCAGCGAGT CCGACGACCT GCACCGACCG GGCATCACGG ACCAGTCCCG GCTCAGCACC ATCAAGCTCG TCAACCGGCT CATGCCCGCG GCCTTCTTCT GTGACCAGGC GATGATGGCC TGGCTGACCA TCAAGACGCT GGAGGTCTGG GCATGGCACG GCCCGGATCG CGCTCTGCTC GGCCCGGCCG GCCACATCGC GATCGTGACC ATCGTCCGCA GGGGCGACTA CCGCACCGGG CACCGCATCC TGCGGCGGAT CCTGGCGGTC GGTCAGGCGC TCGGCTATGA GCCTGACGTG TGGCAGGCGC AGTTCGTGTA TGTGATCACT ACCGGCCACT GGTTCGCGCC CCTCGAGGAC AATCTGTCTG AGGGCCGTCG CGCGCTGGAG GGACTTGTCC TGAGCGGTGA TCTCCAGAAC GCATGCTGGG CCCACTACGC ACTGTTGTAC GACCTGTTGG ACTGCGCGCC GTCTCTTGAT GTCTTCGCCG CTGAGGTCGA CGAGGCACTG GCGTCCGCAG CCCGCACCGG CAATGGACAT GCCGAGACGA TATTTCGGAT ATTTCGCCGA CTGGTACGAG TGATGCGAGG CGAGTCCGTC GAATCGGCGG CCGATGAGGC GGCCGAGCTG AGCATGCTGG CCGCGGACCC GTTCGCCGTC GCCAACCTGC ATGTGACCCG GGCGCTCGCC GCAGCCGTTC TCGGTCATCC GGTCGATCTG ACCCGGCATG CGGCAGCGGT GATGCCCTTT AGGCCAATGG TCGGAGCGAA CTATGCGATG GCGGTGGCGC GGGTGCTGCT TGCCCTGGGC ATGGCGGCGC AGATTCGTGC GGCAGGAGCG GATCGGTGTG ACACAGAGCT CGCCGAGCTC GACGAACTGG TCGAGTGGCT GGCCGCACGT GCGGCCGACG CGCCGGCCAA CTTCCTGCAC CTGTTACGCA TGATGGAGGC GGAGCGGGCA TGGGCTGTCG GAGACTTCCG CGAGGCGGCG TACACGTTTG ACGTGGCGCA GAGAGAGGCT TCCGTACGGG CGCGCCCGTG GCACCGGGCA CTGATCCTGG AACGCGCTGC ACGCTTCTAC CTGGCCCATG GCATGGATGA AGCCGGCCAC ACGCTCCTGG CAGCCGCCCG GCGCCAGTAC CTGGACTGGG GCGCGACCGC GAAGGTCAGC CAACTCGACT GGGCCCACCC GACGCTACGA ACCGAACCTG CCTGCGAACC GGTCGTTCAT CCACCGGCGG CGCCATCCGC GCGACGCTCG ACCGTCGCGA CCGGCACCGT CGATCTGCTC GGCGTCGTCG CGGCATCCCA GGCACTCAGC TCGGAGACCA GCATCGAGGG CCTCCGGGCA AGGGGCGTGG GGATCCTGTC GGAGATGACC GGTGCCACCG GCGTCCACCT GCTGTTGCGT AGGCAGGAAC AAGACGGGTG GTTGGTGCCG ACCGGTGTCG GTGGCACCGT CCCGCTCCGG GAGGCCAGCC GGCGGCGTCT GTTGCCCGCC TCGGTCATCC GATATGCCGA GCGCACCCAC GAACCTGTGG TCGTCGCCAA CGCCACCCGC GACGACCGGT TCCGCCGCGA CCCCTACCTC CTCGACTTCG ACCGCTGTTC ACTGCTCGCC ATACCCATCA TGATCCGGGG CCATCTGCAG GCGATGCTGC TATTAGAGAA CCGACTGATC CGTGGCGCGT TCTCCACCGA ACGCCTCGAA GGAATCATGC TCATCGCCGG ACAGCTCGCG GTCTCGCTCG ACAACGCCAT GGTCTACGCG TCGCTGGAAC GCAAGGTTAC CGAACGAACC GGGCAGCTCG CCGCTACCAA CGAACGACTC GCCGCCGCGA ACCACCAGCT GGAACAACTC TCGGTCACCG ATCCGCTGAC AGGGCTGGCC AACCGGCGAC GCCTCGAAGA GACCCTGGAC GTCGAATGGC GCCGGGCCCA GGAACACGCG GCACCCATCG CGCTGGCGAT GGTCGACATC GACCACTTCA AACTCTACAA CGACCACTTC GGACACACCG CAGGTGACCG ATGCCTCCAG CGGGTCGCCG CATGCCTGGC CGAGAATGTC GGTGACACCT TTCTGACCGC CCGTTATGGG GGGGAAGAGT TCACCGTCGT GATGCCCGAC ACCGACTCGG ACACCGCCGC TAGGCTGGCC CGACGCCTCT GCTCCGCCGT CGAGGAACTG GCCGAGCCAC ACCCTCTGGT GGTAGAGCGC ATCATCACCG TGAGCATCGG CGTAACCGCG GCCATCCCAA CTCCCGACGA CGACATGGCG GCGTTCGCCG AATTCGCCGA TGTCGCGCTG TACCGGGCCA AAGACGGCGG CCGCAATCGG GTCCGAATGA TCCCGTTTCC GGTGGACAAC GGACGTGGGG AAAAACCGCC TGGCCGGTGA
|
Protein sequence | MTEQTRITRL VFPTGCVIRK EPLGPDAQRR LRHEVEILGR LSGVQGVAHL AAGAVECAGS ILLVDVDGMA LSEWTTPLDP AELVDLAESL ARAVAGMHHR GVVHRDICPA NIVVSKDLGV PCLIDFALAT ALSSVRPEFM YHGATVGTVP YLAPEQTGRT GRPVDQRADL YALGATLYEL ATGAPPFGTD NPVRIIHDHL ARIPLSPVAV NPSVPAGLSN IITHLLEKEP DDRYQSADGL VHDLTLVRRG AAAVHPGEHD FLTRPLTPSR LSGRDREVDE LDAAFAEAVT GRSHGVLVGG APGVGKTSLV SELRPIAAGV DGWFVAGKFD QYRRDQEYDG VRQALRALGR LLLAEPDDYL AEVRDRMLRA LGPNAGLAAA TVPELAVLLK VRPEPGDPMT AQVRASRIAV ETLRAVSSRE RPVVFFVDDL QWAGRTPLGF VDLVFAGEEQ IAGLLLVGAY RESEIDAAHP LTPMLARWRH QVVGPHHLRL GSLPPAGQAA LVADLLHLSP PPATELAQMI APSARGNPYD IVELLNALRH DRVLTLGGGG WRWDWEALRR RLDRVDVTEL LAARVAALPP ATAEVLAVMA CLAGQVELDL LVAATGLPAD EVERRLAPAF GDGLLVLESD GRQSVRFHHD RTRESVLGGR AAQDQGARRL RLARRLVERP EFFAVAAEQY LPVADAVHGT EERKRVAGLF RQAAEEAKLL SNYSLVERLL TAAVEVIDPT DTDQLIAVHT DRHAALYSLG RLEEADDTYQ TIGRLCTHPV QLTAATVVQV SSLTNRGRGG EAMRLGLDQL RHLGISVPDE NDLDAEIDRG LDALYRWIDT TSESDDLHRP GITDQSRLST IKLVNRLMPA AFFCDQAMMA WLTIKTLEVW AWHGPDRALL GPAGHIAIVT IVRRGDYRTG HRILRRILAV GQALGYEPDV WQAQFVYVIT TGHWFAPLED NLSEGRRALE GLVLSGDLQN ACWAHYALLY DLLDCAPSLD VFAAEVDEAL ASAARTGNGH AETIFRIFRR LVRVMRGESV ESAADEAAEL SMLAADPFAV ANLHVTRALA AAVLGHPVDL TRHAAAVMPF RPMVGANYAM AVARVLLALG MAAQIRAAGA DRCDTELAEL DELVEWLAAR AADAPANFLH LLRMMEAERA WAVGDFREAA YTFDVAQREA SVRARPWHRA LILERAARFY LAHGMDEAGH TLLAAARRQY LDWGATAKVS QLDWAHPTLR TEPACEPVVH PPAAPSARRS TVATGTVDLL GVVAASQALS SETSIEGLRA RGVGILSEMT GATGVHLLLR RQEQDGWLVP TGVGGTVPLR EASRRRLLPA SVIRYAERTH EPVVVANATR DDRFRRDPYL LDFDRCSLLA IPIMIRGHLQ AMLLLENRLI RGAFSTERLE GIMLIAGQLA VSLDNAMVYA SLERKVTERT GQLAATNERL AAANHQLEQL SVTDPLTGLA NRRRLEETLD VEWRRAQEHA APIALAMVDI DHFKLYNDHF GHTAGDRCLQ RVAACLAENV GDTFLTARYG GEEFTVVMPD TDSDTAARLA RRLCSAVEEL AEPHPLVVER IITVSIGVTA AIPTPDDDMA AFAEFADVAL YRAKDGGRNR VRMIPFPVDN GRGEKPPGR
|
| |