Gene Franean1_4642 details


Gene Information

Locus tag: Franean1_4642
Symbol:
ID: 5672985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5535509
End bp: 5540398
Gene Length: 4890 bp
Protein Length: 1629 aa
Translation table: 11
GC content: 68%
IMG OID: 641243500
Product: serine/threonine protein kinase
Protein accession: YP_001508916
Protein GI: 158316408
COG category: [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
    [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
    [COG3899] Predicted ATPase
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
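
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which can serve as a consistency check on the record. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon,
    if present, does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(5535509, 5540398)
protein = protein_length_aa(length)
print(length, protein)  # 4890 1629
```

Both results agree with the Gene Length (4890 bp) and Protein Length (1629 aa) reported in the record.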


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAGC AGACCCGGAT CACTCGGCTG GTGTTTCCGA CTGGATGTGT GATCCGAAAG 
GAACCGCTGG GACCAGACGC GCAGCGGCGG TTGCGGCATG AGGTCGAGAT CCTCGGACGG
CTGTCGGGAG TCCAGGGGGT CGCGCACCTC GCTGCTGGGG CCGTGGAGTG CGCGGGATCG
ATCCTGCTGG TCGATGTCGA TGGCATGGCG CTGTCCGAGT GGACGACTCC ACTGGATCCG
GCCGAGCTGG TCGATCTCGC CGAATCGCTG GCGCGGGCCG TCGCTGGGAT GCATCACCGG
GGTGTGGTAC ACCGGGACAT CTGCCCGGCG AACATCGTGG TGAGCAAGGA CTTGGGAGTT
CCCTGCCTGA TCGACTTCGC GCTTGCCACA GCTCTTTCGT CGGTCCGGCC GGAGTTCATG
TACCACGGGG CAACTGTCGG GACGGTGCCG TACCTGGCGC CGGAGCAGAC CGGGCGGACC
GGTCGCCCGG TCGACCAGCG AGCCGACCTG TACGCGCTCG GGGCCACCCT GTACGAGCTG
GCTACCGGCG CGCCGCCGTT CGGCACGGAC AACCCAGTTC GGATCATTCA TGACCATCTC
GCTCGGATCC CACTCTCGCC TGTGGCGGTG AATCCGTCGG TACCGGCCGG GCTGTCGAAC
ATCATCACGC ACCTGCTGGA GAAAGAACCC GACGACCGCT ACCAGAGCGC CGACGGGTTG
GTGCACGATC TCACCCTGGT ACGACGCGGT GCGGCGGCGG TGCATCCTGG CGAGCATGAT
TTTCTGACGC GTCCGCTGAC GCCATCCCGG CTGTCCGGGC GCGACCGGGA GGTCGACGAG
CTGGACGCGG CGTTCGCCGA GGCCGTGACC GGACGCTCTC ACGGGGTCCT CGTCGGCGGC
GCACCGGGCG TGGGTAAGAC ATCCCTGGTC AGTGAGCTGC GGCCGATAGC GGCCGGCGTC
GACGGCTGGT TCGTGGCCGG CAAATTCGAC CAGTATCGCC GTGACCAGGA GTATGACGGG
GTCCGGCAGG CGCTGCGTGC GCTGGGTCGG CTGCTGCTGG CCGAGCCGGA CGACTACCTG
GCCGAGGTGC GGGACCGGAT GTTGCGGGCG TTGGGACCCA ACGCAGGCCT GGCTGCCGCC
ACCGTGCCGG AACTGGCGGT GCTGCTGAAG GTTCGCCCAG AGCCAGGCGA TCCGATGACC
GCACAGGTAC GAGCGTCACG TATAGCAGTC GAGACACTAC GTGCCGTTTC CTCCCGGGAG
CGACCGGTGG TGTTCTTCGT CGACGACCTG CAGTGGGCCG GGCGGACGCC GCTGGGCTTC
GTCGACCTGG TGTTCGCCGG TGAGGAGCAG ATCGCGGGAC TGCTGCTGGT CGGCGCGTAC
CGGGAAAGTG AGATCGACGC CGCTCATCCA CTGACGCCGA TGCTCGCCCG CTGGCGTCAC
CAGGTGGTCG GGCCCCATCA CCTGCGGTTG GGTAGCCTGC CGCCAGCGGG GCAGGCGGCC
TTGGTGGCTG ACCTGCTGCA CCTGTCCCCG CCGCCCGCCA CCGAGCTGGC ACAGATGATT
GCGCCATCCG CCCGGGGAAA TCCATACGAC ATCGTGGAGC TACTCAACGC GCTGCGTCAC
GACAGGGTAC TGACCCTCGG CGGGGGTGGG TGGCGATGGG ACTGGGAGGC GCTGCGCCGC
CGGCTCGACC GGGTGGATGT GACCGAGCTG CTGGCCGCGC GGGTGGCCGC GCTGCCGCCA
GCCACCGCGG AGGTGCTCGC GGTGATGGCC TGCCTGGCGG GCCAGGTGGA GCTGGACCTC
CTAGTTGCCG CAACTGGGCT GCCGGCGGAC GAGGTCGAGC GGCGGCTCGC CCCCGCGTTC
GGCGACGGCC TACTGGTACT GGAGTCCGAC GGCCGGCAGA GCGTGCGGTT TCACCACGAT
CGGACGCGGG AGTCCGTGCT GGGCGGTCGC GCCGCGCAGG ACCAGGGCGC GAGGCGGTTG
CGTCTGGCAC GGCGCCTGGT CGAGCGGCCC GAGTTCTTCG CCGTGGCGGC GGAGCAGTAC
CTGCCCGTAG CTGATGCTGT GCATGGCACC GAGGAACGGA AGCGGGTGGC CGGACTCTTC
CGGCAAGCCG CTGAGGAAGC GAAACTGCTG AGCAATTACT CACTGGTCGA GCGGCTCCTG
ACCGCGGCGG TCGAGGTCAT CGATCCGACT GACACCGACC AGTTGATCGC AGTCCATACC
GACCGGCACG CCGCCTTGTA CAGCCTCGGC AGGCTGGAGG AGGCGGATGA CACGTACCAG
ACCATCGGCC GGTTGTGCAC CCACCCGGTT CAACTCACGG CCGCCACAGT GGTGCAGGTC
AGCAGTCTCA CCAACCGGGG ACGCGGTGGC GAGGCGATGC GGCTCGGCCT CGACCAGCTG
CGGCACCTCG GCATATCCGT CCCGGACGAG AACGATCTCG ATGCGGAGAT CGACCGCGGA
CTCGACGCGC TCTACCGATG GATCGACACG ACCAGCGAGT CCGACGACCT GCACCGACCG
GGCATCACGG ACCAGTCCCG GCTCAGCACC ATCAAGCTCG TCAACCGGCT CATGCCCGCG
GCCTTCTTCT GTGACCAGGC GATGATGGCC TGGCTGACCA TCAAGACGCT GGAGGTCTGG
GCATGGCACG GCCCGGATCG CGCTCTGCTC GGCCCGGCCG GCCACATCGC GATCGTGACC
ATCGTCCGCA GGGGCGACTA CCGCACCGGG CACCGCATCC TGCGGCGGAT CCTGGCGGTC
GGTCAGGCGC TCGGCTATGA GCCTGACGTG TGGCAGGCGC AGTTCGTGTA TGTGATCACT
ACCGGCCACT GGTTCGCGCC CCTCGAGGAC AATCTGTCTG AGGGCCGTCG CGCGCTGGAG
GGACTTGTCC TGAGCGGTGA TCTCCAGAAC GCATGCTGGG CCCACTACGC ACTGTTGTAC
GACCTGTTGG ACTGCGCGCC GTCTCTTGAT GTCTTCGCCG CTGAGGTCGA CGAGGCACTG
GCGTCCGCAG CCCGCACCGG CAATGGACAT GCCGAGACGA TATTTCGGAT ATTTCGCCGA
CTGGTACGAG TGATGCGAGG CGAGTCCGTC GAATCGGCGG CCGATGAGGC GGCCGAGCTG
AGCATGCTGG CCGCGGACCC GTTCGCCGTC GCCAACCTGC ATGTGACCCG GGCGCTCGCC
GCAGCCGTTC TCGGTCATCC GGTCGATCTG ACCCGGCATG CGGCAGCGGT GATGCCCTTT
AGGCCAATGG TCGGAGCGAA CTATGCGATG GCGGTGGCGC GGGTGCTGCT TGCCCTGGGC
ATGGCGGCGC AGATTCGTGC GGCAGGAGCG GATCGGTGTG ACACAGAGCT CGCCGAGCTC
GACGAACTGG TCGAGTGGCT GGCCGCACGT GCGGCCGACG CGCCGGCCAA CTTCCTGCAC
CTGTTACGCA TGATGGAGGC GGAGCGGGCA TGGGCTGTCG GAGACTTCCG CGAGGCGGCG
TACACGTTTG ACGTGGCGCA GAGAGAGGCT TCCGTACGGG CGCGCCCGTG GCACCGGGCA
CTGATCCTGG AACGCGCTGC ACGCTTCTAC CTGGCCCATG GCATGGATGA AGCCGGCCAC
ACGCTCCTGG CAGCCGCCCG GCGCCAGTAC CTGGACTGGG GCGCGACCGC GAAGGTCAGC
CAACTCGACT GGGCCCACCC GACGCTACGA ACCGAACCTG CCTGCGAACC GGTCGTTCAT
CCACCGGCGG CGCCATCCGC GCGACGCTCG ACCGTCGCGA CCGGCACCGT CGATCTGCTC
GGCGTCGTCG CGGCATCCCA GGCACTCAGC TCGGAGACCA GCATCGAGGG CCTCCGGGCA
AGGGGCGTGG GGATCCTGTC GGAGATGACC GGTGCCACCG GCGTCCACCT GCTGTTGCGT
AGGCAGGAAC AAGACGGGTG GTTGGTGCCG ACCGGTGTCG GTGGCACCGT CCCGCTCCGG
GAGGCCAGCC GGCGGCGTCT GTTGCCCGCC TCGGTCATCC GATATGCCGA GCGCACCCAC
GAACCTGTGG TCGTCGCCAA CGCCACCCGC GACGACCGGT TCCGCCGCGA CCCCTACCTC
CTCGACTTCG ACCGCTGTTC ACTGCTCGCC ATACCCATCA TGATCCGGGG CCATCTGCAG
GCGATGCTGC TATTAGAGAA CCGACTGATC CGTGGCGCGT TCTCCACCGA ACGCCTCGAA
GGAATCATGC TCATCGCCGG ACAGCTCGCG GTCTCGCTCG ACAACGCCAT GGTCTACGCG
TCGCTGGAAC GCAAGGTTAC CGAACGAACC GGGCAGCTCG CCGCTACCAA CGAACGACTC
GCCGCCGCGA ACCACCAGCT GGAACAACTC TCGGTCACCG ATCCGCTGAC AGGGCTGGCC
AACCGGCGAC GCCTCGAAGA GACCCTGGAC GTCGAATGGC GCCGGGCCCA GGAACACGCG
GCACCCATCG CGCTGGCGAT GGTCGACATC GACCACTTCA AACTCTACAA CGACCACTTC
GGACACACCG CAGGTGACCG ATGCCTCCAG CGGGTCGCCG CATGCCTGGC CGAGAATGTC
GGTGACACCT TTCTGACCGC CCGTTATGGG GGGGAAGAGT TCACCGTCGT GATGCCCGAC
ACCGACTCGG ACACCGCCGC TAGGCTGGCC CGACGCCTCT GCTCCGCCGT CGAGGAACTG
GCCGAGCCAC ACCCTCTGGT GGTAGAGCGC ATCATCACCG TGAGCATCGG CGTAACCGCG
GCCATCCCAA CTCCCGACGA CGACATGGCG GCGTTCGCCG AATTCGCCGA TGTCGCGCTG
TACCGGGCCA AAGACGGCGG CCGCAATCGG GTCCGAATGA TCCCGTTTCC GGTGGACAAC
GGACGTGGGG AAAAACCGCC TGGCCGGTGA
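
The gene sequence above is laid out in space-separated blocks of 10 bases, so it must be joined into one string before it can be analysed. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (expects an uppercase, non-empty string)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from the record, as sample input.
first_line = "ATGACGGAGC AGACCCGGAT CACTCGGCTG GTGTTTCCGA CTGGATGTGT GATCCGAAAG"
seq = clean_sequence(first_line)
print(len(seq))                    # 60 (six blocks of ten bases)
print(f"{gc_fraction(seq):.0%}")   # GC fraction of this fragment only
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the reported GC content of 68%; the fragment alone is not expected to match that figure.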
 
Protein sequence
MTEQTRITRL VFPTGCVIRK EPLGPDAQRR LRHEVEILGR LSGVQGVAHL AAGAVECAGS 
ILLVDVDGMA LSEWTTPLDP AELVDLAESL ARAVAGMHHR GVVHRDICPA NIVVSKDLGV
PCLIDFALAT ALSSVRPEFM YHGATVGTVP YLAPEQTGRT GRPVDQRADL YALGATLYEL
ATGAPPFGTD NPVRIIHDHL ARIPLSPVAV NPSVPAGLSN IITHLLEKEP DDRYQSADGL
VHDLTLVRRG AAAVHPGEHD FLTRPLTPSR LSGRDREVDE LDAAFAEAVT GRSHGVLVGG
APGVGKTSLV SELRPIAAGV DGWFVAGKFD QYRRDQEYDG VRQALRALGR LLLAEPDDYL
AEVRDRMLRA LGPNAGLAAA TVPELAVLLK VRPEPGDPMT AQVRASRIAV ETLRAVSSRE
RPVVFFVDDL QWAGRTPLGF VDLVFAGEEQ IAGLLLVGAY RESEIDAAHP LTPMLARWRH
QVVGPHHLRL GSLPPAGQAA LVADLLHLSP PPATELAQMI APSARGNPYD IVELLNALRH
DRVLTLGGGG WRWDWEALRR RLDRVDVTEL LAARVAALPP ATAEVLAVMA CLAGQVELDL
LVAATGLPAD EVERRLAPAF GDGLLVLESD GRQSVRFHHD RTRESVLGGR AAQDQGARRL
RLARRLVERP EFFAVAAEQY LPVADAVHGT EERKRVAGLF RQAAEEAKLL SNYSLVERLL
TAAVEVIDPT DTDQLIAVHT DRHAALYSLG RLEEADDTYQ TIGRLCTHPV QLTAATVVQV
SSLTNRGRGG EAMRLGLDQL RHLGISVPDE NDLDAEIDRG LDALYRWIDT TSESDDLHRP
GITDQSRLST IKLVNRLMPA AFFCDQAMMA WLTIKTLEVW AWHGPDRALL GPAGHIAIVT
IVRRGDYRTG HRILRRILAV GQALGYEPDV WQAQFVYVIT TGHWFAPLED NLSEGRRALE
GLVLSGDLQN ACWAHYALLY DLLDCAPSLD VFAAEVDEAL ASAARTGNGH AETIFRIFRR
LVRVMRGESV ESAADEAAEL SMLAADPFAV ANLHVTRALA AAVLGHPVDL TRHAAAVMPF
RPMVGANYAM AVARVLLALG MAAQIRAAGA DRCDTELAEL DELVEWLAAR AADAPANFLH
LLRMMEAERA WAVGDFREAA YTFDVAQREA SVRARPWHRA LILERAARFY LAHGMDEAGH
TLLAAARRQY LDWGATAKVS QLDWAHPTLR TEPACEPVVH PPAAPSARRS TVATGTVDLL
GVVAASQALS SETSIEGLRA RGVGILSEMT GATGVHLLLR RQEQDGWLVP TGVGGTVPLR
EASRRRLLPA SVIRYAERTH EPVVVANATR DDRFRRDPYL LDFDRCSLLA IPIMIRGHLQ
AMLLLENRLI RGAFSTERLE GIMLIAGQLA VSLDNAMVYA SLERKVTERT GQLAATNERL
AAANHQLEQL SVTDPLTGLA NRRRLEETLD VEWRRAQEHA APIALAMVDI DHFKLYNDHF
GHTAGDRCLQ RVAACLAENV GDTFLTARYG GEEFTVVMPD TDSDTAARLA RRLCSAVEEL
AEPHPLVVER IITVSIGVTA AIPTPDDDMA AFAEFADVAL YRAKDGGRNR VRMIPFPVDN
GRGEKPPGR