Gene Franean1_7079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7079 
Symbol 
ID5675389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8638105 
End bp8643036 
Gene Length4932 bp 
Protein Length1643 aa 
Translation table11 
GC content70% 
IMG OID641245924 
Productserine/threonine protein kinase 
Protein accessionYP_001511315 
Protein GI158318807 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
[COG3899] Predicted ATPase 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTCC AGCGATCGGC ACCGGGCCGG GATCTGCCGC TGGCGGCGCG GGTGGAGGTG 
CTCCATGAGT CGGAGCGGAC CCGGGTCGCT CGGCTGGTGT TCGCGTCCGG GAGCGTGATC
CGGAAGGAAC CGCTAGGGCC TGACCCACAG CGTCGGCTGC GGCATGAGGT CGAGACGCTC
GGGCGGCTGT CCGGGGTCGA GGGGATCGCG CAGCTGGCGG CCGACGCGGC GCCAGACCCG
GGGGCGATCC TGCTGGCCGA TGTCGGTGGC ACGGTTCTGT CGGAGCGGGT CATGCCTCTG
GAGCCGGCCG AGCTGGTCGA TTTGGCCGGA TCGCTTGCGC GCGCGGTGGC GGGGATGCAC
CGCCGGGGTG TGGTGCATCG GGACGTCAAT CCCGCGAACA TCGTGGTGGG TGGCGACCGG
GGCGCGCCCG AGCTGTACCT GATCGACTTC GCGCTGGCCA CGACCTTGGC GGCGGTTCAG
CCGGAGTTCG TGCATCACAG CGGGATCGTC GGGACGTTGC CGTACCTGGC GCCGGAGCAG
ACCGGCCGGA CCGGTCGCCA GGTCGACCAG CGGGCCGACC TGTACGCGGT CGGGGCGACC
CTGTACGAGC TGGCGACCAG TGCGCCGCCG TTCGGCGTCG GCGACCCGCT TCGGATCATT
CATGACCATC TCGCGCGGGT GCCGGCACCG CCGTCAGCCG TGAACCCAGC GGTGCCGGCC
GACCTGTCGG CGATCATCAT GCATCTGCTG GAGAAGGAGC CGGACGACCG CTACCAGAGC
GCCGACGGGC TGGTGCATGA TCTCGCGCTG CTGGGCCGCG GTGCGGCGGT ACATCCCGGC
GAGCATGATT TCCCGACGTG GCCGCTGACG CCGTCCCGGC TGGCCGGGCG GGGGGAGGAG
ATCGGCGAGC TGGGCGTGGC CTTCGTCGAG GCGATGGCGG GCCGCTGCCG TGGGCTCCTG
GTCGGTGGCG CACCGGGGGT GGGTAAGACG TCCCTGGTCA ACGAGCTGCG GCCGATCGTG
GCCGGTAGCG ACGGCTGGTT TGTGGCGGGC AAGTTCGACC AGTACCGCCG CGACCAGGAG
TACGACGCGG TTCGGCAGGC GTTCCGAGCG CTGGGCCGGC TGTTGCTGGC CGAGCCGGAC
GACGACCTGG CCGAGGTCCG GGACCGGATG CTGCGGGCGT TGGGGCCCAA CGCCGGGCTG
ATCAGCGCCA TCGTGCCGGA GCTGGCGGCG CTGCTGAAGG TGCCACCGGA TCCGGGAGAT
CCGATGACCG CCCAGGTGCG AGCCTCACGC GCAGCGGTGG GGATGCTGCG GGCCGTTGCT
TCCTGGAGGC GGCCGGTGGT GTTGTTCGTC GACGATCTGC AGTGGGCTGG GCGGACACCG
CTGGGTTTCG TCGACCTGAT CTTCGGTGGT GAGGAGGTGG TCGAGGGGCT GCTGTTAGTC
GGCGCGTACC GAGAGAGCGA GGTGGACGCC GCTCATCCGC TGGCGCCGAT GCTGGCTCGT
TGGCGGCGGC AGCCGGTCGG TCCGCACCAC CTGCGGTTGG GCAACCTGAC GCCGGACGGT
CAGACGGGGA TGGTGGCCGA CCTGTTGCGC CTGGCCCCGC GGCCCGCGGC CGAGCTGGCG
CGGCTGATCG CGCCCGCTAC CGGAGGCAAT CCGTACGACA CGGTGGAGCT GCTCAACGCG
CTGCGCCACG AGGGCGTACT GGTCTGCAGT GATCGTGCTT GGCGGTGGGA CCAGTCCACG
CTGCGCGGCC GGCTGGAGCA CGTGGACGTG ACCGAAATGC TGGCCGCGCA CGTGGCCGCG
TTGCCGCCGG ACACCAGGGA TGTGCTGGCG GCGATGGCCT GCCTCGCCGG GCGGGTGGAC
CTGGACCTCC TTCAGACCGC GACCGGTCTG CTGGCGGACG AGGTCGAGCG GCGGCTCGGC
CCCGCGTTCG CCGACGGCCT GCTGGTGCTG GAGTCCGACA GCCGGCCGGG GGTGCGGTTT
CACCACGACC GGACGCGGGA GTCCGTCCTG GGAGTCCTCG GCGCGCAGGC GCTACGCGCC
GAGCGGTTGC GGCTGGCCCG CCGCCTGGCC GGCCGAGGTG AGTTCTTCGC TGTCGCGGCG
GAGCAGTACC TGCCCGTGGC CGACTCCGTG CACGCACCCG CGGAGCGGCA GCTGATGGTC
GGGCTTTTCC GGCGCGCCGC CGACGAGGCG AAGATGCTGA GCAACTACCC GCTGGTGGAG
AGGTTACTGA CCGCCGCGGT GACGCTTATC GAGCCGGTCG ACACCGACCA GTTGATCGCG
GTCCACACCG GCCGTCAGGC CGCCCTGTGT GGCCTCGGCC GGCTGGACGA GGCTGACGAG
GTCTATCAGA CCATCTGCCG GCTTTGTACC CACCCAGCCG GACGCACGGA TGCGACAGTG
ATGCAGGTCA GGAGCATCCA TGACCGTGGC CGTTCTAGCG AGGCGATGCG GCTCGGCCTC
GACCAGCTGC GCCAACTCGG CCTCCCCGTC CCAGACCGGA ACCACCTCGA CCGGGAGATC
GACCGGGGAT TGGATGCGGT TTATCGGTGG ATCGACCAGA CCAGCGAAAG CGAAGCCGAC
GATCTGCGCC GGCCGAGAAT CACCGACCGC TCGAAACTCG ACGCTTTCGG GGTCGTCGAC
TCTCTCATGC CCGCGGCGTA CGTCTGCGAT CAGGCGATGC TGGCGTGGCT GACCGTGCAG
GCGCTGGCGA TGTGGGCGAG GCATGGCCCG GATCCCACCC TGCTCTGCCC CGCGAGTCAC
GCCACGTTCG TGACCAGCGC CCGCAGGCAG GACTACCGCA CCGGGTACCG CATCATGCGG
CGGATCCTCG CGGTCGGTCG GGAACGCGGC TTCGAGCCGG AAATATGGCG GGCGCGGTTC
CTGTACGCGT TCAGTACCGG TCACTGGTTC GACCCGCTCG AGGACGCCGT GTCCGAGGCG
CGTCGCGCGC TGGAGGGTCT TGTCCAGAGC GGCGATCTGC ACAGCGCGTG CTGGACCCAT
ACCGTGCTGG TGATCAACCA GCTGGACTGT GCGCCGTCGC TGGACGCGAT TGTCGCTCAG
GTTGACGCCG CGCTGGCGTT CGCCACCCGC GCTGGCAACG GCCACGTCGA GGAGGCGTTG
CGGTCATGGT GCCAGCTGGC CCGGGTGCTG CGCGGCGAGT CCATCGACTC GGCGGCCGAC
GAGGCGGCCC AGCTGAGCGT GCTGGCCGCC AACCCGATGG CCGTCGTCTA CTTGCATCTC
AACCGGGCGC TCGCCGCGGC CATCCTCGAC CGCCCGGCCG AGCTTGCCCA GTACACGGAA
GCCGTGATGC CGTTGCTGCC GTCCGTTGAA GCGAGCTATG TGGTGGCGAG GGCGCGTCTG
CTGCGGGCCC TGGCATTGGC CTCACAGGCC CGCACCGCCA CAGAGACAGG CCGGCGCGAG
GCGATATTGG CCGAGCTGGA CGAGTCGGTC GAGTGGCTGG CCGCGCGGGC GGCCGACGCG
CCGGTCAACT TCCTGCACCT GCTGCGCCTG GTGGAGGCGG AGCGAGCCTG GGCCGCCGGC
GACTTCCGCG AGGCGGCGTA CGCGTTCGAC GTGGCACAGT GCGAGGCCTC GGTACGGGCG
CGCCCGTGGC ACCGGGCGCT GATCCTGGAA CGCGCCGCCC GGTTCCACCT CGCCCATGGC
ATGACGGCAG GTGGGAGTGC CCTGCTCGAC GCCGCCCGTC GCCAGTACCT GGACTGGGGC
GCGACCGCGA AGGTTCGCCA GCTCGACTGG GCCCACCCGA CGCTACGCAC CGAACCCGCC
GGCGCCCAGG CGGCCGCGCC CCCACCGACG CAGCCCGCCA CCCGCCGCTC CACCATCACC
ATCACCACCG GCACCATCGA CCTGCTCGGC GTCGTCGCCG CCTCTCAGGC ACTCAGCTCC
GAGACCAGCA TCGAAGGCCT ACGGACCAGG GTCGAGGGAA TCCTGTCCGC GTTGACGGGT
GCCACCGGCG TCCACCTGCT ACTGCGCGAC CAGGAGCAGC GGGACTGGCC GGTGCCAGCC
GGCGACGCCG GCACCGTCCC GCTCGCCGAG GCCGGCCGAC GACGGCTGCT GCCACCCTCG
GTCATTCGAT ACGCCGAACG CACCCAGGAA CCCCTCGTCA TCGCCGACGC CACCCGCGAC
GACCGCTTCC ACCGCGACCC CTACCTCACC GGCCTCGACC GCTGCTCCCT GCTCGCCATC
CCCATCACGA TCCGCGGCGA GCTACGCGCG ATGCTGCTGC TGGAAAACAG GATGATCCGC
GGCGCGTTCA CCACCGAACG CCTCGAAGGA ATCATGCTCA TCGCCGGACA ACTCGCCGTC
TCCCTCGACA ACGCCCAGGT CTACACCTCC CTGGAACGCA AGGTCGCCGA ACGCACTTGG
CAACTGGCCG CCGCCAACCA GCGACTCGAA CAGCTTTCGG TCACCGACCA GCTGACCGGA
CTGGCCAACC GCCGACGCCT CGACGAGGTC CTCGACGCCG AATGGCACCG GGCAACCCGC
CAGAAGACAC CCATCGCGTT CGCCATGATC GACATCGACC AATTCAAGCT CTACAACGAC
CACTACGGCC ACACCGCCGG CGACCGTTGC ATACACCAGG TCGCCACCTG CCTGGCCGTC
AACATCCGCG GCACCGACCT CGCCGCCCGC TACGGCGGCG AGGAGTTCGC GATCGTAATG
GCGGGCACCG ATCTGCCCGT CGCCGCCCGA ATCGCCCATC GACTCCGCTC CGCCGTCGCG
GAGCTCACCG AGCCACACCC ACTGAGCACC GAACAGATCG TCACCGTCAG CATCGGCGTC
ACCGCGATTG TCCCCACCCC CGGCGACAAC CCGACAACCT TTGTCGAACT CGCCGACACG
GCGCTATACC AAGCCAAGAA CGGCGGACGC AACCGGGTGG AAACGGCACT TCCGCGCCCC
GCCTCGGGAT AG
 
Protein sequence
MGFQRSAPGR DLPLAARVEV LHESERTRVA RLVFASGSVI RKEPLGPDPQ RRLRHEVETL 
GRLSGVEGIA QLAADAAPDP GAILLADVGG TVLSERVMPL EPAELVDLAG SLARAVAGMH
RRGVVHRDVN PANIVVGGDR GAPELYLIDF ALATTLAAVQ PEFVHHSGIV GTLPYLAPEQ
TGRTGRQVDQ RADLYAVGAT LYELATSAPP FGVGDPLRII HDHLARVPAP PSAVNPAVPA
DLSAIIMHLL EKEPDDRYQS ADGLVHDLAL LGRGAAVHPG EHDFPTWPLT PSRLAGRGEE
IGELGVAFVE AMAGRCRGLL VGGAPGVGKT SLVNELRPIV AGSDGWFVAG KFDQYRRDQE
YDAVRQAFRA LGRLLLAEPD DDLAEVRDRM LRALGPNAGL ISAIVPELAA LLKVPPDPGD
PMTAQVRASR AAVGMLRAVA SWRRPVVLFV DDLQWAGRTP LGFVDLIFGG EEVVEGLLLV
GAYRESEVDA AHPLAPMLAR WRRQPVGPHH LRLGNLTPDG QTGMVADLLR LAPRPAAELA
RLIAPATGGN PYDTVELLNA LRHEGVLVCS DRAWRWDQST LRGRLEHVDV TEMLAAHVAA
LPPDTRDVLA AMACLAGRVD LDLLQTATGL LADEVERRLG PAFADGLLVL ESDSRPGVRF
HHDRTRESVL GVLGAQALRA ERLRLARRLA GRGEFFAVAA EQYLPVADSV HAPAERQLMV
GLFRRAADEA KMLSNYPLVE RLLTAAVTLI EPVDTDQLIA VHTGRQAALC GLGRLDEADE
VYQTICRLCT HPAGRTDATV MQVRSIHDRG RSSEAMRLGL DQLRQLGLPV PDRNHLDREI
DRGLDAVYRW IDQTSESEAD DLRRPRITDR SKLDAFGVVD SLMPAAYVCD QAMLAWLTVQ
ALAMWARHGP DPTLLCPASH ATFVTSARRQ DYRTGYRIMR RILAVGRERG FEPEIWRARF
LYAFSTGHWF DPLEDAVSEA RRALEGLVQS GDLHSACWTH TVLVINQLDC APSLDAIVAQ
VDAALAFATR AGNGHVEEAL RSWCQLARVL RGESIDSAAD EAAQLSVLAA NPMAVVYLHL
NRALAAAILD RPAELAQYTE AVMPLLPSVE ASYVVARARL LRALALASQA RTATETGRRE
AILAELDESV EWLAARAADA PVNFLHLLRL VEAERAWAAG DFREAAYAFD VAQCEASVRA
RPWHRALILE RAARFHLAHG MTAGGSALLD AARRQYLDWG ATAKVRQLDW AHPTLRTEPA
GAQAAAPPPT QPATRRSTIT ITTGTIDLLG VVAASQALSS ETSIEGLRTR VEGILSALTG
ATGVHLLLRD QEQRDWPVPA GDAGTVPLAE AGRRRLLPPS VIRYAERTQE PLVIADATRD
DRFHRDPYLT GLDRCSLLAI PITIRGELRA MLLLENRMIR GAFTTERLEG IMLIAGQLAV
SLDNAQVYTS LERKVAERTW QLAAANQRLE QLSVTDQLTG LANRRRLDEV LDAEWHRATR
QKTPIAFAMI DIDQFKLYND HYGHTAGDRC IHQVATCLAV NIRGTDLAAR YGGEEFAIVM
AGTDLPVAAR IAHRLRSAVA ELTEPHPLST EQIVTVSIGV TAIVPTPGDN PTTFVELADT
ALYQAKNGGR NRVETALPRP ASG