Gene Franean1_4550 details


Gene Information

Locus tag: Franean1_4550
Symbol:
ID: 5675738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5425245
End bp: 5430194
Gene Length: 4950 bp
Protein Length: 1649 aa
Translation table: 11
GC content: 71%
IMG OID: 641243413
Product: serine/threonine protein kinase
Protein accession: YP_001508829
Protein GI: 158316321
COG category: [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
    [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
    [COG3899] Predicted ATPase
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
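
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5425245
end_bp = 5430194
gene_length_bp = 4950
protein_length_aa = 1649

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of
# the translated protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches protein length:",
      expected_protein_aa == protein_length_aa)
```

Under those two assumptions, all three checks agree with the values in the record.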


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.773527
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCAGAC AGATGACCTC AGGCCGGGAT CCTCCGATGG CAGCAGCGTC ACAAGTGGTG 
CTGCACGCGA CGGATCAGAC CCGGATCAGC CGCCTGGTGC TGCCGAGCGG TTGTGTGATC
CGGAAAGAGC CGCTGGGCCC GGACGCACAG CGCCGGGTGC GCCATGAGGT CGAAATCCTC
GAACGGCTGG CGGGAGTCGA GGGCGTCGTG CACCTGGCCG CCGGGGCGGC GCAGTGCGCG
GGATCGATCC TGCTGGCCGA TGTCGGCGGC ACGGCGCTGT CCCGGCGGGA GACCCCGCTG
GAGCCGGTCG AGCTGATCGA TCTTGCCATG TCGCTGGCGC ACGCGGTCGC GGGGATGCAC
CACCGGGGGG TGGTGCACCG GGACCTCTGC CCAGCGAACA TCGTGGTGAG CGAGGACAGT
GGGGCGCCGT ACCTGATCGA CTTCGCGCTT GCCACGACCT TCTCGGTGGT CCAGCCGGAG
TTCGCCCAGC ACGGGGCCGC GGTCGGGACC GTGCCGTACC TGGCACCGGA GCAGACCGGA
CGGACCGGTC GTCCGGTCGA CCAGCGGGCC GACCTGTACG CGCTGGGAGC CAGCCTGTAC
GAGCTGGCCA CCGGCGCGCC GCCGTTCGGC ACGGAGGACC CGATCAGGAT CATTCATGAT
CATCTAGCTC GGGTCCCGGA CCTGCCAGGG GCGGTCGAGC CGTCCGTGCC CCCCGCGCTG
TCCGAGATCA TCATGCATCT GCTGGAGAAG GAGCCTGATG ATCGTTACCA GACCGCCGAG
GGGCTGGCGC ATGACCTCGC GCTGGTACGC CGCGGCGTGG TGGCGGTGCG CCCCGGTGAG
CACGATCTCC CGACGCGGCC GCTGGCGCCC TCCCGGTTGT GCGGGCGGGA CCGGGACCTC
GACCGGCTCG ACGCGGCGTT CGCCGAGGCC GTCACCGGGA AATCCCGCGG GGTCCTCGTC
GCGGGCGCGC CGGGTGTGGG TAAGACATCC CTGGTCAACG CGCTGCGGCC GATCGCTGCT
GGCGTCGACG GCTGGTTCGT CGCGGGCAAG TTCGACCAGT ACCGCCGCGA CCAGGAGTAT
GACGGAGTCC GCCAGGGGCT GCGGGCGTTG GGCCGGCTGT TGCTGGCCGA GCCCGAGGAC
TACCTGGCCG AGGTACGGGA GCGGATGCTG CGGGGCCTGG GCCCCAACGC CGGCCTGCTG
ACGGCCACCG TGCCGGAGCT GGCGACACTG CTGAAGGTTC CGCCGGAGCC GGGCGATCCG
ATGACCGCCC AGGCGCGGCT GTCACGCACA CCGGTCGAGG TGCTGCGCGC CGTGGTCTCC
CCGGCGCGGC CAGTGGTGTT CTTCGTCGAC GATCTGCAGT GGGCTGGGCG CACCCCCCTG
GGCTTCGTCG ACCTGGTCTT CACCGGCGAG GAGCAGATCG CCGGACTGCT GCTGGTCGCC
GCCTACCGAG AGAGTGACGT CGCCGCCGCG CATCCGCTGG CCCCGATGCT GGCCCGCTGG
TCCGACCGGA CGGCCGGCCC GCACCAGCTG CGCCTGGGCA GCCTGCCGGC AGTGGGACAG
GCGGCCATGG TCGCCGACAT GCTGCACCTG GCTCCGGAGC CCGCCGCCCG GCTGGCGCGG
GCGATCGCGC CGTCCACCGG CGGCAATCCG TACGACGTCG TGGAACTGCT CAACTCGCTG
CGGCGCGAGG GCGTCCTGAC ACCAGGCGCG GGAGGCTGGC GATGGGACGG GAAGGTGCTG
CGCCGCCGGC TCGACCGGGT GGAGGTGACC GCGCTCCTGG CCGCCCGGAT GGCCGCGCTG
CCGTCCGAGA GCGCCGAGCT GCTGGCGGTG ATGGCGAGCC TGGCGGGCCA GGTAGATCTT
GATCTTCTCG CGGTCGCGAC AGGCCTGCCG CCGGGCGAGC TCGAGCGGCG GCTCGCGCCC
GCCTTCGGCG ACGGCCTGCT GATGCTGCAG TGCGACGGGC GACCCGGTGT CCGGTTCAAC
CACGACCGGA CGCGCGAGAC CGTGTTGGGT GGCAGCCCGG CGGAGGAGCA GCGCGCACGG
CGGCTGCGGC TGGCGCGGCG GCTGGCCGAC CGGCCGGAGT TCTTCGCCGT GGCTGCCGAG
CAGTACCTGC ACGTCGTGGA CATGGTGCGC AGCACCGGCG AGCGGCGCAG CGTGGTCGCG
CTCTTTCGTC GCGCCGCCGA AGAAGCGAGA GTACTGAGCA ACTATCCGCT GGTGGAGAGG
TTGCTGAGCG CGGCGGTCGG GCTTGTCGAC CCGACCGACA CCGACATGTT GATCGCCGTC
CACACCGACC GGCATGCCGC TCTGTACTGC CTCGGCCGGC TGGACGACGC GGACGACACC
TACCGGACCA TCGACCGGTT GTGCACCCGG CCGACCCAGC TCACGTCCGC CACGATGGTT
CAGGTCATCA GTCTCACCAA CCGGGGTCGC GCCGGCGAGG CGATGCGGCT CGGCCTCGGC
CAGCTGCGGC GTCTCGGCAT GTCGGTTCCG GAGGAGAACG CGCTCGACGA GGAGATCGAC
CGCGGACTCG ACGAACTTCG TCGGTGGATC GACACCACCA GCGAGTCGGA TGATCTGCGC
AAGCCGGAGA TCACCGACCA GTCCCGCCTC AGCACCATCA GGCTCGTCAA CCGGGTCATG
GTGGCGGCGT TCTTCCACGA CCACACGGTG ATGGCCTGGC TGGCCATCAG GACTCTGCAG
ATGTGGGAAC GCTACGGCCC GGATCGTGCC CTGGTGGGCC CGGCCAGCCG CATCCCGATC
GTGACCATCG TGCGCAGGGG TGACTACCAC ACCGGGCCAC GCATCCTGCG ACGGATCCTC
GCGGTCGCGC GGGCTCGCGA CTACGAGCCC GACGTGTGGC AGGCACTCTT CACCTACGTG
GCCACCTGCG GCCACTGGTT CGCGCCTCTG GAGGACAACC TCCCCGACGG CCATCGCGCG
CTGGAGGGCC TTCTGCTGGG CGGCGACCTG CAGAACGCGT GCTGGACCCA CTACGCACTG
GTGTGCGACC TGTTGGACTG CGCTCCGTCG CTCGACGTCT TCGCCACGGC GGTCGAGGAG
GCGCTCGCCT TCGCCCTGCG CACCGGAAAC GGGCACGCCG AAGGAACGTT CCGGGCCTAC
CTCCGGATGG TGCGGGTGCT GCGGGAACCC GTCGAGCCGG CCGGTGACGA CGCGGCCGAG
CTGGGCCTGC TGGCGGCCGA CCCGTTCTCT CTCGCCAACC GGCACGTGGC ACGGGCTCTC
GCCGCGGCGG TTCTCGACCA CCCGGCCGAC GCGGCGCGGC ATGCCGCGGC CGTGATGCCC
TGTGGGGCCG TGATCGGGAC GAACTATCCC ATGGCGGTGG CGCGGATGCT GCTCGCGATG
ACGCTGGCGG CACGGGCGCG GGCCGCGGGA GCGGATCAGC GCGACGCCGT GCTGGACGAG
CTCGACGAAC TGATCGAATG GCTGGCCGCG CGGGCGGCGG ACGCACCGAG CAACTTCCTG
CATCTGCTGC GCCTGGTGGA AGCGGAACGG GCGTGGGCCG TCGGGGACTT CCACGAGGCG
GCATACACGT TCGACGTGGC GCTGCGCGAG GCGTCAGCCC GGATGCGCCC ATGGCATCGG
GCGTTCACAC TGGAACGCGC CGCACGCTTC TACCTGTCAC ACGGCATGGA GGCGGCCGGC
CTCGCGTTCC TGCGGGCCGC CCGGGGCCAG TACCTGGAAT GGGGCGCGAC GGCGAAGGTC
AGCCAACTCG ACTGGGCCTA CCCGGCGCTG CGGACCGCCG CGATGGCCGC GCCCGCGGGG
CCGATCGTCC TCCCGCAGGC GGAGCAGTCC GGGCGCCGCT CGACCGTGGC GACCGGCACT
GTCGACCTGC TCGCCGTGGT CGCCGCATCC CAGGCGCTCA GCTCCGAGAC CAGCGTGGAC
GGCCTGCAGG CGAAAGTGGT GGGAATACTC TCCGAGACGA CCGGCGCCAC CAGCGTCCAC
CTCCTGCTGC GAGAAAAGGA GCAGGACCGA TGGCTGGTAC TCACCGGAGC CGGCGGCACC
GTCTCGCTCC AGGAAGCCGG CCGGCGGCGC CTGCTGCCGG TCTCGGTCAT CCGCTACGCC
ACCCGCACCC ACGAACCCGT CGTCGTCGCC AACGCCACCC GGGACGACCG CTTTCACCGC
GACCCCTACT TCGCCGGACT CAACCGCTGC TCGCTGCTCG CCGTCCCCGT CATGACCCGG
GGCAGCCTGC GGGCGATGCT CCTGCTGGAG AACCGGATGA TCCGCGGCGC GTTCTCCACC
GAACGCCTCG AAGGCATGAT GATCATCACG GGGCAGCTCG CGGTCTCGCT CGACAACGCC
CTGATCTACG CGTCGCTGGA ACGCAAGGTC GCCGAGCGAA CCAGGCAGCT GGCCGCCACG
AACAGGCAGC TTGCCGCGGC GAACCACCAG CTCGAACAGC TTTCGATCAC CGACCCGCTC
ACCGGGCTGG CCAACCGGCG GCGTCTTGGG GAGGTGCTGG CCGTCGAGTG GGAACGGGCC
CGGCAGCACG CCGGGCCGAT TGCGCTCGCG ATGGTCGACA TCGACCACTT CAAGCTCTAC
AACGACCATT TCGGGCACGC CGCCGGTGAC CGGTGCCTGC AGCGGGTCGC CGCATGCCTC
GCCGGGAACA TCGGCGACAC CCTGCTCGCC GCCCGCTACG GGGGAGAGGA GTTCACCGTC
GTGATGCCCG ACACCGACCT GGGCGCCGGT GCCCAGGTCG CCCGGCATCT GTGCCGTGCC
GTCGAGGACC TGGCCGAACC ACACCCGCTG ACGGCCGAGC GCATCATCAC CGTCAGCATC
GGCGTCACCG CGACAGTCCC CGTCGCCGCC GCTGCGGCCG CCGACGCGGC AACGGGCGAC
GGCATGGTCG CCTTTGTCAA GGCCGCGGAC ACCGCGTTGT ACCGGGCGAA ACACAGTGGA
CGTAACCAGG TGGAGACCGC AGTGCGTTGA
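
The gene sequence above is printed in space-separated blocks of 10 bases, 60 bases per line. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is shown below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the listing is used as input, so the printed value will not equal the 71% reported for the full 4950 bp gene.

```python
def join_blocks(lines):
    """Join sequence lines printed in space-separated blocks into one string."""
    # str.split() with no argument drops all whitespace, including the
    # trailing space found at the end of some lines in the listing.
    return "".join("".join(line.split()) for line in lines).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing, copied as printed.
first_line = [
    "GTGGCCAGAC AGATGACCTC AGGCCGGGAT CCTCCGATGG CAGCAGCGTC ACAAGTGGTG ",
]

seq = join_blocks(first_line)
print(len(seq), "bases; GC fraction:", round(gc_fraction(seq), 2))
```

Passing every line of the listing to `join_blocks` instead of just the first would give the full gene, whose length and GC fraction could then be compared with the Gene Length and GC content fields.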
 
Protein sequence
MARQMTSGRD PPMAAASQVV LHATDQTRIS RLVLPSGCVI RKEPLGPDAQ RRVRHEVEIL 
ERLAGVEGVV HLAAGAAQCA GSILLADVGG TALSRRETPL EPVELIDLAM SLAHAVAGMH
HRGVVHRDLC PANIVVSEDS GAPYLIDFAL ATTFSVVQPE FAQHGAAVGT VPYLAPEQTG
RTGRPVDQRA DLYALGASLY ELATGAPPFG TEDPIRIIHD HLARVPDLPG AVEPSVPPAL
SEIIMHLLEK EPDDRYQTAE GLAHDLALVR RGVVAVRPGE HDLPTRPLAP SRLCGRDRDL
DRLDAAFAEA VTGKSRGVLV AGAPGVGKTS LVNALRPIAA GVDGWFVAGK FDQYRRDQEY
DGVRQGLRAL GRLLLAEPED YLAEVRERML RGLGPNAGLL TATVPELATL LKVPPEPGDP
MTAQARLSRT PVEVLRAVVS PARPVVFFVD DLQWAGRTPL GFVDLVFTGE EQIAGLLLVA
AYRESDVAAA HPLAPMLARW SDRTAGPHQL RLGSLPAVGQ AAMVADMLHL APEPAARLAR
AIAPSTGGNP YDVVELLNSL RREGVLTPGA GGWRWDGKVL RRRLDRVEVT ALLAARMAAL
PSESAELLAV MASLAGQVDL DLLAVATGLP PGELERRLAP AFGDGLLMLQ CDGRPGVRFN
HDRTRETVLG GSPAEEQRAR RLRLARRLAD RPEFFAVAAE QYLHVVDMVR STGERRSVVA
LFRRAAEEAR VLSNYPLVER LLSAAVGLVD PTDTDMLIAV HTDRHAALYC LGRLDDADDT
YRTIDRLCTR PTQLTSATMV QVISLTNRGR AGEAMRLGLG QLRRLGMSVP EENALDEEID
RGLDELRRWI DTTSESDDLR KPEITDQSRL STIRLVNRVM VAAFFHDHTV MAWLAIRTLQ
MWERYGPDRA LVGPASRIPI VTIVRRGDYH TGPRILRRIL AVARARDYEP DVWQALFTYV
ATCGHWFAPL EDNLPDGHRA LEGLLLGGDL QNACWTHYAL VCDLLDCAPS LDVFATAVEE
ALAFALRTGN GHAEGTFRAY LRMVRVLREP VEPAGDDAAE LGLLAADPFS LANRHVARAL
AAAVLDHPAD AARHAAAVMP CGAVIGTNYP MAVARMLLAM TLAARARAAG ADQRDAVLDE
LDELIEWLAA RAADAPSNFL HLLRLVEAER AWAVGDFHEA AYTFDVALRE ASARMRPWHR
AFTLERAARF YLSHGMEAAG LAFLRAARGQ YLEWGATAKV SQLDWAYPAL RTAAMAAPAG
PIVLPQAEQS GRRSTVATGT VDLLAVVAAS QALSSETSVD GLQAKVVGIL SETTGATSVH
LLLREKEQDR WLVLTGAGGT VSLQEAGRRR LLPVSVIRYA TRTHEPVVVA NATRDDRFHR
DPYFAGLNRC SLLAVPVMTR GSLRAMLLLE NRMIRGAFST ERLEGMMIIT GQLAVSLDNA
LIYASLERKV AERTRQLAAT NRQLAAANHQ LEQLSITDPL TGLANRRRLG EVLAVEWERA
RQHAGPIALA MVDIDHFKLY NDHFGHAAGD RCLQRVAACL AGNIGDTLLA ARYGGEEFTV
VMPDTDLGAG AQVARHLCRA VEDLAEPHPL TAERIITVSI GVTATVPVAA AAAADAATGD
GMVAFVKAAD TALYRAKHSG RNQVETAVR