Gene Franean1_3855 details


Gene Information

Locus tag: Franean1_3855
Symbol:
ID: 5672218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4578250
End bp: 4583430
Gene Length: 5181 bp
Protein Length: 1726 aa
Translation table: 11
GC content: 76%
IMG OID: 641242733
Product: serine/threonine protein kinase
Protein accession: YP_001508153
Protein GI: 158315645
COG category:
    [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID:
    [COG0515] Serine/threonine protein kinase
    [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
    [COG3899] Predicted ATPase
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
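
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not translated (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming one untranslated stop codon ends the CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(4578250, 4583430)
    print(length, protein_length_aa(length))  # prints: 5181 1726
```

Both results match the Gene Length and Protein Length fields of this record.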


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.993945
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATCCC AGCAGACGCC GCCGAGCCGG GATCCGCCGG CGGTGGTGCG GACGGAGCTG 
TTGCACGAGA CGGCCCGGAC CAGGGTGACG AGGCTGCTGC TCCCGGCCGG GAGCGTGATC
CGCAAGGAAC CGCTGGGTCC GGGCGCCCGG TGGCGGCTGC GCCACGAGGT CGACATCCTC
GAGCGGCTGG CCGGGGTCGA GGGGGTCGCA CACCTGGCGG CCGCCGGGCC CGACGGCCCG
GCCCCCGCCC CGCCGGCCGC CGAGATCGAC GGCGCGGGCT CCGCCCTGCC GGCCGACAGG
GGCGGCGTCC CGCTGGCCGG CGTCGGCGGC CTCCTGCTGG CCGACGTCGG CGGCGCGGAC
CTGTCCCGGC GGGCCACGCC CATGGACCCG GCCGAGCTGG TCGACCTGGC CGGGGCGCTC
GCCCGGGCGG TGGCGGGGAT GCACGGGCGC GGTGTGCTGC ACCGGGACAT CAGCCCGGCG
AACATCGTGG TGAACCGGGC CGGGAACCGG CCGTACCTGA TCGACTTCGC GCTCGCGACG
ACCGCCACCG AGATCCAGCC CGGGTTCGCC CATCACAACG AGATCGTCGG GACGGTGCCC
TACCTCGCGC CGGAGCAGAC CGGCTGGACG GGGAGCCCCG TCGACCAGCG GGCCGACCTG
TACGCCGTCG GCGCCACCCT GTACGAGCTG GCGACCGGCG CGCCGCCGTT CGGCTCGGGT
GACCCGCTGC GGATCATCCA CGACCATCTC ACCCGGGTGC CCGTGGCGCC GTCGGTGGTG
AACCCGGCAC TGCCGGCCGG GCTGTCGGAC GTCGTCATGC ACCTGCTGGA GAAGGAGCCG
GACGACCGCT ACCAGAGTGC CGAGGGCCTG GCCCTCGACC TCGTCCTGGT GCACCGCGGC
GCGTGCGTGC GGCCCGGCGA GCACGATCTT CCGGCGCGGC CGCTGACGCC GTCCCGGCTG
GCCGGCCGGG AGCGGGAGAC CGGCGAGCTC CGCGCGGCGT TCACCGACGC GGTGGCGGGC
CGCTGCCGCG GGATTCTCGT CGCCGGGGCG CCCGGGGTAG GGAAGACGTC CCTGGTGAAC
GAGCTGCGGT CGATCGTGGC CGGTGCCGGC GGCTGGTTCG TGACGGGCAA GTTCGACCAG
TACCGGCGGG ACCAGGAGTA CGACGGGGTC CGCCAGGCGC TGCGGGCGCT GGGCCGGCTG
CTGCTGGCCG AGCCGGAGGG CTCCCTGGAC GAGGTCCGGG AGCGGCTCCT GGGCGGGCTG
GGCCCCGGAG CCGGCATGGC CGCCGCTGTG GTGCCCGAGC TGGCCGCGCT GCTGCGGATC
CCGCCGAAGG CGGGCGACCC GATGACCGCG CAGGTACGTG CGCAGCGCGG CGCCGCCGAG
ACGCTGCGCG CCGTCGCGTC CCGGGAACGT CCGCTGGTGA TCTTCGTCGA CGACCTGCAG
TGGGCCGGTC GGACCCCGCT CGGCGTGCTC GACCTGATCC TCGGCGGGGA GAAGGACCAC
GAGGGGCTGC TGGTCGTCGG CGCCTACCGG GACGACGAGG TCGACGCCAC GCACCCGATG
GCGCCGATGC TCGCGCGCTG GCGGCGCCAG CCGGCCGGGC CGCGCCACCT GCGGCTGCGG
AACCTGCCCC CGCCCGACCA GGCCGCCATG GTGACCGACC TGCTGCGGCT GCCTCCGGAG
CGCGGCCGGC AGCTCGCGCG GCTGATCGCG CCGACCACCG GGGGCAATCC GTACGACACC
GTCGAGCTGG TCAACTCGCT GCGCCACGAC GGCCTGCTGA CCCGGGTGGC CGGCCGCTGG
CGGTGGGCGC CGGAGACGCT GCGCGCCCGG CTCGCCCGGG CGGACGTGAT CGAGCTGCTG
CGCGCCCGGG TGGCGGCGCT GGCCCCGCCC GCCCGGGAGG CACTGGCGGC GGCCGCGTGC
CTGGCCGGCC AGGTGGACTT CGAGCTCCTG GCAGCCGCGA CCGGGCAGGC GGCGCGGGAG
CTCGAGCGGC GGCTCGCCCC GGCGTTCGCG CGCGGGCTGC TGGTGCTGGA GACCGACGGG
CGACGCGGCG TGCGCTTCCG CCACGACCGG GCGCAGGAGG CCGTCCTGGC CGGCCTCACC
GGGCCGGCGG AACGGGCTCT GCGGCTGGCC CTGGCCCGGC GGCTCGCCGG CCGGCCCGAG
TATCTCGCCG TCGCGGCCGC GCAGTACCTG CCGGTGCTCG ACGAGGTGCG TCTGCCCGCG
GAGCGGCGGC TGACCGCCGA CCTGCTGCGG CGTGCCGCCG ACGAGGCCAC GCTGCTGAGC
AACTACCCGC TGGTGGAGCG GCTGCTGGCC GCCGCGGTGG CGTTCACCGA CCCGGCGGAC
ACCGATCAGC TGATCGAGGT GCACACCGGC CGGCACGCCG CCCTGTACAT GCTCGGCCGG
CTGGAGGACG CCGACGAGGA GTACGAGACC CTCGGCCGGC TGTGCGCCGA CCCGGCGCGG
CGCACGGCCG CGACCCTGGT CCAGGTCAGC AGCCTCACCA ACCGGGGCCG GGCCGGCGCG
GCGCTGAGCC TGGGCTACCA GCAGCTGCGG CGGCTCGGCC TCGCCATCCC GGAGCGGGAC
GACCTCGACG GGGAGATCGA CCGCGGCCTG GACACGCTCA ATCAGTGGCT CGAGCGGACC
ACGACGGCCG ACGACCTGCG GCGGGCCCGA GGCGCGGACC GGGCGGGTCC CGGCGCCGAC
CTGGCGGGGT CCGGCGCCGA CCTGGCGGGG TCCGGCGCCG ACCTGGCGGG GTCCGGCGCC
GTCGAGCTCA TCAACCGGCT CATGCCCGCC GCGTTCTTCG AGGACCAGCA GATGCTGGCC
TGGCTGACCA TACAGACACT GCGGATGTGG ATCCGGGACG GTCCCGGCCC GGCCCTGGCC
GGCCCCGCCG GCCACGCCGG GTTCGTCACC GTCGCCTGGC GCGACGACTA TCGCACCGGG
TGGCGGGCGC TGCGGCGGAT CATCGAGGTC TGCCGGGCCC GCGGCTACGA ACCCGCGCTG
TGGCAGGTGC GTTTCCTGTA TGTGCTCGGC ACCGGCCACT GGTTCGACCC GCTCGAGGAG
AACGTGGACG CGGCGCGCCG CGCCCTGGAG GGCCTGATCC GCGGCGGCGA CCTGCAGAAC
GCGTGCTGGA CCCACTACGC GCTGGTGTAC GGCCTGTTCG ACTGCGCGCC GTCCCTCGAC
GTCGTCGCCG CCGAGGTCGA CGAGGCGCTG GCGTTCGCCG CGTGCACCGG CAACGGCCAC
GCCGAGGAGA CGTTCCGTAT GTACCGCCAC CTGACGGGCA TCCTCCGCGG CGCGACCGGC
ACCCCGGCGG CCGACGGGAC GTCCGATCTG GGCCGGCTGG GCACGGAGCT GCAGGCCGCC
GACCCGCTGG CCGCCGCCCA GCTGCACATC ACCCGGGCGA TCGCCGCGGC CATCCTCGAC
CACCCGGTGG AGCTGGCCCG GCACACCGCG GGGGCCATGG CGGTGCTGCC GGCCGTCCAG
GCGCACTACT CGACGGCGCT GGCGCGGCTG CTGCGCTCGC TGGCGCTGGC CGGCGAGGCC
CGGGCCGCGG TGCCGTCCCG GCGTGCCGCC GTGCTGGCCG AGCTGGACGA GTCGATCGAC
TGGCTGGCGG CGCGCGCGGC CGACGCGCCG TCCAACTACC TGCACCTGCT GCGCCTGGTG
GAGGCGGAGC GGGCCTGGGC GGCCGGGGAG TTCCGAGAGG CGGCGTACGC GTACGACGTG
GCGCAGCGGG AGGCCGCCAC CCGGGCCCGG CCGTGGCACC GGGCGCTGAT CCTGGAGCGC
GCGGCGCGGT TCTACCTGGC GCACGGGCTG GAGGAGGCCG GCCACGCGCT GCTCGCCGCC
GCCCGCCGCC ACTACCTCGC CTGGGGCGCG ACGGCGAAGG TCGCCCAGCT CGACTGGGCC
CACCCGGCGC TGCCCGCCAG GCCTGTCGAG CTCACCGGGC CCGGTGAGCT CGACAGGCCT
GAGAGCATCA CCGGGCCCGG CGGACTCACC GGGCCCGACG CGGCGGGTCC GGCCGCCGAC
GTCGCCCGCC CGGCGCCGGC GCCGGCGAAC GCGTCGGCCG CGGACTCACC GGCCGGGCGG
CGCGCCGTCC TGACGACCGG GACCATCGAT CTGCTCGGCA TCGTCGCGGC GTCGCGCACG
CTCAGCTCCG AGACCAACAT CGGCGGGCTG CGGGCCAGGG TCGCGGGCAT CCTGTCGGAG
ATGGCCGGCG CGACCGGCAT CCACCTGCTG CTCCGTGACC AGCAGGATCA GGGTTGGCTG
GTGCCGGTCG GGGACGGCGG CTCCGTGCCG CTGCCCGAGG CCGCCCGGCG GCGCCTGCTG
CCGGCCTCGG TCGTCCGCTA CGCCGAACGC ACCCGGGAGC CGGTCGTCGT CGCCGACTCG
ATCCGCGACG ACCGGTTCTG CCGCGACCCG TACTTCACCG ACATCGACCG CTGCTCCCTG
CTGGCCGTCC CCATCATGAT CCGGGGCGGG CTGCGGGCGA TGCTGCTGCT GGAGAACCGG
ATGATCCGCG GTGCGTTCGC CACCGAACGC CTCGAAGGGA TCATGCTCAT CGCCGGGCAG
CTGGCCGTCT CGCTCGACAA CGCCCTGGTG TACGCCTCGC TGGAGCGCAA GGTCGCCGAG
CGCACCCAGC AGCTCGCCGC CGCCAACCGG CGGCTGGAGC AGCTCTCGCG TACCGACTCG
CTCACCGGCC TGGCGAACCG GCGGCGTCTC GAGGAGGTCC TCGACGCCGA GTGGCACCGC
GCCCGGCTGC ACTCGACGCC GATCGCGCTG GCGATGATCG ACATCGACCA CTTCAAGCTC
TACAACGACC ACTTCGGGCA CACCGCCGGT GACCGGTGCC TGCAGCGGGT CGCCGCCGGC
CTGGCGGCGA GCCTGCGGCC CACCCACATG GCGGCGCGGT TCGGGGGAGA GGAGTTCGCC
GTCGTGATGC CCGACACCGG CGCGGAGACC GCCGTCGAGC TGGCCCGGCA GCTGCGCCGC
GCCGTCGCCG GGCTGGCCGA GCCGCATCCG CTGGTGATCC ACCGGATCGT CACCGTGAGC
ATCGGGGTCG CGGCGACCAT TCCCGACCCG GGCGACGAGG TGGCGGCGTT CGTCGAACGC
GCCGACATCG AGCTCTACCG GGCCAAGCGC GGCGGCCGGG ACCGGGTGGA AGGCCCCCTC
CCGCGCCCCG CCACCCGTTG A
 
Protein sequence
MGSQQTPPSR DPPAVVRTEL LHETARTRVT RLLLPAGSVI RKEPLGPGAR WRLRHEVDIL 
ERLAGVEGVA HLAAAGPDGP APAPPAAEID GAGSALPADR GGVPLAGVGG LLLADVGGAD
LSRRATPMDP AELVDLAGAL ARAVAGMHGR GVLHRDISPA NIVVNRAGNR PYLIDFALAT
TATEIQPGFA HHNEIVGTVP YLAPEQTGWT GSPVDQRADL YAVGATLYEL ATGAPPFGSG
DPLRIIHDHL TRVPVAPSVV NPALPAGLSD VVMHLLEKEP DDRYQSAEGL ALDLVLVHRG
ACVRPGEHDL PARPLTPSRL AGRERETGEL RAAFTDAVAG RCRGILVAGA PGVGKTSLVN
ELRSIVAGAG GWFVTGKFDQ YRRDQEYDGV RQALRALGRL LLAEPEGSLD EVRERLLGGL
GPGAGMAAAV VPELAALLRI PPKAGDPMTA QVRAQRGAAE TLRAVASRER PLVIFVDDLQ
WAGRTPLGVL DLILGGEKDH EGLLVVGAYR DDEVDATHPM APMLARWRRQ PAGPRHLRLR
NLPPPDQAAM VTDLLRLPPE RGRQLARLIA PTTGGNPYDT VELVNSLRHD GLLTRVAGRW
RWAPETLRAR LARADVIELL RARVAALAPP AREALAAAAC LAGQVDFELL AAATGQAARE
LERRLAPAFA RGLLVLETDG RRGVRFRHDR AQEAVLAGLT GPAERALRLA LARRLAGRPE
YLAVAAAQYL PVLDEVRLPA ERRLTADLLR RAADEATLLS NYPLVERLLA AAVAFTDPAD
TDQLIEVHTG RHAALYMLGR LEDADEEYET LGRLCADPAR RTAATLVQVS SLTNRGRAGA
ALSLGYQQLR RLGLAIPERD DLDGEIDRGL DTLNQWLERT TTADDLRRAR GADRAGPGAD
LAGSGADLAG SGADLAGSGA VELINRLMPA AFFEDQQMLA WLTIQTLRMW IRDGPGPALA
GPAGHAGFVT VAWRDDYRTG WRALRRIIEV CRARGYEPAL WQVRFLYVLG TGHWFDPLEE
NVDAARRALE GLIRGGDLQN ACWTHYALVY GLFDCAPSLD VVAAEVDEAL AFAACTGNGH
AEETFRMYRH LTGILRGATG TPAADGTSDL GRLGTELQAA DPLAAAQLHI TRAIAAAILD
HPVELARHTA GAMAVLPAVQ AHYSTALARL LRSLALAGEA RAAVPSRRAA VLAELDESID
WLAARAADAP SNYLHLLRLV EAERAWAAGE FREAAYAYDV AQREAATRAR PWHRALILER
AARFYLAHGL EEAGHALLAA ARRHYLAWGA TAKVAQLDWA HPALPARPVE LTGPGELDRP
ESITGPGGLT GPDAAGPAAD VARPAPAPAN ASAADSPAGR RAVLTTGTID LLGIVAASRT
LSSETNIGGL RARVAGILSE MAGATGIHLL LRDQQDQGWL VPVGDGGSVP LPEAARRRLL
PASVVRYAER TREPVVVADS IRDDRFCRDP YFTDIDRCSL LAVPIMIRGG LRAMLLLENR
MIRGAFATER LEGIMLIAGQ LAVSLDNALV YASLERKVAE RTQQLAAANR RLEQLSRTDS
LTGLANRRRL EEVLDAEWHR ARLHSTPIAL AMIDIDHFKL YNDHFGHTAG DRCLQRVAAG
LAASLRPTHM AARFGGEEFA VVMPDTGAET AVELARQLRR AVAGLAEPHP LVIHRIVTVS
IGVAATIPDP GDEVAAFVER ADIELYRAKR GGRDRVEGPL PRPATR