Gene Franean1_3011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3011 
Symbol 
ID5671393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3539261 
End bp3544174 
Gene Length4914 bp 
Protein Length1637 aa 
Translation table11 
GC content70% 
IMG OID641241913 
Productserine/threonine protein kinase 
Protein accessionYP_001507333 
Protein GI158314825 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
[COG3899] Predicted ATPase 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.15444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATCCC AGCGAACGAC ACCAAGCCGA GATCCGCCGG TGGCTGCGGT GCGGATGGAG 
TTGCTCCACG AGACGGAGCG GACTCGGGTA ACTCGGCTGG TGTACCCGGC TGGGATTGTG
ATCCGGAAGG AACCGCTGGG GCTTGGCGCG CAGCGACGGT TGCGTCATGA GGTCGAGATC
CTCGAGCGGC TGTCCGGAGT CGAGGGGGTC GTGCATCTGG CGGCCGGAGC GCCGCCATGC
CCGGGGTCGC TCCTGCTGGC CGATGTCGGT GGCAGGGCTC TGTCCGAGCG GAGCACTCCC
CTGGATCCGG CCGAGCTGGT CGATCTGGCC GGATCGCTCG CGCGCGCCGT GGCGGGGATG
CATCGCCGGG GTGTGGTGCA TCGGGATATC AGCCCGGCGA ACATCGTGGT GAGTGGGGAC
CGAGGCCTCC TGTACCTGAT CGATTTCGCG TTGGCTACGA CCTTCGCGGA GGTTCATCCG
GGGTTCATCC ACGACAATGA GATTGTCGGG ACGGTGCCGT ACCTGGCCCC GGAGCAGACC
GGCCGGACCG GTCGCCGGGT CGATCAGCGG GCCGACCTGT ACGCGGTCGG GGCAACCCTG
TACGAGCTGG CCACGGGTTC GCCGCCGGTC GGCACGGGTG ATCCACTGCG GATCATTCAT
GATCACCTCA CCCGGGCGCC GGCAGCGCCG TCGGTGGTGA ACCCGTCGGT GCCGGCCGGG
CTGTCGGCGA TCATCATGCA TCTTTTGGAG AAGGAGCCGG ACGAGCGTTA CCAGAGCGCC
GACGGACTGG TGCATGATCT CGCCCTGGTC CACCGGGGCG GCGTGGTTGT GCATCCCGGC
GGGCATGATT TTCCGGCGCG GCCGCTGACG CCGTCCCGGT TGGCCGGGCG GGAGGAGGAG
ATCGGCGAGC TGGGTGCGGC GTTCGCCGAG GCGATGGCGG GCCGCTGCCG TGGGGTCCTG
GTCGGCGGCG CAGCAGGGGT GGGCAAGACG TCGCTGGTCG AGGAGTTGCG GCCGATCGTG
GCCCGCGGCG ACGGCTGGTT CGTGGCGGGC AAGTTCGACC AGTATCGCCG GGACCAGGAG
TACGACGGAG TCTGGCAGGC GTTCCGGGTG CTGGGCCGGC TGCTGCTGGC CGAGCCGGAG
GACTACCTGG TCGAGGTGCG GGAGCGGATG CTGCGGGCGT TGGGGCCCAA CGTCGGACTT
GCCGTGGCGG TCGTGCCGGA GCTGGCGGTG CTGCTGAAGG TTCCTCCGGA GCCGGGGGAT
CCGATGACCG CACAGGCACG AATACAGCGT GCCGAGGTCG AGGTACTGCG TTCCGTCGCC
TCCCGGAAAC GGCCGGTGGT GTTGTTCGTC GACGACTTGC AGTGGGCCAG ATGGACAGCG
CTGGGCCTCG TCGATCTGGT CTTCGGGGGT GAGGAGCAGG TCGAGGGGTT GTTGCTGGTC
GGCGCCTACC GGGAGAGCGA GGTGGACGCC GCGCATCCGC TGGCGCCGAT GCTCGCCCGC
TGGCGTCGCC AGCCGGCCGG GCCGCGTCAT CTGCGGTTGG GGAACCTGCC GCCGGCAGGG
CAGGCGGCCA TGGTGGCCGA CCTGCTGCGC TTGGCCCCCC AGCACGCCGC CGAGCTGGCA
CGGTTGGTCG CGTCGTCGGC TGGAGGCAAT CCGTATGACA CGGTGGAGCT ACTCAATGCG
CTGCGTCACG ACGGCGTACT GGCCCTCAGC GACGACGGCT GGCGATGGGA CCGGGCCGCG
CTGCGCCGCC GGCTGGACCG GGTGGACGTG ACCGCGCTAC TGGCCGCCCG CATGGCTGTG
CTACCACCAG ATACCAGGGA GATGCTGACG ATGATGGCAT GCCTGGCCGG CCGGGTCGAG
CAGGACCTCC TAGCGGCTGC GACCGGGCTG GCGGCGGACG AGGTCGAGCG GCGGCTTGCC
CCCGCGTTCG CCGACGGGTT TGTGGTGCTG GAGTCCGACG GCCGGCCCAG CGTGCGGTTT
CACCACGATC GGGCGCAGGA GGCCGTCCTG GGCAGTCTCA CCCCGCAGGG GCAGCGCGAC
AGGCGGTTGG GTCTGGCCCG GCGTCTGGCC GACCGATCTG AGTTCTTCAC CGTGGCGGCG
GAGCAGTACC TGCTGGTGGC CGACGCCGTG CAGGCCGTCG AGGAGCGGCG GCTGATGGCC
GGCCTGTTCC GGCGGGCCGC CGACGAAGCG CGAGTGCTGG GCAACTATCC GCTGGTGGAG
AGGTTCCTGA CCGCGGCGGT GACGCTGATC GACCCGGCCG ACACCGACCA GCTGATCGCG
GTCCACACCG ACCGGCACGC CGCCTTGTAC ATGCTCGGCT GGCTGGAGGA GGCGGACGAG
GAATACCAGA CCGTCGACGC GCTGTGCGCC CAGCCGGCTC AGCGCACGCC TGCCACCGTG
GTGCAGGTCA GCAGCCTCAC CAACCGGAGC CGCGCCGACG AGGCGGTACG GCTCGGCCTC
GATCAGCTAC GGCAGCTCGG CCTCGCTGTC CCGGACCGGA ACAACCTGGA TGCGGAAATC
GACCGCGGAC TGGACGCGGT CTACCGGTGG ATCGACCGGA CCAGCGAATC CGACGATCTG
CGCCGACCGA AAATCACCGA CCATTCGCGA CTCAGTGCCA TCAGGCTTGT CAATCGGCTC
CTGCCCCCGG CGTTCTTCTG CGACCAGGCG GTGATGGCCT GGCTGGCCGT GCAGGCGCTG
GAGATGTGGG CACGGTACGG CCCGGATCCC GCCCTGCTCG GCCCGGCCGG CCACATCGCG
TTTGTGACCA TCGCCCGCAG GAACGACTAC CGCACCGGGC ACCGCATGAT GCGGCGGATC
CTCACCGTCG GTCGGGCCCG TAGCTACGAG CCTGAGATCT GGCAAGCGCA GTTCCTGTAT
GTGCTCAGTA CCGGCCACTG GTTCGATCCC CTTGAGGACA ACCTGTCCCT CGGGCGTCGC
GCTCTGGAGG GCCTGACAAA AGGTGGTGAC CTGCAGAACG CATGCTGGGC CCACGCCACA
TTGGCGTACT ACCAACTGGA CTGCGCACCT TCGCTCGAGA TCGTTGTCAC CGAGGCCGAC
GAGGCGCTGG CGCTCGCCGT ACGCACCGGC AACGGCCACG CCGAGGAGTT GTCACGGACC
TGCCGCCAGC TGGCGAGGGT GCTGCGAGGC GAGGCCGTCG ACTCAACGGT CGACGAGACG
GCCCAGCTGA GCAGGCTGGC CGCCGACCCG CATGCCGCCG CCTACCTGCA CCTCAGCCGG
GCACTCGCCG CGGCCATCCT CGATCATCCG GCTGAGCTGG CCCGGTGCAC GGCAGCCGCG
ATGCCGTTGC TGCCGTCCAT CCAGGCGCAC TATGCGACGG CGGTGGTCCG TCTGCTGCGG
GCCATGGCAC TGGCAGGGCA GGCCCGCGCG ACGGAAGCGG GCCGGCGCGG CGCCCTGCTG
GACGAGCTGG ACGAGCTGGT CGAGTGGCTG GCCGCGCGTG CGGCCGACGC GCCGGTCAAC
TTCCGGCACC TGCTGCGCCT GGTGGAGGCG GAGCGGGCCT GGGCGGCCGG TGACTTTCAC
CGGGCGGCGT ACACGTTCGA CCTGGCGCAG CGCGAGGCCT CGGTGTGGGC GCGCCCGTGG
CACCGGGCGC TGATCCTGGA GCGTACCGCG CGGTTCTACC TGGCCCACGG CATGGAGGCA
GCAGGTCGTC CGCTGCTGGC CGCCGCCCGC CAGCACTACC TGGACTGGGG CTCCACCTCA
AAGGTCAGCC AGCTCGACTG GGCCTTCCCG ACACTACGGA CCAAGTCCGC CGGCGGGAAG
CCGGTCGCGC AACCACCGGC GGAGCCCGCT GCCCGCCGAT CCACCGTCAC GACCGGCACC
ATCGATCTGC TCGGCATCGT CGAAGCCTCC CATGCGCTCA GCTCCGAGAC CAGCATCGAG
GGCCTGCGGG CCAGGGTCGT GGGCATCCTG TCGGCGATGA CCGGCGCCAC CGGCGTCCAC
CTGCTGCTGC GCGACGGAGA AGAACACATG TGGCTGGTGC CGGCCGGCGA CGGCACCGTC
CCGCTCCGGG AGGCCAGCCG CCGGCGGCTG CTACCGGCCT CGGTCGTCCA CTATGCGACC
CGTACCCACG AACCCGTCGT CGTCGCCGAC GCCACCCGCG ACGACCGCTT CCGTCGCGAC
CCCTACGTCG TCGACCTGGA CCGCTGCTCT CTGCTGGCCA TTCCCATCAT GATCCGGGCC
GAGCTGCGGG CGATGCTGCT GCTGGAGAAC CAGATGATCT GCGACGCGTT CTCCGTCGAA
CACCTCGAAG GGATCATGAC CATCGCCCGG CAGCTGGCAG TCTCGCTCGA AAACGTCGCA
ATGTACACGT CACTGGAACG CAAGGTTACC GAGCGGACCC GGCAGCTTAC CGCCGCCAAC
CAGCAGCTGG AACAGCTCTC GATCACCGAT CCGCTGACCG GGCTGGCTAA CCGGCGCCGT
CTCGACGAGG TCCTCGACGC CGAATGGCAC CGGGCCCGGC AGCAGGCAAC GCCCATCGCG
CTGGCGATGG TCGACATCGA CCACTTCAAG CTCTACAACG ACCACTTCGG GCACGCCACC
GGCGACCGGT GCCTGCAACG GGTCGCCGCC TGCCTGGCCG AGAACACCCG CGACACCGAC
CTGGCCGCCC GCTACGGCGG TGAGGAGTTC GCCATCGTGA TGCCCACCAC CGACACCGGC
GCGGCTACCC GAATCGCCCA CCGCCTCCGC ACCGCCGTCG CGGAGGCGGC CGAGCCGCAC
CCGCTGGTCG CCGGGGGCAT CATCACCGTG AGCATCGGCG TCGCGGCGAT CACTCCCACT
CCAGACGAGC ACGCGGAGGG GCTTGTCGAC CTCGCCGACG TCGAGCTGTA CCGGGCCAAA
CGCGGCGGAC GCAACCGGGT GGAGGCGGCA CTTCCAGGCC CCTCATCGCG ATAG
 
Protein sequence
MGSQRTTPSR DPPVAAVRME LLHETERTRV TRLVYPAGIV IRKEPLGLGA QRRLRHEVEI 
LERLSGVEGV VHLAAGAPPC PGSLLLADVG GRALSERSTP LDPAELVDLA GSLARAVAGM
HRRGVVHRDI SPANIVVSGD RGLLYLIDFA LATTFAEVHP GFIHDNEIVG TVPYLAPEQT
GRTGRRVDQR ADLYAVGATL YELATGSPPV GTGDPLRIIH DHLTRAPAAP SVVNPSVPAG
LSAIIMHLLE KEPDERYQSA DGLVHDLALV HRGGVVVHPG GHDFPARPLT PSRLAGREEE
IGELGAAFAE AMAGRCRGVL VGGAAGVGKT SLVEELRPIV ARGDGWFVAG KFDQYRRDQE
YDGVWQAFRV LGRLLLAEPE DYLVEVRERM LRALGPNVGL AVAVVPELAV LLKVPPEPGD
PMTAQARIQR AEVEVLRSVA SRKRPVVLFV DDLQWARWTA LGLVDLVFGG EEQVEGLLLV
GAYRESEVDA AHPLAPMLAR WRRQPAGPRH LRLGNLPPAG QAAMVADLLR LAPQHAAELA
RLVASSAGGN PYDTVELLNA LRHDGVLALS DDGWRWDRAA LRRRLDRVDV TALLAARMAV
LPPDTREMLT MMACLAGRVE QDLLAAATGL AADEVERRLA PAFADGFVVL ESDGRPSVRF
HHDRAQEAVL GSLTPQGQRD RRLGLARRLA DRSEFFTVAA EQYLLVADAV QAVEERRLMA
GLFRRAADEA RVLGNYPLVE RFLTAAVTLI DPADTDQLIA VHTDRHAALY MLGWLEEADE
EYQTVDALCA QPAQRTPATV VQVSSLTNRS RADEAVRLGL DQLRQLGLAV PDRNNLDAEI
DRGLDAVYRW IDRTSESDDL RRPKITDHSR LSAIRLVNRL LPPAFFCDQA VMAWLAVQAL
EMWARYGPDP ALLGPAGHIA FVTIARRNDY RTGHRMMRRI LTVGRARSYE PEIWQAQFLY
VLSTGHWFDP LEDNLSLGRR ALEGLTKGGD LQNACWAHAT LAYYQLDCAP SLEIVVTEAD
EALALAVRTG NGHAEELSRT CRQLARVLRG EAVDSTVDET AQLSRLAADP HAAAYLHLSR
ALAAAILDHP AELARCTAAA MPLLPSIQAH YATAVVRLLR AMALAGQARA TEAGRRGALL
DELDELVEWL AARAADAPVN FRHLLRLVEA ERAWAAGDFH RAAYTFDLAQ REASVWARPW
HRALILERTA RFYLAHGMEA AGRPLLAAAR QHYLDWGSTS KVSQLDWAFP TLRTKSAGGK
PVAQPPAEPA ARRSTVTTGT IDLLGIVEAS HALSSETSIE GLRARVVGIL SAMTGATGVH
LLLRDGEEHM WLVPAGDGTV PLREASRRRL LPASVVHYAT RTHEPVVVAD ATRDDRFRRD
PYVVDLDRCS LLAIPIMIRA ELRAMLLLEN QMICDAFSVE HLEGIMTIAR QLAVSLENVA
MYTSLERKVT ERTRQLTAAN QQLEQLSITD PLTGLANRRR LDEVLDAEWH RARQQATPIA
LAMVDIDHFK LYNDHFGHAT GDRCLQRVAA CLAENTRDTD LAARYGGEEF AIVMPTTDTG
AATRIAHRLR TAVAEAAEPH PLVAGGIITV SIGVAAITPT PDEHAEGLVD LADVELYRAK
RGGRNRVEAA LPGPSSR