Gene Francci3_2952 details

Gene Information

Locus tag: Francci3_2952
Symbol:
ID: 3903767
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3489708
End bp: 3493928
Gene Length: 4221 bp
Protein Length: 1406 aa
Translation table: 11
GC content: 72%
IMG OID: 637880273
Product: serine/threonine protein kinase
Protein accession: YP_482039
Protein GI: 86741639
COG category: [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
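
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, so treat both as assumptions. The function names are illustrative only.

```python
# Sketch: check that the record's coordinates and lengths agree.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS length includes the terminal stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(3489708, 3493928)
print(length_bp)                  # 4221
print(protein_length(length_bp))  # 1406
```

Under these assumptions both values match the Gene Length and Protein Length fields listed in the record.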


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.388012
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGACGATG AGCGCTGGAC GCAGTGCACA CCGTCCGAGT ACGCCTGGGA GCGTGCGGCC 
CTGTCCTATC TCAAGGCGCA GCTACCAACC GGGGATCCCT ACCGGGCTTG GGCGAACGCC
GAGTTTCTCG GGTCCGACGG TTCCGTCAAC GAGGTCGACC TGCTTCTCGT CCTGCCGGCG
GGCATCGTCG TGCTGGAGAT CAAGTCCTGG TCCGGTGTCC TCGTGGGAGA CGCGGGCACC
TGGCGTCAGA GCCACCGCGC CCCGGTCGAC AACCCCGTCA TCGGAGCAAA CCGCAAGGCC
CGCAAACTGA AGTCGCTGCT GGTTTCCCGG CCGGCGATGC GGGGCCGACG GGTGCCATGG
GTGGAGGGCG CGGTATTCCT GAGCGACGCG CGGCTGGAGA TCCGCCTTGT CCCAGAAGGC
CGGGCGCACG TCTTCGGACG GCATGACCAG ACCCAGCTGC CGAGTTTCCT GGACTTCGCG
CGGCAGCCCG GCCGGGTGGA CGGCGAGCTC AGTAATGCCC TGGCCCGGGC CGTCGACCAG
GCCGGAATCC GCCCGTCCCA GCGGGCGCGG ACCGTCGGCA GCCTTCGCCT GGAGCTGCCC
GCCTTCCAGG AGGGCGTCGG CTGGCAGGAC TTCCTCGCCC GCAACCAGCG GTTTCCCGAC
GACCGGCCGC GGCGGGTGCG GATCTATCTG GCCAGTGGTG TGGAGTCGGT GCACCAGCGT
GACCAGCTCG TGCGGGCCGC CGAGCGGGAG TACCTCGCAC TACGCGGCAT CGACTATCCG
GGGATCGCGG CGCCGATCGA CTTCGTCGAG CACGATCTGG GCCCCGCGCT CGTCTTTCCC
CACGATCCCG AGCTGATCCG GCTGGACCAT TTCCTCCAAG AACACGAACG CGAGCTGTCC
TTCGACGATC GGTTGGCACT GCTGCGCGTC CTTGCCGAGA CCATGGCCTA CGCCCACCGG
CGGGTGCTGA CGCACCGGGG CCTGAGCCCG CGATGCGTCT GGGTTCGGCG CCGGTCCGAC
CGCTTCGCCC TCCAGATCAC GGACTGGCAG ACGGCCTCCC GCGGCTCCGA ATCGACGTCA
GGGATACCGA GCACCACCGG CCCGGCCACG TCGACGGGGG GCTACGTCGA ACTGGCCGAC
GACGCCGCCG CGGCCTACTT CGCACCCGAG TGGAGCTGGG GCACGGCGAA GGGGGTCACC
CTCGACGTCT TCGGGGTCGG CGCCATCGCC TACCGGATCT TCACCGGGAA ACCCCCCGCG
GCCTCATCCG GGGAGCTCAG CCGACGTCTG TCCGAACACA ACCGACTGTT ACTCGCCGAA
CAGGTTGATG CCGTCTCCGA GGAGCTCAAC AAGCTCGTCG CCCGGGCGAC CGAGGCCGAC
CCGGAGCAAC GCACCCGGGA CATGGCCGCG TTCCTGCTCG ATCTCGACCT GGTTCGGGAA
CGGCTGGCCG CCATCGCGGA GGAGGCACCG GTTGTCGTTG ATCCCCTGGA GGCCGAGGCC
GGCGCCGAAC TGGAGGGCGG ATTTTCGGTC CTCGGCCGGC TGGGCCGCGG CTCGACGGCG
CTCGCTCTGC TCGTCGAGCG GGACGGCGGG CGCGCGGTTC TCAAGGTGTC GTTGGACCGA
GACAGGGATC CGCGGTTGGT CGCCGAGGCC GAGACGCTGC GCACCCTCAG CGAGCATCCC
GGTGTCGTGC AGCTGCTGAG CGACGGCGTG ATCGACGTCG GCCCACGCCG GGCCCTGTTG
ATCACTTCAG CCGGGGAACG CACCCTCGCC CAGGAGCTAC GCGAGCGAGG CCGACTACAA
CCGGAATGGC TCCAGGGGTG GGGCGACGAC CTGCTGGAGA TCGTCCAGCA TCTACAGCGG
GCCGGGCTGG CGCATCGCGA CATCAAGCCG GACAACCTCG GCGTCGCCGA ACGTGGCGGC
AGGGGCAAGA AGCGGCAGCT CGTCCTGTTC GACTTCTCCC TGGCGAAGGA GCCGTTGGAG
GCCATCGAAG CCGGCACCCG CCCGTACCTC GACCCGTTCC TCGGCCGCGG CGAGCGCCGC
CGATGGGATC AGGCTGCCGA GCGATACGCC GCGGCGGTGA CGCTCTACGA GATGGCGACC
GCACGCCGGC CCGAGTACGG AACCCATGGC GGGCATCCGA GTTTCGTCGA CGCCGACGTG
GCGATCGAAC CCGAGCTGTT CGACCGGTCG TACGCGACCG GTCTCACCGA GTTCTTCCGC
CGAGCGCTGC ACCGGGACGT CAGACAGCGC TTCGACACCG CGGAGGACAT GCGCCGCGCC
TGGAACGCGA TCCTGGCCGA ACCGGCCAGC GTGGTCCCGC AGCCGCCCTC CGCGCGCATC
AGCCGGGACA CGCCCATCGG CGCCGCCGGT CTGTCCCGCC CGGTGCTGTC GGTGTTCGAA
CGGCTGTCGA TCGATACGGT GGGCGGCGCG CTCGACCTGT CGCCCGCCCA GGTCGTCTGG
CTGCCCGGCA TCGGTACGAA GACCCGCCAG CAGTTGCGTG CGGACCTGGA CCGGCTGGCC
GGCCAGGTCA CCTCCTCCCC GGCCGAGCGG CCCACCGAGC CGACGCTCCT CGACCGCGTC
GCCGCCGGGC TCATCCCCCG CGCGACCGAC CGGGCCGTCG CCGAAGCCCT GCTCGGCCTC
GGCGAGCAGG GCGGCAACGC ATGGACGAGC GTGCGCGACG CGGCCAAGGC CCTTGACCGC
GAGCAACGTA CCGTCCGGCA GGCGGTAGCC AGGTTCGAGC GGCATTGGCT GGCCCTCGAC
GGGATGCGAG AGCTGCGGGA CACCATCGTC AAGGTCGTCG AGTCGGTCGG TGGGGTCGCC
TCCGCCCGGC ACTGCGCGGC GGCCCTGCTC GACTTTCAGG GCAGCACGGT GGAGGAACCG
CTGCGCTCCC GGCTCGCCGA GGCCGTCGTA CGCGCGGCGA TCGATGCCGA GCTGGCCGAC
GACTCCCCCC GCGACGACAC CACTCACGAC ACCACCCACG ACGCCTCCGG CGGTTCCGAG
ATGGGTGGCG ATCCACGGCT GGTGTACAGC CGAGAGCCCA ACGGCATCCT CGTCGCCGCC
GGCCCCGCGC GCGCCGGGGA CGGGCCCTCC ACCGCCGATC GGTTGGACTG GGCCTCCCGG
CTCGGCGGCG CGGCCGACGA CCTCGCCGGC GCGGACCCGC TGCCGGCTCC GGCTCGGGTC
ATCGAGACGT TGCGCGCCGT CCGCGCCCCC GGCGACACCG ACCCCTCCCT GATCTTCCCC
GAACGGCTCC TCGACGTCGC CATGATCGCC TCCGAGAACG CCGCGGTGAC GCCGCGCCTG
GAGGTCTACC CGCGAGGCCT CGACGCGGGA CGGGCGCTGC GGCTGGCCGC CGGGGCGCTG
TACGGCCGCA CGGAGCTGAC CCCCGAGCAG GTGGCCGAGC GGATCATCGT GCGGTTCCCG
CACGCGGGCG CGCTGCCCGG CCGTCCCGAG CTGGACGCCC TGCTCGACGC GGCCGGGGTG
CCGCTGTGCT GGGACGACGA GAAAAAGCGG TACGTGACCC GCCGCATCGA GGTGACCGGG
CTCACCTCCC TGGTCACCTC CCGTGGAACC AGGCCGCGCT GGAGCACCGG CGCGGGCGCC
GCCTGGACGA CAATGGGATG GCGCCGGGTC TCCGACGAGG TGCTGGCCGC GGATGACCGG
CTCACCCGTT CGCTGGTGGA CGGCGGCTGG CTGGTGCTGT CGGTGCCGCC GCGGCGACTC
GCCCGGGCCG AACGCTGCCT GGCCGCGCAG GATGTCACCG TTGTCGACGC GGAACAAGCC
CTGCTCGCGG GGATGCGGGA GTTCTGCGCC CAGCACCGGG TGCAGTGGTC GATCGTCCTG
GCGGCCGACG CCGCCGACCG ATCGTCGCGC GACTGGGCCA ACCTCTCAAG GGTCGCGCAG
GCCGGGCTCG CCGGTGTGCG GGCCTCGATC GAGGCCGCGG GGCCGGCGGT GCTGATCACG
AATGCGGGGG TGGTCGCCCG GTACGATCCG GCGCTGGTGG TACTGGATGA GCTGCGCGCA
TCGGTGCGGA TGACGACCGA GACATCACCG GTGCGTACGG TGTGGCTGCT CGTACCCTGG
GCCGACGTGG ACAAACAACC GTTGTTGGAT GGTGGCGCTC CCGTGCCCCA GTTCGGCAAC
CAGGGCCTGG CGCTGTCCGA GGAGTGGATT GTCCGGCACG AGTCTCGGCT CGCCGACGGC
GCTGAGGGAG GAGCAGCGTG A
 
Protein sequence
MDDERWTQCT PSEYAWERAA LSYLKAQLPT GDPYRAWANA EFLGSDGSVN EVDLLLVLPA 
GIVVLEIKSW SGVLVGDAGT WRQSHRAPVD NPVIGANRKA RKLKSLLVSR PAMRGRRVPW
VEGAVFLSDA RLEIRLVPEG RAHVFGRHDQ TQLPSFLDFA RQPGRVDGEL SNALARAVDQ
AGIRPSQRAR TVGSLRLELP AFQEGVGWQD FLARNQRFPD DRPRRVRIYL ASGVESVHQR
DQLVRAAERE YLALRGIDYP GIAAPIDFVE HDLGPALVFP HDPELIRLDH FLQEHERELS
FDDRLALLRV LAETMAYAHR RVLTHRGLSP RCVWVRRRSD RFALQITDWQ TASRGSESTS
GIPSTTGPAT STGGYVELAD DAAAAYFAPE WSWGTAKGVT LDVFGVGAIA YRIFTGKPPA
ASSGELSRRL SEHNRLLLAE QVDAVSEELN KLVARATEAD PEQRTRDMAA FLLDLDLVRE
RLAAIAEEAP VVVDPLEAEA GAELEGGFSV LGRLGRGSTA LALLVERDGG RAVLKVSLDR
DRDPRLVAEA ETLRTLSEHP GVVQLLSDGV IDVGPRRALL ITSAGERTLA QELRERGRLQ
PEWLQGWGDD LLEIVQHLQR AGLAHRDIKP DNLGVAERGG RGKKRQLVLF DFSLAKEPLE
AIEAGTRPYL DPFLGRGERR RWDQAAERYA AAVTLYEMAT ARRPEYGTHG GHPSFVDADV
AIEPELFDRS YATGLTEFFR RALHRDVRQR FDTAEDMRRA WNAILAEPAS VVPQPPSARI
SRDTPIGAAG LSRPVLSVFE RLSIDTVGGA LDLSPAQVVW LPGIGTKTRQ QLRADLDRLA
GQVTSSPAER PTEPTLLDRV AAGLIPRATD RAVAEALLGL GEQGGNAWTS VRDAAKALDR
EQRTVRQAVA RFERHWLALD GMRELRDTIV KVVESVGGVA SARHCAAALL DFQGSTVEEP
LRSRLAEAVV RAAIDAELAD DSPRDDTTHD TTHDASGGSE MGGDPRLVYS REPNGILVAA
GPARAGDGPS TADRLDWASR LGGAADDLAG ADPLPAPARV IETLRAVRAP GDTDPSLIFP
ERLLDVAMIA SENAAVTPRL EVYPRGLDAG RALRLAAGAL YGRTELTPEQ VAERIIVRFP
HAGALPGRPE LDALLDAAGV PLCWDDEKKR YVTRRIEVTG LTSLVTSRGT RPRWSTGAGA
AWTTMGWRRV SDEVLAADDR LTRSLVDGGW LVLSVPPRRL ARAERCLAAQ DVTVVDAEQA
LLAGMREFCA QHRVQWSIVL AADAADRSSR DWANLSRVAQ AGLAGVRASI EAAGPAVLIT
NAGVVARYDP ALVVLDELRA SVRMTTETSP VRTVWLLVPW ADVDKQPLLD GGAPVPQFGN
QGLALSEEWI VRHESRLADG AEGGAA
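
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), which uses the standard codon assignments but accepts alternative start codons, each read as methionine at the initiation position. The sketch below is a minimal, self-contained translator written under that reading; `clean` and `translate` are illustrative helper names, not part of any database tool.

```python
# Sketch: translate a CDS under NCBI translation table 11.
# Table 11 shares the standard codon assignments; its start codons
# (TTG, CTG, ATT, ATC, ATA, ATG, GTG) are read as M at position 0.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("TTGGACGATG AGCGCTGGAC GCAGTGCACA"))  # MDDERWTQCT
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence to `translate` would allow the same comparison over the whole record.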