Gene Francci3_0727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0727 
Symbol 
ID3905854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp838950 
End bp841451 
Gene Length2502 bp 
Protein Length833 aa 
Translation table11 
GC content73% 
IMG OID637878060 
ProductWD-40 repeat-containing serine/threonin protein kinase 
Protein accessionYP_479840 
Protein GI86739440 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.851559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.037092 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCCGCA AGCGGGAGAC GGGCAACGCG GGGACGGTGG CGCCCCCGTC CGCATCGCCC 
GCTCGACCCG ACCCCGCCTC CGAAGCGAAC CCCGTGGGAC CCGAGACGCC CTCCGGACCC
GCCGCTCGGT CGGGACGGGG GACGTCCTCG GCGCCGGGCT TCGGCCCGGT GGGCGGCAGC
GGTCTGACGG GGCCGTCCAT CGAGCCCCTC GACAGCGGTG ACCCGACCGA ACTGGGCCAG
TTCACGCTGC TGGGACGCCT CGGTGAGGGC GGGATGGGGA CCGTCTTCCT CGGGCGGGGC
AGGCCGGACG TCGCCGAGCA CGCCGGCCGG CTCGTCGCGG TCAAGGTCAT TCGGCCCGAC
CTGGCCCGGG TGCCCGAGTT CCGGGCCCGG TTCCGCCGGG AGGCCGACAT CGCGCGCCGG
GTCGCCCGCT TCTGCACCGC CGAGGTACTC GGCGTCGTCG ATCCGCCGGA CGGGCGGCCC
TACCTCGTCA CCGAGTACAT CGACGGCCTC ACCCTCGCCC AGACGGTGGC CGCGGACGGC
CCGCTGCGGT CGGCCGACCT GGAGCGCGTC GCGGTCAGCG TGGCCGCGGC GCTGACCGCC
ATCCACGGCG CCGGCCTCGT ACACCGTGAT CTCAAACCGT CCAACGTGCT GCTCTCAGCC
TTGGGTCCCC GCGTCATCGA CTTCGGCATC GCCCGGGCGC TGGACGCCCC GACGATGCTC
AGCCAGGAGA TCCAGCGGAT CGGTACACCG GCGTTCATGG CCCCGGAGCA GGCCAACGGC
GAGCCGGTGA CGGCGGCGGC GGACGTCTTC GCGTGGGGTG GCCTGGTGAC CTATGCGGGC
ACCGGTTCCT TCCCCTTCGG GGACGGGCCG ACGCCGGTGC AGCTCTACCG CGTCGTACAC
CGCGAACCAC TGCTCGACGG TCTCGCTCCA GCCCTGCGGC CGATCGTCGA AGAGGCCATG
CGCAAGGACC CGGCCACCCG GCCGAGTGCC CAGGAGCTGT TCCTGCGTCT GGTCGGAATG
GGTCCGACCA CGCATCCCGA CCCGGAGGTG ACCCGCGTCA TCCGTGCCGG GGTGTCGATG
CCCGCGCCCC CGCAGCGGAC CGACCCGGAC CGCTGCGCGC CCGGACAGAT GTCTGCCCCG
GGCGGGTCTG CCTCAAGCGG GTCGGTCGGC GCCGGGCCGC CCGTCGACCT GTCGGCACGG
GACCGCGGCC GGTGGAACTG GCGCCGCGTC GCGCTGCTGG CCGCCCCGCT CCTCGCGGTC
CTGCTCATCG CGGCCCTCAT ACCATTCGCG CTCACCACCG GCAGTGACCG GCGGCCGAGC
CGGGAGGAGA CGGCCGCTCG CGTCGCTCTG GCCGCCGAGG CCGTCCGCAA CTCCGATGCG
AACCTGGCCG CCCGGCTGTC CCTGGCGGCC TACCGCATCA GCCCGGTGCG GGAGGCCCGC
GCCGCCCTGC GCACCTCCTT CGCCGCCGCG ACGGCGACGG TGCTCGACGG GCACACGCAG
TCGGCGCTCG GTGTGGACAT CAGCCGGGAC GGGCGCCTGC TGGCCTCGAC CGGTGCCGAC
AACCTCGTCC AGCTGTGGGA CATCTCCGCG CGTTCCCATC CGGTGAAGCT CGCCACCCTG
GCCAGGCACA CCTCGTGGAC GCTCGACGCG GCGTTCAGCC CGGACGGACG GCTGCTCGCC
ACGGTCAGCT ACGACCGGTC GGTGATCCTG TGGGACCTCG GCGATCCGCG CCACCCCGTC
GAACTCTCCG TCATCCTCGG ACACAACGGC TGGGTACTCG ACGCCGCGTT CAGCCCGGAC
GGGAAGGTCC TCGCCACCTC GGGCTATGAC AACACGGCCC GGCTGTGGGA CGTCACCGAT
CCGCGCCGCC CCAGTCAACT GTCCGTGCTC GACCGCCACA CCAGTTGGGT GAACGAGGTC
GCGTTCAGCC CGAATGGTCA CCTGTTGGCG ACCGCCAGCG CCGACCGGAC GGCCCGGCTG
TGGGACGTCA CCGATCCGCG CCGGCCGCGA CCGCTCGCGG CCATCACCGC GCACACCGAC
TACGTGTGGG CGGTCGCCTT CAGCCCGGAC GGCCGGCGCC TCGCCACGGG CGCCTATGAC
GGTACGGCCC GAATCTGGGA CATCACCAAT CCGTCTCGGC CGGCGGCCAC CGCTTCCTTC
CCGGCCGACG AGAAATGGGT GTTCGACGTG GCGTTCAGCC CGGACGGCAG GACCCTGGCC
ACCGCGGGCT GGGACACCAC GGTGCACCTG TGGGATGTCA CCGAACCGGG CCGGCCGCCG
GCGATCGGCA CCATCACCGG GCACGGGGAC TGGGTGCAGG CCCTGGCGTG GACACCGGAC
AGCCACAGCA TCGCCACGGC AAGCGACGAC TACACCGTGC GCATCAGCCG GATCGGTGAC
GCCGACCTGA TCGCCGCCGC GTGCGCCGAC CCGTCGAAGC AGATCACCGA TGCCGAATGG
CAGCGCCACA TCTCAGACGT GCCCTACCAA CCGGTCTGCT GA
 
Protein sequence
MRRKRETGNA GTVAPPSASP ARPDPASEAN PVGPETPSGP AARSGRGTSS APGFGPVGGS 
GLTGPSIEPL DSGDPTELGQ FTLLGRLGEG GMGTVFLGRG RPDVAEHAGR LVAVKVIRPD
LARVPEFRAR FRREADIARR VARFCTAEVL GVVDPPDGRP YLVTEYIDGL TLAQTVAADG
PLRSADLERV AVSVAAALTA IHGAGLVHRD LKPSNVLLSA LGPRVIDFGI ARALDAPTML
SQEIQRIGTP AFMAPEQANG EPVTAAADVF AWGGLVTYAG TGSFPFGDGP TPVQLYRVVH
REPLLDGLAP ALRPIVEEAM RKDPATRPSA QELFLRLVGM GPTTHPDPEV TRVIRAGVSM
PAPPQRTDPD RCAPGQMSAP GGSASSGSVG AGPPVDLSAR DRGRWNWRRV ALLAAPLLAV
LLIAALIPFA LTTGSDRRPS REETAARVAL AAEAVRNSDA NLAARLSLAA YRISPVREAR
AALRTSFAAA TATVLDGHTQ SALGVDISRD GRLLASTGAD NLVQLWDISA RSHPVKLATL
ARHTSWTLDA AFSPDGRLLA TVSYDRSVIL WDLGDPRHPV ELSVILGHNG WVLDAAFSPD
GKVLATSGYD NTARLWDVTD PRRPSQLSVL DRHTSWVNEV AFSPNGHLLA TASADRTARL
WDVTDPRRPR PLAAITAHTD YVWAVAFSPD GRRLATGAYD GTARIWDITN PSRPAATASF
PADEKWVFDV AFSPDGRTLA TAGWDTTVHL WDVTEPGRPP AIGTITGHGD WVQALAWTPD
SHSIATASDD YTVRISRIGD ADLIAAACAD PSKQITDAEW QRHISDVPYQ PVC