Gene Franean1_0669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0669 
Symbol 
ID5669086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp789736 
End bp792663 
Gene Length2928 bp 
Protein Length975 aa 
Translation table11 
GC content74% 
IMG OID641239596 
ProductWD-40 repeat-containing serine/threonin protein kinase 
Protein accessionYP_001505034 
Protein GI158312526 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0315888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.915288 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCATTG CCCGGCGACA CCTCGAACGT TCGCGCGGCA ATCAACCGGT TTCGATCGGT 
CCGTATGTTG TCGAACGTCA GCTCCTCGAG GCCGGAACCG GGCCGGTCTA TCTCGGGCGG
GATCCCGAGG GCGGCCAGGT GGCCATCAAG GTGATCAGCG CGGCGTTCGC TCGCGACCAT
GATTTCCGCC GCAGGCTGCG CGCCGATCTC GAGACCGTAC GCGCACTGGC ACCTCCGTGC
CTGGCGGACA TCATCGACGC CGACACATCC GCCCATCCGC CGTATGTCGT CACCGAGTTC
GTGGACGCTC CGACCCTGGC AGCGACGGTC GCGCAGGGCG GTCCGCTGGC GGTGCCCGAC
GTTCGCCGGC TCGCCGTGGC GCTCGGTTCG GCGCTGACGG GGCTCCATGG TGCCGGTCTC
GTCTTCGGTG ACCTTAAACC GGCGAATGTG GTGCTTTTCG AGGGTGGAAT CCGCCTCGTC
GACTTCGGGT TGTCGCGAGT TTTGAACACC GTTGCCCTAC CGGGTCGCGG CGGATCAGGG
CCCGGGATGG GTACCCCCGC ATTTATTACC CCGGAGCATG TCCTGAGGCA GCCGCTCACG
ATGGCGTCAG ACATCTTCAC GTGGGGTGGA GCAGTCCTCT TCGCCGCGAC CGGACGACTG
CCGTTCGGTA ACGGATCACC GCAGGTTCTG TTGCAACGTG CGGTCTATGC GGAACCCGAC
CTCACCGGTT TGGACCCGGT ACTGCGCGAC GTAGTCAGCG CCACGATGCG TAAGGATCCC
TCCCGCCGAC CCGGCGCCGC GGAGCTGCTC GAGGTGCTCG GACGACTTGT CGGTGGCCTC
CCGGCACCCA CGGGCCTGGA TCCGCTCGAG ATCGCCGCCG CGGCGGCCGT AAGCGCCGCC
GCCCCGCCCA CCGAGACACC CGCCGCCCCG CCTGACGAGA TGTCTGCGGT CGAGGCGCCT
GCCGACACCG CTGGCGTCGT CCCGGCGGCC GCTGTGGCAG TCGCCGAGGC CGCCGAGCCC
GCCCCGGTGC CGGCGAGCGC GCCGGACGCG GCTGTCGCTC CGGATCCGAC GCCCGACTCC
GATGCCGAGC CCGCGCCCGA GGCCCCGGCG GAGTCTGAGC CCACGGCGGA GTCTGAGCCC
ACGGCGGAGT CTGAGCCCAC GGCGGAGTCT GAGACGAAGC TTGAACTCGG GCCCGAGGCC
TCGCTCGAAC CTGAGCCCAC GGCGAAGTCC GAAGCGAGGC CCGAGCCTGA GAGCGGTCCC
GGGGCTGGCA CCGGGGCGGC GGCCAGCGCT GCCGCCGAGC CCGGGATCGG AAGGCCGCCG
GCTCAGTCTT CGGAGGCGGC CCAGTCTTCG GAGCTGTCGG AGCCCACTGC GCTGGCTGCG
CTGGCCGACG GACGTCGACG GCCTGACGGG CACTGGTTGT TTCGGCTGTT CATGGCGGGG
ATCGCCAGTC TCGCCGTCCT GACCATGGCG GTCGAGATCA CGGGGGCTGT TCGGGCCCAG
GCCGCGGTGC GGGCGTCCAA GGCCACCGCC GGTCAGGCGC GCGCCCTGCT GGAGCGTCAG
CCGGACCTCG CCGGCCAGCT CGCCGCGGCG GCCTACGAGA TCGCTCCGAC CGCGGCGGCC
GGCGAGGCGC TCATCGCCGC GGCTGTCCGT CGGAGCGGCC ACCTGCCCGG CGACGTCCGC
GACCTCGCCG TGGCCCCGGA TGGCAGCAGC ATCGTCACCG CGGGCGACAC CGGCGCCGGC
CTCTGGAACC TCACCGACCC GTCCGCCAAC CGTCGCATCA CCGCGTTCCC CGTCGACGGG
CCCGCGGTCA GCGCTGTGGG CTACGTGTCG TCCCCCGGGC GGACGGCCGC CGCCGGCCAG
ATCATCGTCA CCGCGGCCGG CCCGGCAGGC GCGGGCGAGA GCAAGGTCCA GCTGTGGCGG
GTGACGCCGG ACGGCGCGGT CGAGCGGCTC GGCGTGCTCG CCGGGCACAC CGGCACCGTC
GGCGAGATCG CGGTGAGCCG GGCCGGCGAC GCGATCGCCA CCGGCTCCTC GGACGGCATG
CTCCGGCTGT GGGACGTGAC CGACCCACGC GCCCCCGCCG AGCTCGCCGT CCTGCGGACA
CCCGGGCCGA TCACCGCGTT GGCGTTCTCC CCGGGCGGGG ATCAGATCCT GGTCGGCGGG
GCCGCCCGCC TCTCGCTGTG GTCGCTGCAT GATCCGCGCC AGCCGCGCCG GCAGGGCCTG
CTGCACGGCG AGGGCGTCGT CGCCGGCGCG TCCTACTCCC CGGACGGCCG GACGCTCGCG
GTAGCCACCA CCGGCCTGCC GGACGCCGCG ACGGAGCCCG GGTCGACCGC CGGCGTTCCG
ATGGCGGTCG CCTCGTCCCT GGCGCCGCCG GAGAAGTCGC GGTCCGTGGT GGAGATCTAC
CAGCCGGGGG ACCCGCGGGG GCTGCACCGG CTCACCTCCT TCGCCCCCGC GAGCGGCGCG
GGAATGGTGG CCTTCTCACC GGACGGGCGG GCACTCGCGG TGGCCGCCGC GGCGGGCGGC
GGCGACGTCG GCGTCTGGGA CATGTCGAAC CCGGCCCGGC CGCGCCCCCG GCTCGCCCTG
CCCACGCCCG CGGCACCCTC CGACTCCGCC GGACCCTCCG ACTCCGCCGG GCCCTCCAAC
CTGGCGGCGC TCGCGATCGC CGGGTCGGCG CCCGCCGCAC CCGAACCGGC GGCGCTCGAA
TCGGCCACGC CCCCACCGGC CAGCCTCGCG CCGACCGCGC TCGCCTTCGC CGACGGCCCC
GCCCGGACAC TCGCCGTCGC CGACGGGAAC GGCGCGCGCG TGTGGGATCT CGACCCGCGG
ACCGCGCGGG ACCAGGTCTG CGGCCGGGCC CAGGCCGAGA TCACGAGGCG GGACTGGCGT
AGGTACATCC CCGACCGCCA CTACTCGCCG CCCTGCCCCC GGAACTGA
 
Protein sequence
MVIARRHLER SRGNQPVSIG PYVVERQLLE AGTGPVYLGR DPEGGQVAIK VISAAFARDH 
DFRRRLRADL ETVRALAPPC LADIIDADTS AHPPYVVTEF VDAPTLAATV AQGGPLAVPD
VRRLAVALGS ALTGLHGAGL VFGDLKPANV VLFEGGIRLV DFGLSRVLNT VALPGRGGSG
PGMGTPAFIT PEHVLRQPLT MASDIFTWGG AVLFAATGRL PFGNGSPQVL LQRAVYAEPD
LTGLDPVLRD VVSATMRKDP SRRPGAAELL EVLGRLVGGL PAPTGLDPLE IAAAAAVSAA
APPTETPAAP PDEMSAVEAP ADTAGVVPAA AVAVAEAAEP APVPASAPDA AVAPDPTPDS
DAEPAPEAPA ESEPTAESEP TAESEPTAES ETKLELGPEA SLEPEPTAKS EARPEPESGP
GAGTGAAASA AAEPGIGRPP AQSSEAAQSS ELSEPTALAA LADGRRRPDG HWLFRLFMAG
IASLAVLTMA VEITGAVRAQ AAVRASKATA GQARALLERQ PDLAGQLAAA AYEIAPTAAA
GEALIAAAVR RSGHLPGDVR DLAVAPDGSS IVTAGDTGAG LWNLTDPSAN RRITAFPVDG
PAVSAVGYVS SPGRTAAAGQ IIVTAAGPAG AGESKVQLWR VTPDGAVERL GVLAGHTGTV
GEIAVSRAGD AIATGSSDGM LRLWDVTDPR APAELAVLRT PGPITALAFS PGGDQILVGG
AARLSLWSLH DPRQPRRQGL LHGEGVVAGA SYSPDGRTLA VATTGLPDAA TEPGSTAGVP
MAVASSLAPP EKSRSVVEIY QPGDPRGLHR LTSFAPASGA GMVAFSPDGR ALAVAAAAGG
GDVGVWDMSN PARPRPRLAL PTPAAPSDSA GPSDSAGPSN LAALAIAGSA PAAPEPAALE
SATPPPASLA PTALAFADGP ARTLAVADGN GARVWDLDPR TARDQVCGRA QAEITRRDWR
RYIPDRHYSP PCPRN