Gene Franean1_4089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4089 
Symbol 
ID5672447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4871523 
End bp4873841 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content75% 
IMG OID641242965 
Productserine/threonine protein kinase 
Protein accessionYP_001508382 
Protein GI158315874 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.205571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAGTCG TCGACCGCGC GCGTGTCGCC GCCGCGCTCC CGGGCTACGG GATCGGCGGT 
GAGCTGGGGT CGGGTGCCTT CGGGCTCGTC CTCGCCGGCT ACCACCAGGA GCTCGACCGG
CGGGTCGCGG TCAAGGTGCT CTCCAGCGGA GCGGCCGAGG CGGCGGCGGA CTTCCGCACC
GAGGCCCGGT TGCTCAGCAG GCTCGACCAC CCGCACATCG TGCGGACCTA CGACTACGTC
ACCCGCGGTG ACCTGTGTCT GCTGGTGATG GAGATGCTGC CCGGCGGGAC GCTGACGCAC
CGGGTGATGC GCCCCGAGGC GGCCTGCGCG GTCGGCCTCG CCGTCGCCGA CGCGCTCGCC
CACGCGCACG CGCACGGCGT GCTGCACCGC GACATCAAAC CGGCGAACAT CCTGTTCACC
GACGCCGGCC AGCCGAAGAT CACCGACTTC GGTATCTCGA AGGTGTTCGA GGGGTCGGCG
TCGACGGCGA GCCGGGTGGT GGGCACCCCC CGCTACATGG CCCCCGAGCA GATCACCGGC
GACCGGCTCA GCCCGGCCAC CGACCTGTAC GCGCTCGGCG TCGTCCTCTA CGAGCTGTTC
ACCGGCGCGT CACTGGCCCC GCCGTCGCTG CCGCTGCCGG TGCTGCTGCG CCATCACACC
GAGGTCGTCC CGCCCCCGCC GCCGGGGGTG CCGGCTCCGC TGGCCGCCGT GCTGATGCGC
AGCCTGGCCA AGCGGCCCGA GGACCGCCAT CCCAGCGCCC GGGCGTTCGC GATCGATCTG
GCCCGCGCCG CCACCGCCGT GTTCGGGCGG CACTGGCTCG ACCGCGCCGG CATCCCGATC
CACCTCGCCG ACGACGTCCG CCAGATCACC ACCCGCAGGT TCGGGCGGGC CCCGCGGGCC
GACGACCATC CGTGGCGCGG TGCCGGCGGC CCGGGCCCGG CCGGACGGGT GCGTTCCCGG
CTGCCCGCGG TGCTGCTCGC CGGCGGCGGA TCCGGCGGCC GGGTGCCGCT CGCGGTGACG
GCGCTGGTTG TCCTCCTCGC CCTCGTCGCC GGTGGCGGGG TCGGCTGGGC GGTGTGGGGC
CAGCCCGAGC CCGCGCCCCC GGCGCCGCCC GCGCCGCCGC CGGCCTGGCA GCCGGAGCTG
GCCACGGGCA CGATCACCAC CGTGTTCGGG GCCGGCTCGG ACGGCTTCTC CGGCGACGGC
GGTCCCGCCG TACAGGCGGA GTTCGACCGG GCCAGCGACC TGACCATCGA CGCCCCGCGC
GGCTACGTCT ACGTCGCCGA CACCGACAAC CACCGGATCC GCCGCGTCGA CCGGGCCGGC
ACGATCACCA CGGTCGCCGG CACCGGGGCG GACGGCTTCT CCGGCGACGG CGGCCCGGCC
ACCGAGGCGC AGCTCGACGA GCCGACCAGC GTCGCGGTCG CCCCCGACGG AACCCTGTAC
GTCGCCGACA CGCGGAACCA CCGGGTTCGC CGCATCGGCC GGGACGGGAT CATCACCACG
ATCGCGGGGC AGGACGAGTT CGGTTTCGCC GGTGAGGTGA GCGAGGACGG CCTGGCCTAC
TCGGGCGACG GCCTACCCGC GGTGAACGCG AAGCTCAACT ACCCCAACAC CGTGCTGATG
GAGACCGACG GGAGTCTACT GATCGCGGAC GGTGAGAACA ACCGGGTCCG CCGCATCGGC
CTGGACGGGA TCATCACCAC GATCGCGGGC ACCGGCGCGG AGGGCTTCGG CGGTGACGGC
GGCCCGGCCA CGTCCGCCCG GTTCAGCTAT CCGAGCGCGC TGGCCCGCGG CCCCGACGGC
AGCCTCTATG TCGCCGACCA GGACAACCAC CGGGTGCGCC GCATCGCCGG GGACGGGACG
ATCAGCACGC TGGCCGGGAC GGGAAAGACG GGGTACTCCG GTGACGGCGG CCCGGCCGAC
CAGGCGCAGA TCAACGCGGT CGGCGCCGAC CTCGTCGTCG ACGCTGCGGG CAACGTCTAC
CTCTCCGACC CGGGGAGCAA CCGGGTCCGC CGCATCGCCC CCGACGGGAC CATCACCACG
ATCGCCGGTA CGGGGGTGTC GAAGTACTCC GGCAACGGCG GCCCGGCGAC CGCCGCCGAG
CTGGTGTACC CGGGCGGACT CGCTCTCGAC CAGCTCGGGA ACCTCTACAT CGCCGACGGT
ATCGACAGCC GGGTGCGAGC CGTACGCCTC CCGCCGGGCA GCTGCGCCGC CGCGCCGTGC
CCGACGTCCT CACCGGCCCC CGGCGCGTCC CCGGCGCCCG CCGGACCCGT CCCGCCCTCG
ACAGGTGCGA CGCCCGCGGC CGCGGGCACC GCGGCCTAG
 
Protein sequence
MPVVDRARVA AALPGYGIGG ELGSGAFGLV LAGYHQELDR RVAVKVLSSG AAEAAADFRT 
EARLLSRLDH PHIVRTYDYV TRGDLCLLVM EMLPGGTLTH RVMRPEAACA VGLAVADALA
HAHAHGVLHR DIKPANILFT DAGQPKITDF GISKVFEGSA STASRVVGTP RYMAPEQITG
DRLSPATDLY ALGVVLYELF TGASLAPPSL PLPVLLRHHT EVVPPPPPGV PAPLAAVLMR
SLAKRPEDRH PSARAFAIDL ARAATAVFGR HWLDRAGIPI HLADDVRQIT TRRFGRAPRA
DDHPWRGAGG PGPAGRVRSR LPAVLLAGGG SGGRVPLAVT ALVVLLALVA GGGVGWAVWG
QPEPAPPAPP APPPAWQPEL ATGTITTVFG AGSDGFSGDG GPAVQAEFDR ASDLTIDAPR
GYVYVADTDN HRIRRVDRAG TITTVAGTGA DGFSGDGGPA TEAQLDEPTS VAVAPDGTLY
VADTRNHRVR RIGRDGIITT IAGQDEFGFA GEVSEDGLAY SGDGLPAVNA KLNYPNTVLM
ETDGSLLIAD GENNRVRRIG LDGIITTIAG TGAEGFGGDG GPATSARFSY PSALARGPDG
SLYVADQDNH RVRRIAGDGT ISTLAGTGKT GYSGDGGPAD QAQINAVGAD LVVDAAGNVY
LSDPGSNRVR RIAPDGTITT IAGTGVSKYS GNGGPATAAE LVYPGGLALD QLGNLYIADG
IDSRVRAVRL PPGSCAAAPC PTSSPAPGAS PAPAGPVPPS TGATPAAAGT AA