Gene Franean1_4910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4910 
Symbol 
ID5673250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5896694 
End bp5898352 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content74% 
IMG OID641243765 
Productserine/threonine protein kinase 
Protein accessionYP_001509181 
Protein GI158316673 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.104206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00224255 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGATCGTCG ATCGGGACGT CGTCGCGGCG GCCCTGCCCG GCTATGCTCT CGCGGCGGAG 
CTCGGCCGCG GATCCTCCGG CGTCGTGCTG GCCGCCCAGG ACGACGCACA CCGCCGCGTC
GCGGTGAAGG TCATGCCGCT CGCTGCCGAC GAGGACGCCA CCTTCGGCGT GACCGGCGTC
CTGCGGCCCG CCTCAGCCGT CGCGCTCGAC GCCGAGTCCG AGGCGCTCCT TCTGGCCGGC
CTGGACCATC CGCACCTCGT CCGCGTGTAC CGCTACACCG CCCACGCGGA CCTGGCCCTG
ATCGTCATGG AACTCCTCGA CGGTGGCAGT CTGCGGGTGC GGGCGACCAC CGGCCTCTCG
CTGGAGGCCG CCTGCGCCAT GGCGCTCGTC ACCGCGGCGG CCCTGGAGCA CGTGCACGCC
CGGGACATCC TCCATCGCGA CATCAAGCCC GAGAACATCC TGTTCACCGC GCAGGGAGTG
CCGAAGCTGA CGGACTTCGG CATCGCCCAC ACGCTGCGGG TGACGCCGAG GGCGCCTGGT
CCCGGTGCCC TGTGGGGTGT GGGTGCTCTG GGCGGTACAG GTGCTCCGGG CGGTACGGGG
GGGCTGGGCA GCGTCGGCGG GCTGGGCGGT GTCAATGGTC CGGGCGGTCC GCCGGGCTCG
GCGATCGGCA CCCCGCGCTA CATGGCTCCC GAACAGTTCA CGCTGAGCCC ACTGAGCCCC
GCGACCGACG TCTACGGGCT CGGGGTCGTC CTGTACGAGC TGTTGGCCCG CCGACCGCTC
TTCCTCGTCC GGCCCGCGAC CAGCGACGCG TGGGCCCGCC ACCACCTCAC CGTCACGCCA
CGCCCGCTGG CCGGGATCCC CGAGCCCGTT GCCCGGGTCG TCGAGCAGGC ACTCGCCAAG
GATCCGGACC GCCGGCCGCA GACCGCCCGC TCGTTCGCGC TCAGCCTGGC CGGGGCGATG
GCCCGCGCCC GCGGCCCCAA CTGGCTCACC CGGGCCGGCA TCCCGACGTT CCTCGACGAC
GAGGTCCAGG CGGCCGCCAC CGGACCGAGC ACGCCGTCCC GGACCTCCGG GACGAACCGG
CGCCCCGACC CAACCGGCGA CGATCACGGC CTCGAGGTCA TCCTGGACAG CCCGTCACCG
GCGGTGGCCG GTGACGGGCC CGGTGATCAC GCCGACGACC AGCGGACCCC GGACCAGCCC
ACGGCGGCCG AGCCCACAGC GGTCCAGCCC ACGGCGGATC CGGAGGCCAG TCAGGACAGC
GGTACCGACC CGGCGGCGGC CGGTCCCCCG ATGGCCGGCC CGGTCGCCGG CGAGGCGTCC
ACCGACGTCC TCACCAAAGA TCAGGACACC ACCACCACCG CCGCCGCCGA CCTCAGTCCC
GTCACCAGCC CAATCAGCAC CGCCAGCCCC ACCGACCCCA CAGACCCGGT CAGCACTCCC
GGCCCAGGCA GTACCGGCGG TGCCGTCGAC ACCGCCGGCA CGGTCCGCAC CGCAGGCGAC
GCCGGTGACG GCACCGGCGG CGACGTCCGC GCCGACAGTG CCAGCGGATC GGATCCGCGC
TGGCGAGCCA GCCTTACGCC GGCCCGGGTT TGGGCCCTCG CCGTGCTGGC GCTGGTACTG
GCGGTGCTGA TCGCCGTCCT GATCGCGATG CCGGGGTGA
 
Protein sequence
MIVDRDVVAA ALPGYALAAE LGRGSSGVVL AAQDDAHRRV AVKVMPLAAD EDATFGVTGV 
LRPASAVALD AESEALLLAG LDHPHLVRVY RYTAHADLAL IVMELLDGGS LRVRATTGLS
LEAACAMALV TAAALEHVHA RDILHRDIKP ENILFTAQGV PKLTDFGIAH TLRVTPRAPG
PGALWGVGAL GGTGAPGGTG GLGSVGGLGG VNGPGGPPGS AIGTPRYMAP EQFTLSPLSP
ATDVYGLGVV LYELLARRPL FLVRPATSDA WARHHLTVTP RPLAGIPEPV ARVVEQALAK
DPDRRPQTAR SFALSLAGAM ARARGPNWLT RAGIPTFLDD EVQAAATGPS TPSRTSGTNR
RPDPTGDDHG LEVILDSPSP AVAGDGPGDH ADDQRTPDQP TAAEPTAVQP TADPEASQDS
GTDPAAAGPP MAGPVAGEAS TDVLTKDQDT TTTAAADLSP VTSPISTASP TDPTDPVSTP
GPGSTGGAVD TAGTVRTAGD AGDGTGGDVR ADSASGSDPR WRASLTPARV WALAVLALVL
AVLIAVLIAM PG