Gene Franean1_4070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4070 
Symbol 
ID5672428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4851611 
End bp4853632 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content77% 
IMG OID641242946 
Productserine/threonine protein kinase 
Protein accessionYP_001508363 
Protein GI158315855 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00170756 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGAGA AGGCGGAGCA ACCAATGCAG AGCGACCGGG CCGCGGGCCG GGAGCGCCGA 
CTGGCCGGCC GGTACCGGCT GGGCCGCGTG CTCGGCAGCG GCGGGGCGGG AATCGTCCGC
GAGGGCGAGG ACATGCTCCT GCGCCGGCCG GTCGCCATCA AGCAGGTGCG CCTGCCGCCG
CTCGCCTCCG AGACCGAGCG GGAGGTCATC GGCGAGCGGG TGCTGCGCGA GGCGCGCGCG
GCGGCGCGGC TGCGCCATCC CGGTCTGGTG GCCGTCTACG ACGTCATCGA CGCCGAGGAT
TCGCCGTGGA TCGTCATGGA GCTGGTCGAC GGCCGGTCGC TGGCCGAGGA GATCCGTGCG
TCGGGCCGGC TCGCGCCGGC GTGGGTGGCG CGCATCGGGG TGAGCCTGGC CTACGCGCTG
GAGGCGGCGC ACCGCGCCGG GGTCGTGCAC CGGGACGTCA AGCCGGGCAA CGTGCTGCTG
ACCGCGGACG GGCAGGCCCG GCTCACCGAC TTCGGCATCG CCGTCAGCGA GGGCGACGCG
ACCCTGACCA GCACGGGGAT GGTGGTCGGC TCGCCGGCCT ACATTCCGCC CGAGCGGGCC
CGCGGCGCGC GGGTGGGCGT GCCCGGTGAC GTGTGGGGCC TGGGTGCCAC GCTGTTCACC
GCGGTAGAGG GCGAGGCGCC GTACGCGGGT GAGGGCGCCC TGGCGACGCT GGCGGCCATC
ATCCAGGATC GGCGCCGTCC TTACCGGTAC GCGGGCCCGT TGCAGGATGT GATCGACCAG
CTGCTCGATC CCGATCCGGT GCGGCGCCCG TCGTTGGCGC AGGCCCGGGC GCAGCTGCGG
CGGATCGCGG CCACCGCCGA GCCGCATCCG ACCGCCGTGT TCGACGCGGA GGCCGACGCG
CTGCTCGACG CCGACGTCGG CGACCACCCG GACGCGCCCC CGTCCGGGCC GGCGGGAGCC
GAGGAGCATG CCGATCTCGG GCCCGGTGAC GACCTCGGAT CCGGCACCGG CGGCAGTGCC
GGCAGCGGCG CTGGCCGCGG TGCCGGCCCG GGTCCGCCGG TCGGCACCGG GCCGCGGACG
GGCCCTCGCG CCGGCTTCGG TTTCGGCTCC GGTACCGGCC CCGCCGCCCG GACCGGATCC
TCAGAGACAG GCCCAGGACC GGGGCCAAGC CCAGGTGCAG GTCCGGGGCC AGGGCCGGAT
TCAGCTGCGG GTGCGCGCGC CGGCGGGGCC GGGGGCGCAC CGGAACGGCG CCGGCGGGTG
GTGCTGCTGG CGATGGTTGT CGCACTGCTG CTCGCCGCGG GTGGGGTCGG GCTCGGGCTG
GCCCTGACCA GCGGTTCGGG TACCCCGTCC GCCGGCCCGG CGTCGGGCAC CGGCGCCCCC
GCTACGCCCG CGGGTACCGG TGGGCCGTCC CGGGGCGCGT CGACCAGCCC CGCCGGCGGG
CAGGCCGTGG CACCGGGGGC ATCCGCCACT CCGAGCGGCA CCGGCACGGC GTCGGCGACG
CCTGGTCAGG GCGGGTCGGT GGGCGTGCCG TCGCTGGACA GGCTCGGCAC CGCGATCCCG
AACGACACCG CGCCCGCGGC GGCGCCGGCC GGCCTCGAGA CGCGCCGCGG CGCGGCCGGG
TGGTCGCTCG CCGTCCCGCC GCAGTGGGCC GACGCGAGCC GTGACCGGCA GCACGAGACC
TTCACGGCGC CCGGAGGCTA CCCGGACCTG CTCGTCGAGA CGCAGGATGT CGCCGGCCCG
TCGTCGATCC GGGCGTGGGA GGAACTGGAG CCCGGTGTCC GGTCAAAGAC GGCCGGCTAC
CAGCGGGTCT CGATCCGCCC GTCCGACGGG GCGGACGGCA CGACGTCCGC GGTGTGGGAG
TTCACCTTCA CCGCGGGCGG GCAGACGGTG CACGTCCTGG ACTTCGGTGT GGTCCGCAAC
GGGCATGGCT ACGCGCTGCG CTGGCGGGTG CCCGAGGCCC AGTGGCAGGA CCAGCTCGAG
CTGATCCGCA CCATCACCGC GACCTTCCGC CCCGGGCCCT GA
 
Protein sequence
MPEKAEQPMQ SDRAAGRERR LAGRYRLGRV LGSGGAGIVR EGEDMLLRRP VAIKQVRLPP 
LASETEREVI GERVLREARA AARLRHPGLV AVYDVIDAED SPWIVMELVD GRSLAEEIRA
SGRLAPAWVA RIGVSLAYAL EAAHRAGVVH RDVKPGNVLL TADGQARLTD FGIAVSEGDA
TLTSTGMVVG SPAYIPPERA RGARVGVPGD VWGLGATLFT AVEGEAPYAG EGALATLAAI
IQDRRRPYRY AGPLQDVIDQ LLDPDPVRRP SLAQARAQLR RIAATAEPHP TAVFDAEADA
LLDADVGDHP DAPPSGPAGA EEHADLGPGD DLGSGTGGSA GSGAGRGAGP GPPVGTGPRT
GPRAGFGFGS GTGPAARTGS SETGPGPGPS PGAGPGPGPD SAAGARAGGA GGAPERRRRV
VLLAMVVALL LAAGGVGLGL ALTSGSGTPS AGPASGTGAP ATPAGTGGPS RGASTSPAGG
QAVAPGASAT PSGTGTASAT PGQGGSVGVP SLDRLGTAIP NDTAPAAAPA GLETRRGAAG
WSLAVPPQWA DASRDRQHET FTAPGGYPDL LVETQDVAGP SSIRAWEELE PGVRSKTAGY
QRVSIRPSDG ADGTTSAVWE FTFTAGGQTV HVLDFGVVRN GHGYALRWRV PEAQWQDQLE
LIRTITATFR PGP