Gene Franean1_5419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5419 
Symbol 
ID5673750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6540882 
End bp6544865 
Gene Length3984 bp 
Protein Length1327 aa 
Translation table11 
GC content72% 
IMG OID641244274 
Productserine/threonine protein kinase 
Protein accessionYP_001509680 
Protein GI158317172 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.388014 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGCGA GGATCGTCGT CCATGGTCCG TGGAGCGGGC CGGGTGAGGA GGCGACCGCC 
GTCTACCTGC GTGACCACCT GGACGAGCGG TGGACGGTCG TCTGCGGCCG CCAGCTCGTC
GCCGGCTCGG GTACGCGTGA CTCCGACTTC ATCGTGGTCG GGCCGAACAG TGTCACCGTC
ATCGAGGAGA AGTCCTGGCG CGTCGACCTG ATCGGCACCG AGGAGCGCTG GTACGAGCGT
AGGACCCTGG AAGAGCGTGG TAATCCGATC AACCAGGCCG TCGGCGTCGC GCGGATCCTC
GCCGGCAACC TCAACCGGTT CAACAAGCCG CTCGCGTCCG CGCTCAGGAG CGCGGCGCCG
CCCGGCCAGA GGCAGCTCCA TCTCGTCCAC CCGCTGGTCG TGATGTCCTC GCCCGAGGCC
ACGTTCCAGA TCGAGGACCC GCGAGCGGCC AGCCAGGTCG TGCGGCTGGC CGGCTGCGAG
AAGCGCCTGA CGGAGATCGA CCGTTCGGTC GCGGCGTACT TCGACCTCGC CCCGTTCCGT
GACGCGGTCC TGTCCGTCGT CACCGGCCTC AGGCCGAGGC CCGAGACGCC GGAGACGCTC
GGCGCCTACA AGATCGTCGG CAACATAGAG CCGACGCCAC GTGGCCGCCG CTACGTCGGC
CGGCACCCGG GCGGCTCGGT GCGGGTGCTG ATCACCTACG AGCGCCAGGG CCTCACCGAG
GCCGAGCTCA AGGAGAGCGA CAGGCTGATC CTGCGCGAGT ACGACGCGCT GCAGCGCCTC
GCGCACCTGC ACCGGGTGTT CCCCGTCGAC CCGTACTTCG GCGTCAACGA CGACCAGATG
TGGGTCGCTC CGATGCACCC GCCGGGCCCG AAGCAGAAGA CGCTGCGCAA TCTGATCCGG
GCCGGCAGGC AGCCGTCGGC GGCCACCTTC GCCACGGTCG CCGCGGACGC GTTCGCCGGG
CTCGCCGACA TCCACGCGGC GGGCATCACC CACCGGGCGC TGCACCCGAG CCGGGTCTGG
GTGACGCCGG AGACGAACCG GGTCGTGTTC TCCGACTTCC TGATCGCGAA GATCGCGGAC
GCGCGCACGG TGCTCGACGC GGACGACGTC GATCACGACG GCGGGCCGTT CCGGGCGCCG
GAATGCCGGC GTACCGCGCA CCTGGCGATC AACCCGTCGG ACGTCTACTC GCTGGCGCTG
TGCCTGCTCG ACTGGTGGGA GCTGCGCCAG TCCGACCCGC CGCGCGAGCC GCAGGCGCCG
CCGTCGCTGC CGGACCGGGT CCGCGAGGCG CTGAACGCGT GCTTCGCGTC GATGCCGCAG
GACCGCCCGT CGGCCCGCGA GGTCGCGGAC CTGCTGGCCG CGCACCTGGC CGAGGAGCGC
AAGCGCGCGG TGCGTCTCGC ACCAGGTGTG AAGATCGGCA AGTACGAGCT GGTGCGCCAG
CTCGGCGACG GCGCCACCGC CACCACCTGG CTCATGATCG ACCGGGCGTT CGACATGCAC
CACACGCTCA AGATCATGAA GTCGGCCGAG CTGGTGGACA ACCTGCGCGG CGAGTTCGTC
CTGCTGCACA GGCTCAAGCA CGACCGGATC GTCCGGGTGC ACGACTACCT GCTCGAGCCG
CCCGGCCCGG CGCTGATCTC CGAGTATGTC GAGGGCAGGA CCGTCGCCCA GCTCGCGCCG
AGGCTGCGCG GCAGTCTCGG CGCCGCCCGC AGGATTCTCG CCGACATCCT CGACGCGCTC
GAGTACGCCC ACAACCGCGA CGTCCTGCAC CGCGACCTGT CCCCGAACAA CATCATCGTC
GATGCGGACG ACCGGGCGAC GTTGATCGAC TTCGGGGTCG CGTCCCGAGC CGAGGCCGAC
GCGCACAGCC TGGTCGGCAC ACCGCCGTAC CAGGCGCCGG AGATCGCGGC CGACGGCGCC
TGGTCGCCGG CGGCGGACCT GTACTCGCTG GCCGTCCTGG TCTTCGAGGC GCTGGTCGGC
CGCGCGCCGT ACGCGGTCGA GGGACGGAAC CGCAACAAGT ACCTTCTCGT CCCACCGACG
CCGGCGGAGG CCGAGGCGGC GGGCACCACC GGCACCATGT TCCTGGAGGT GCTGGCCCGG
GCCGGCGACC CGAACCCGGC GACCCGCTTC CCTTCCGCCG CCAAGTTCCG TGACGCGATC
GAGGCGGCGT TCGAGCCGCC TACGCCCTCC CTGCCGCCGG CGGGCACGGT TCCGGAAACG
CCGGCCATCC CAGCGGGGGG CTCGCCAGTG ACGGTGACGC CGCCGCCCAC GACGGCGACG
GTGACGCCTA CGACGGTGCC GGTGCCGCCA GCCGTCTCGG CCCAGCGCGG CCTGGTCGTC
AATCCGGCCG TCGACGAGGT CCGCCAGCTG TTCCGCAACA GCAGGCTCGG TAACCGGGGC
AACCGGGGCC TGGACTCGGA CTTCGCCGAC CAGACCTACA TCCCGACCCG TCTCGACGAG
AAGCTGATGC CGCGGATCAT CGACGGCCGG CCACGGCTGG TCGTCTTCAG CGGCAACCCG
GGCGACGGGA AGACGGCCTT CCTGGAACGG CTCGGCGTCG AGCTGCGCCG CCGCGGCGCC
CGGGTCCTGG AGAGCGACGC GGCCGGCTGG CGGATGACGC TCGACGGGCA CGAGTTCGCC
TCGGTCTACG ACGCCAGCGA GTCGCACGAG GGCATATCGG CGGACCAGCT GCTGCGCCGC
GCGCTCGACC CGCTGAACGA CCAGTCCGGT GCTGGCCGCT ACACCGCGCT GCTTGCCGCC
AACGACGGCC GCCTTCTCGA CTTCTTCGAC CGCGACGCCC ACCGTTACCC GGAGATCGCG
AAGCTGCTCG GCGGCGGCGG CGGCGACGAC GGCAGCGGCA GCGGCCAGGC TGCCGACGCC
GCGGGAGTGC TGCGGATCGA CCTCAAGGGC CGGTCGATGG CGTCGGTGGG CCTGGCGGCC
GGCTCGCTGA CGGTGCGGAT GCTCGACGCG CTGCTCACCG ACGAGCTGTG GGCGCCGTGC
GGCGGCTGCG CGGCCGAGCA CCGCTGCCCG ATCGCGGCCA ACGCCCGCGG CCTGCGCGAC
GGGCGCGTCC AGGAGCGGCT GCACCGGCTG GTGCTCACCT CGCACCTGCG TCGTGGCCGG
CGTCCCACGA TCCGCGACCT GCGTTCGGCG CTCGCCTACC TGGTGACCGT CGACCTCGGA
TGCGCCGACA TCCACGACGA GTTCGGCGAC GGCTTCGGGA ACCCCGCCCG CCCGGAACGG
CACTTCAGCG CGGCCGCGTT CGACGCCGCG GCCGGGCACG ACCGGCTGCT CGGCGAGTGG
CGCAGCCTTG ACCCGGCGGC GGTGCCGGCG CCGCGGGTGG AGCGGTGGCT CGCACCCGGC
ACCCGTTCCA CCGCGGACAT CGCCCGCGCC AAGCGGCTGC GCTACTTCAC CGCGACCGAC
GCGGAGCTCG ACGCCGTCGA CCACCTCGGC CCGTACCGGC ACCTCGACGA GTTCCGTGCC
CTGCTCACCA GCGCCGGCCC CGACCCCGGC ACCGACGGTG CGGCGCGCCG GACCGCGGTG
GTGTCGAAGC TGCTGCGCGG CCTGTCGCGC TGCGTCGGGC CGGTCGGCTA CGACGGCGAC
GGGGTCGCGG TCTCGGTGGG CGAGCCTTCC GAGGACGGCG GCGCGGTCGT CAAGGTACTG
CCAGCCGACG AGTTCGAGGT CGTCGTGGCC GATGTGGACG ACCGGTTCGT CGAGACGGCG
GCCGACATCG TCGTGCTGCG GCACCGTTCC GGCCAGGCGG ACCTGCCGGT CGACGTCGAC
CTGTTCGAGC TGCTGATGCG CGCGAACGCC GGCCTCCTGC CGACCAACAC CGAGGTCGCG
CCGCTGCTCG AGGAACTGGG CCTGTTCCGC GCAAGCCTCA CCCTCCAGCC CGCGAGCCGA
GTGATCATCA TCGAGCCCAA CGGCCAGCGG AACGTCATCA AGGTGACCGG AACGACCATC
GAACTGCTGC GGGACGCCCG ATGA
 
Protein sequence
MGARIVVHGP WSGPGEEATA VYLRDHLDER WTVVCGRQLV AGSGTRDSDF IVVGPNSVTV 
IEEKSWRVDL IGTEERWYER RTLEERGNPI NQAVGVARIL AGNLNRFNKP LASALRSAAP
PGQRQLHLVH PLVVMSSPEA TFQIEDPRAA SQVVRLAGCE KRLTEIDRSV AAYFDLAPFR
DAVLSVVTGL RPRPETPETL GAYKIVGNIE PTPRGRRYVG RHPGGSVRVL ITYERQGLTE
AELKESDRLI LREYDALQRL AHLHRVFPVD PYFGVNDDQM WVAPMHPPGP KQKTLRNLIR
AGRQPSAATF ATVAADAFAG LADIHAAGIT HRALHPSRVW VTPETNRVVF SDFLIAKIAD
ARTVLDADDV DHDGGPFRAP ECRRTAHLAI NPSDVYSLAL CLLDWWELRQ SDPPREPQAP
PSLPDRVREA LNACFASMPQ DRPSAREVAD LLAAHLAEER KRAVRLAPGV KIGKYELVRQ
LGDGATATTW LMIDRAFDMH HTLKIMKSAE LVDNLRGEFV LLHRLKHDRI VRVHDYLLEP
PGPALISEYV EGRTVAQLAP RLRGSLGAAR RILADILDAL EYAHNRDVLH RDLSPNNIIV
DADDRATLID FGVASRAEAD AHSLVGTPPY QAPEIAADGA WSPAADLYSL AVLVFEALVG
RAPYAVEGRN RNKYLLVPPT PAEAEAAGTT GTMFLEVLAR AGDPNPATRF PSAAKFRDAI
EAAFEPPTPS LPPAGTVPET PAIPAGGSPV TVTPPPTTAT VTPTTVPVPP AVSAQRGLVV
NPAVDEVRQL FRNSRLGNRG NRGLDSDFAD QTYIPTRLDE KLMPRIIDGR PRLVVFSGNP
GDGKTAFLER LGVELRRRGA RVLESDAAGW RMTLDGHEFA SVYDASESHE GISADQLLRR
ALDPLNDQSG AGRYTALLAA NDGRLLDFFD RDAHRYPEIA KLLGGGGGDD GSGSGQAADA
AGVLRIDLKG RSMASVGLAA GSLTVRMLDA LLTDELWAPC GGCAAEHRCP IAANARGLRD
GRVQERLHRL VLTSHLRRGR RPTIRDLRSA LAYLVTVDLG CADIHDEFGD GFGNPARPER
HFSAAAFDAA AGHDRLLGEW RSLDPAAVPA PRVERWLAPG TRSTADIARA KRLRYFTATD
AELDAVDHLG PYRHLDEFRA LLTSAGPDPG TDGAARRTAV VSKLLRGLSR CVGPVGYDGD
GVAVSVGEPS EDGGAVVKVL PADEFEVVVA DVDDRFVETA ADIVVLRHRS GQADLPVDVD
LFELLMRANA GLLPTNTEVA PLLEELGLFR ASLTLQPASR VIIIEPNGQR NVIKVTGTTI
ELLRDAR