Gene Franean1_5936 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5936 
Symbol 
ID5674257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7209247 
End bp7212336 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content74% 
IMG OID641244784 
Productserine/threonine protein kinase 
Protein accessionYP_001510186 
Protein GI158317678 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0192244 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGGCG AGCGCCGGGG CGAACCCGGT CCAGCCGGTC CACGCGCTCT CGTCGGGCGT 
CGGTACCGGC TGGACGGGGT GATCGGCCAG GGCGGCTTCG GTGTCGTGCA CCGGGCCACC
GACGAGCTGC TCGGCCGGCA GGTCGCCGTC AAGGAGGTCC GGCTCCCCAC CGACGGCAGC
GAGCGGGAGC GGGAGCTCGC GCGGGAGCGG GTGCTGCGGG AGGCGCGCGC CGCCGGCCGG
CTGCATCATC CCGGCGCGGT CGCGGTCCTG GACGTCATCG CCGAGGGCGA CCTGCCCTGG
ATCGTCATGG AATTCGTCGA CGGGCGGTCC CTGTCGGCGA TCATCGAGGA GCGCGGGCGG
CTCGGGGTCG GCGAGGTCTG CCGCATCGGC ATCAGCCTCG CCTACGCGCT GGAGGCCGCC
CACCGCCTGG GCATCGTGCA CCGTGACGTG AAGCCGAGCA ATGTCCTGGT GACAGCGGAC
GAACGCGCCC GGCTCACCGA CTTCGGGATC GCCGTCAGCC ATGGCGACCC TCGGCTGACC
AGCACCGGGA TGGTTCTCGG CTCCCCGGCC TACCTGGCAC CCGAGCGGGC CCGCGGCGAC
GCCGGAACGG CCGCCAGCGA CCGGTGGGGG CTGGGCGCCA CGCTGTTCAC CAGCGTCGAG
GGGGTCTCCC CGTTCCCCGG GAACGACCCG ATCTCCATTC TCGCCGCGGT GGTGCAGAGC
CGGCGCCGGC CGTTCCGCGC CGCCGGGCGG CTCGCGCCCG TGATCGACGA TCTCATGGCC
ACCCACCAGG CACGCCGGCC GAGTCTGGCG ACGGTGCGCT CGCGGCTGCG CGACATCCTC
GAACGCGGCG GGGACACCCG GCCCTCCCGG GCCCGCAGCC GACCATCCCG GCCCGCGGTA
ACAGCGATCA CTCCGACGTC GGGCACACCC GCGGCCCCCG CACCCGTCGC CCCCGCACCC
GTCGCCCCCG GGCCTCCGAC GGCCCCGATC CCGGCCGGGT TCGCGGCGGA CGGCGGTGAC
ACGTTCACCG CGGGCGGAGC ATCCACGGGC GGCGCATCCC CGGCCGACCA GGACACCGTG
GTCAGCCAGT ATGCCGGGCC CGACGAGACC ACCGTCCTCG GCGAGTCCGT CCTCGGTGGA
CCGGCGACCG CCGGCGCGCG CGCCGGTGAG ACGCACAACG CCATGGCGGA GGACGACGAA
ACGTCCGACG ACCACTCGTC CCCGGAGGAC GGCGGCGCGG ACCTGACCGC GCTGCTCACG
GGCACGACGA CAGCACCGGC CGGCGGCGCA CCCGCCGGCC CACCGCCAGC CGCCCCGGGG
CTGGCCGGCA CCGTGCTGAT GAACGCGGCG CCCAGCGAGG CACCACCGGA CGACACCATG
CCTGGCGAGG CACTGTCAGA CGACACCGTG CCCGGCGGCC CACTGTCGGA CGACACCCTG
CCCGGCGAGA CACCATCAGA CGACACCCTG CCCGGCACCC TGCCCGGCGA GGCGCCGCCG
AATGACACCG TGCCCGGCGG GGTGCTGGCG GACGACACCC TGCTGGCCGA GGCATCGCCG
GACGGAACCG TGCTGACCCG GACGTCGCCG GCGAGCACGG GCAACACCGG GTCCGACGCG
GACAATGACA CCCGCATCGA CATCGGTACA GACAGCGCGG CGGTCATCGC CGCGGAGGAT
GCCGGCTCCC CGATACCGGA CTCGGCGGCC GTCGAACCCA CGGCGAGCCG GGTCGGGGCA
GGCGAGCCGG CGACGAGCGA CGCACGGCCG GCGGGAACGG CGAGACAGTC GGGAACGGCT
GGTGCGCACC CGCGGTTGAC CGGTCGGCTG GAGCGGCTCC GGGCGATGGA GCGGCGGCGG
GGCGCGACGG TTGAGCGTCC CCGGGGCGCT GAGCGGCTGC GACCGGTCGA CCGCATGCGG
GCACTCGGTG GCATCCCGGC GCGGGCGCGG ACATCCCAGG ACTCGTTGGA TCCGATGACC
GGCGGCCTGC CGATACCCGG CGCCGGTGCG GCATCGGCCG CGACGGGAAC GGTGCCGGGC
CCGGCCGGTT CCGCGGGCAC CGAACCGGCC GCCACCACGG CCACCGGGAC AGCCAAGGCC
ACCGGGACAG CCGGGACGAG GAGTACGCCC GACGCCACGG GAACGCCAGG CACCGCCACG
GAACCGACGT CCGCTGTGGG GCGCCGGTAC TCGGCGGCCG CGGCGTTGTT CTCCCGGCCG
GGCAGGTCCA CGGCCTCCGG CGATTCCGGT CCGGGGATCT CCGCGGGTCC GTCGAGCGCA
TCCCGCGCCG CCGCCCATCC GCGACGGGAG CCGGAGCAGG AGCCGCCGAC GAGTGGGCAG
AACCTGTTCC GCCCGCCCTC CGAACGGCAG CCGGAGGAGC GGCGGCACCC GGGTGGGGTG
CTGGGGCGTC CGGACGGGGC GAGCGCCGTT ACGCCCAGCC GCGGACGTCG CCTGGCTGTG
ATCCTCATCG CGTCCGTCGT CGTGATCGCG CTTGCGGTCG TCGCCGCCGT CCTGGTGTTC
GGCGGGGGTG ACGACGGTGC GGAGCCGGCC GGAGCCGGGC CGCGCCCGAC CGTCCAGGCT
GACCAGGAAG CCGCAGCAGT CCAGGCTGGA GACCTGGTCG CGTCGAATTC TGAGCCGGTC
ACGGCGCCTC CGGGCTGGGT CTCCTACGTC GATCCGACAG GCTGGTCGAT CGCCTACCCG
TCGCGCTGGC AGCGGCGGCC CGGCCCCGGG GGCGAGGGGA ATACGGACTT CGTGGATCCG
GCTACTGGTA CGTTCTTGCG CATCGGGAGC ATCGCGAGTG CGAACACCTC CGCTATTGAG
GACTGGCGCA CGAACGAAAT CAGCTTCCGG GATCGAGTGC GGGACTACCG GCGGGTACGG
ATCGAACCAG GTGACGGGGC GGACGGCGCG ACACAGGCCG ACTGGGAGTT CACCTACGCC
TCCGGTGGGG GGACGGTGCA TGTGCTCAAC CGTGGCGCGG TGCGAAACGG GCATGGATAC
GCTCTGTACT GGTACACGGC CGAGGAGCTG TGGGAGGCGG ATCAGCCGCT CATGCGACAG
CTCTTCGCGA CCTTCAGGCC GGCCTCATGA
 
Protein sequence
MSGERRGEPG PAGPRALVGR RYRLDGVIGQ GGFGVVHRAT DELLGRQVAV KEVRLPTDGS 
ERERELARER VLREARAAGR LHHPGAVAVL DVIAEGDLPW IVMEFVDGRS LSAIIEERGR
LGVGEVCRIG ISLAYALEAA HRLGIVHRDV KPSNVLVTAD ERARLTDFGI AVSHGDPRLT
STGMVLGSPA YLAPERARGD AGTAASDRWG LGATLFTSVE GVSPFPGNDP ISILAAVVQS
RRRPFRAAGR LAPVIDDLMA THQARRPSLA TVRSRLRDIL ERGGDTRPSR ARSRPSRPAV
TAITPTSGTP AAPAPVAPAP VAPGPPTAPI PAGFAADGGD TFTAGGASTG GASPADQDTV
VSQYAGPDET TVLGESVLGG PATAGARAGE THNAMAEDDE TSDDHSSPED GGADLTALLT
GTTTAPAGGA PAGPPPAAPG LAGTVLMNAA PSEAPPDDTM PGEALSDDTV PGGPLSDDTL
PGETPSDDTL PGTLPGEAPP NDTVPGGVLA DDTLLAEASP DGTVLTRTSP ASTGNTGSDA
DNDTRIDIGT DSAAVIAAED AGSPIPDSAA VEPTASRVGA GEPATSDARP AGTARQSGTA
GAHPRLTGRL ERLRAMERRR GATVERPRGA ERLRPVDRMR ALGGIPARAR TSQDSLDPMT
GGLPIPGAGA ASAATGTVPG PAGSAGTEPA ATTATGTAKA TGTAGTRSTP DATGTPGTAT
EPTSAVGRRY SAAAALFSRP GRSTASGDSG PGISAGPSSA SRAAAHPRRE PEQEPPTSGQ
NLFRPPSERQ PEERRHPGGV LGRPDGASAV TPSRGRRLAV ILIASVVVIA LAVVAAVLVF
GGGDDGAEPA GAGPRPTVQA DQEAAAVQAG DLVASNSEPV TAPPGWVSYV DPTGWSIAYP
SRWQRRPGPG GEGNTDFVDP ATGTFLRIGS IASANTSAIE DWRTNEISFR DRVRDYRRVR
IEPGDGADGA TQADWEFTYA SGGGTVHVLN RGAVRNGHGY ALYWYTAEEL WEADQPLMRQ
LFATFRPAS