Gene Franean1_5424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5424 
Symbol 
ID5673755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6562333 
End bp6563889 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content71% 
IMG OID641244279 
Productserine/threonine protein kinase 
Protein accessionYP_001509685 
Protein GI158317177 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.178925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.196392 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAGGAC AGCGGACGGT CATCGCGGGG CGATACGAGC TGACCGCGCC CATCAGCCGA 
GGCGGGATGG GGCAGGTCTG GCACGGATAC GACACCGTTC TCGACCGCGA CATCGCGGTC
AAGCTGATCA GACCGGAAAT CGTAGAGTCC GCGGACCGCG CCGGGTTCAT CAGCCGGTTC
CGCCGGGAGG CGCGGGTCAC CGCGAAGATC GAACACCCCG GCGTGCCCAC CGTCTACGAC
GCGGCGTTCG ACGAGACGTC CGACATGCTC TACATCGCGA TGCAGCTCGT GCACGGAGTG
TCCCTCTCCG ACCTCCGGCA CGAGCATGAC GGACCGCTCC CCCTCGACTG GGTCGTGTGC
ATCGCCGCGC AGATCTGCGC CGTGCTGTCG CACGCGCACG CGATCCCGGT CATCCACCGC
GATCTCAAAC CGCAGAACGT GATGGTCGAC CGCGCGGGAA CCGTCAAGGT GCTCGACTTC
GGCATCGCCG CCGTGCTCGG CACCGACGTC GCCCAGCTCA CCACCACCGG GCAGGTACTC
GGCACCAAGC CGTACATGTC CCCTGAACAG ATCAAGAGCA CGCCGGTCAG CCCCCGCAGC
GACCTCTACT CGCTGGGCTG CCTGCTGCAC GAGCTGCTGG CCGGGCAGCG GGTGTTCCGC
GCGACGGACG AGCTGTCCCT GATCTACCAG CACCTCCACA CCTCGCCGAC GCCCGTGCGC
GAGCTGCGCC CGGACGTGCC GGCCAGCCTG GAGGCACTCG TCCTCGACCT GTTGGCAAAG
GAGGCCGTCG ACCGGCCAGC GAACGCCTGG GAAGTGCATG ATCGGCTCGT GCCGTTCCTG
CCCGCCGCCG CACCGGGCGC ACCCGACTAC TCCAGGGCGC CGGCCGCGCT GCCCGACCCG
ACGCGCCCAT ACCGCCATCC AGGAGCACCG CTCCCCCGGC CACGGCCTCC CGCCGGTTCT
GGTGTCGCCG ACGCTTCCAC CACGATCCTC CGCGACGACG AGCTGGACGC CGCCCACAGC
CAGGCCGTGG AGCTGCTCGA AGAGGAACGG TTCTCGCAGG CCGCCGATCT TCTCGCCGGC
ATCCTCGCCG CGGCCGCCGC CCGCCGCCCG CCGACGGACC CGCGGCTGTT GGATCTGCGC
CTCACCCTCG CCGCCGCCCG GTTCTACGGC GGCGACTTCC GCCAGGCCCT CCCCGAGCTG
GACCATCTCG CGGAAATCCT CGCAGACGTG CGGGGCCCGC AGGACGAGGA GGCGATCAAC
TGCCGTCGAC AGGCCGCCTT CTGCCATGCC GAGCTCGGCG ACTTCGACCA CGCCCTGCGT
GACGTCCTGC GGCTCACCGA ACCGATCGTC GGCAGGTACG GGCCCGACTC CGAGCCCGCG
CTGGCAGTGC GCCTGGAGAT CGCCCGGTTC CAGGCGGCAG CCGGTCACGC CACCGATGCC
CAGGTGACCC TGCGCGGCCT CCACGCGGAC GCCGGTCGCC TCCTCGGTCG CCAACACCCC
GTCACCCAGC AGGCCGCCGG CCTCCTCGCC CGCCTCCGCC CGCAGCCCGA CATCTGA
 
Protein sequence
MPGQRTVIAG RYELTAPISR GGMGQVWHGY DTVLDRDIAV KLIRPEIVES ADRAGFISRF 
RREARVTAKI EHPGVPTVYD AAFDETSDML YIAMQLVHGV SLSDLRHEHD GPLPLDWVVC
IAAQICAVLS HAHAIPVIHR DLKPQNVMVD RAGTVKVLDF GIAAVLGTDV AQLTTTGQVL
GTKPYMSPEQ IKSTPVSPRS DLYSLGCLLH ELLAGQRVFR ATDELSLIYQ HLHTSPTPVR
ELRPDVPASL EALVLDLLAK EAVDRPANAW EVHDRLVPFL PAAAPGAPDY SRAPAALPDP
TRPYRHPGAP LPRPRPPAGS GVADASTTIL RDDELDAAHS QAVELLEEER FSQAADLLAG
ILAAAAARRP PTDPRLLDLR LTLAAARFYG GDFRQALPEL DHLAEILADV RGPQDEEAIN
CRRQAAFCHA ELGDFDHALR DVLRLTEPIV GRYGPDSEPA LAVRLEIARF QAAAGHATDA
QVTLRGLHAD AGRLLGRQHP VTQQAAGLLA RLRPQPDI