Gene Franean1_0631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0631 
Symbol 
ID5669048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp731245 
End bp732879 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content74% 
IMG OID641239558 
Productserine/threonine protein kinase 
Protein accessionYP_001504996 
Protein GI158312488 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.108633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.449147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACAA ACGGCGTCGA GGTTCAGCCA CCGCGACCGA GTGACCCCGT CTCGCTGGGA 
CGGCACCGGG TGCTCGGCCG TCTCGGCGCC GGTGGCATGG GCGTGGTCTA CCTGGCCGAG
GGCCCGTTCG GCCGGGTCGC GGTGAAGCTC GTCCGCGAGG AGCTCGCCGA CGACCCGGAC
TTCCGCCGCC GGTTCCAGCG CGAGGTCCAG GCCTGCTTCC GGGTGGGCGG CGCACACACC
GCCCGCCTGG TCGACTTCGA GCTGGACGCC GACCGGCCCT GGCTGGCGAC CGAGCTCGTC
GACGCGCCGA GCCTGTCGAC CCACGTCCGG TTACACGGCC CGCTCGGCCC GGACGAGCAG
ACGGTGCTCG CGGCCGGGCT CGCCGACGCG CTGATCTCGA TCCACGCGGC CGGGATGATC
CACCGGGATC TCAAGCCGTC GAACGTCCTG TGGACGGCGC ACGGGCCGAA GGTCATCGAC
TTCGGCATCG CCGCGGCGGC CGACGCCGCG GTGATCACCC TCAGCGGGCA GTTTGTTGGC
ACTCCCGGCT GGCACGCCCC GGAGCAGGTC TCCGGCGCGG AGGCGACCCC CGCCGCCGAC
ATCTTCGCCT GGGGCGCGCT GCTGTGCTTC GCCGCGACCG GCGAGGCGCC GTTCGGCACC
GGGCCCAGCG CGGCGGTGCT GCACCGGGTG CTGGAGGCCG AGCCGTCCAC CGACCTCGGA
CGGATCACGC CGGGGCTACG CCAGCTCGTA TCCAGCGCGC TGGCCCGCGA GCCGGCGGAC
CGGCCCACCG CCGAGGCCCT GTTCACCGAG CTGGTGGGTT CGCTGCCCGA CGGCGCCGGC
GGCTCGGCCA CCCGGTTCCT GCAGACACGG CCGGTGGTGG CGGCGACACC GCCGTGGTCG
GCGACCATCC CGCCGACAGG CGACAAGGAC GACCCGGCAC CCGGCCAGCC AACGGCTGGT
CAGCCAACGC CAGGCCAGTC GGCGTCCGAT CAGGCACGGC CCGGCCAGTC GACGCCGCAG
CAGCCACAGC CGGGACGGCG CAGGCGGCGC CGGCTGGTCA TCGCGCTCGC GGCGGGCGTG
GTCGTGCTGG CCGCAGGCGG GACGGCGGCC GTCCTGCTGA CCTCCGACGG CGACGACGGC
ACGTCCGACA CCACCTTCAC CTCGGACGCG CCGTGGCGGC TGCGGATCTC CGACGAGATC
CAGGGCACCG ACGACACCGG CTGCGCGGTC ACCGTCACCG ACACCGCGAC GGGTGACAGC
CGCTCGATCA CCGGCGTCTA CGGGGAGAAG ATCTACCAGG TCTCGCAGGT CGGGACGTTC
CGCCTCGGGG CTGACAACCC AGGCTGTGTG GTCCAGGGCC TGGAGGCCGC GGGTGACGCG
TCGCTGCCGT TCGCGACAAC CTCGTACACC GGTGACACCG AGGCCTTTCG CACCGAGGGC
ACGGTCACCG TCAAGGTCGT CGACTTCGGC GGCAGCGAGG ACTGCGCCCT CGAGCTCCAC
TCGGCCACCG ACGGCCGGCT GCTCAACTTC GGCACCGCCA GCCCGGACAA CCCGGTAATC
ACCCTGGACG CCGGCGGCCC CGACCAGGTC TACCTCGCCG AGCCGCCCTG CGGGCTGCGG
GTCCTCCCCG GCTGA
 
Protein sequence
MSTNGVEVQP PRPSDPVSLG RHRVLGRLGA GGMGVVYLAE GPFGRVAVKL VREELADDPD 
FRRRFQREVQ ACFRVGGAHT ARLVDFELDA DRPWLATELV DAPSLSTHVR LHGPLGPDEQ
TVLAAGLADA LISIHAAGMI HRDLKPSNVL WTAHGPKVID FGIAAAADAA VITLSGQFVG
TPGWHAPEQV SGAEATPAAD IFAWGALLCF AATGEAPFGT GPSAAVLHRV LEAEPSTDLG
RITPGLRQLV SSALAREPAD RPTAEALFTE LVGSLPDGAG GSATRFLQTR PVVAATPPWS
ATIPPTGDKD DPAPGQPTAG QPTPGQSASD QARPGQSTPQ QPQPGRRRRR RLVIALAAGV
VVLAAGGTAA VLLTSDGDDG TSDTTFTSDA PWRLRISDEI QGTDDTGCAV TVTDTATGDS
RSITGVYGEK IYQVSQVGTF RLGADNPGCV VQGLEAAGDA SLPFATTSYT GDTEAFRTEG
TVTVKVVDFG GSEDCALELH SATDGRLLNF GTASPDNPVI TLDAGGPDQV YLAEPPCGLR
VLPG