Gene Franean1_0059 details


Gene Information

Locus tag: Franean1_0059
Symbol:
ID: 5668485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 74912
End bp: 76456
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 68%
IMG OID: 641238988
Product: serine/threonine protein kinase
Protein accession: YP_001504433
Protein GI: 158311925
COG category: [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
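
The coordinates and lengths listed above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the record states neither, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 74912
end_bp = 76456
gene_length_bp = 1545
protein_length_aa = 514

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which encodes no residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # True
print(gene_length_bp % 3 == 0)                       # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the span (1545 bp), the codon count (515) and the protein length (514 aa) agree with each other.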


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGGCG TGACCGAGGT CGCCGAGATC CTCGGAGTCA GCCGACAGCG CGTCGCGAAG 
CTGCGGGATC GCGCCGACTT CCCCGACCCC ATCGCCGAGA TCGCGCAAGG ACCGATCTGG
GATCTTGACA AGATCGAGGA ATGGGGCGGG TCGGACCTGC GGCTCTCGCC CGGCCGGCCA
CGAGCCGACA CCGTGGCGCG GACCCTCGGC GGCCGGTTCG TTCTGGAGGA GCCGCGCATC
GGACACGGCG GCTTCGGTGA CGTCTATCGC GCGATGGACC GCAAGCAGCC TGGCCGCAAC
GCCGCGCCCG TCGCTGTGAA GCTGATGCGC GATGCCAGCA TGGTCGACCC GGAAGCAGTG
CGCCGCTTCG AGCGGGAGCT GCGGCTGCTC GAGCCGATCA GGCACCCGAA CATCGTCCCG
ATTCTCGGCC ACGGCAAGAC CCCGGCAGGC GCAATTTGGT ACGCCATGCC TCTTGCGCAG
GGCAGCCTGG TCGACTTCGC CGAGGAGCTC CACGGCAGGA ACGCCCTGAT CCTCGACCTA
ATGCGGCAGG TCGGCGCGGG CCTGACGCAC ATCCATGATC GAAAGATCTA TCACCGGGAC
CTCAAGCCCG GCAACATCCT GCGCCTAGCG GACGGCGTTT GGGCGATCGC GGACTTCGGG
CTGGCTGTCG ATGCGGAACG CGCTACCACC GCGCTCACGT CCACGCTCCG CGGGATCGGA
ACGGCATGGT ACACGGCGCC GGAGCAGTGG CGCGACGCCC GCAATGTCAA CCACCTGGCA
GACGTGTTCG GCCTCGGCAA GGTCCTGCAG GAGCTCGTCA TCGGCGATGC ACCGGTCACC
AACGAGGTCC CACCCGGACC GCTACGCCCG ATCGTGCAGA GAGCGATCGC GGAGCGGCCC
GAGCACCGCT ACGCCTCCGT CCGGGACTTC CTCGCCGCGC TAGCGACCGC GATCGAGACG
CCGAGGGACG GCTGGGAAAG CGCCGAGGGC ACCGCCGAGC GGTTGCTTGA ACGGGTCAGG
CTACCCAAGG CGGCTGAGGT AGACCTCGAC GAGCTGGCGA CCTGGGCGCT CGCTCTCGAC
GAGAGCGACA CGGACGACAT GACGGCCCTT GCCCGGGTTC TCCCCTGGAT CTCGACCAGG
TCGATTCACT ATCTCTGGGC CGCAGACCCC GCAGGTTTCC AGAGGATCTT CAGGCACTAT
TCGAAGCACG TCGAGACCAC CGGTTTCGGC TTCGAGTACT GCGACGTGCT CGCCGACTTC
TCCCGCAGGG CCGTCAAGGA AACCGACGAC TCAGACGTCC TTCGCGAGGC CATCCGATCC
CTGGTCGAGC TCGGCCACCG TCATAGCCGT TGGCGGGTGC GCGGCGTCGT CACGACGATC
TTGCAGGGCA TCCGCAAGCC GGAGCCCGCC CTCGCCGCCG TCGAGGCACT ACGCGCTGCC
GACGTGGAAG CCGTCGAATG GACGCTCAGC GAATTCTCGA TCCGCTCTCT ACCGCCCATC
CTCCGCAACG AGATCAACAT GCTGCTCAGC GCCGCCAGCC GCTGA
 
Protein sequence
MGGVTEVAEI LGVSRQRVAK LRDRADFPDP IAEIAQGPIW DLDKIEEWGG SDLRLSPGRP 
RADTVARTLG GRFVLEEPRI GHGGFGDVYR AMDRKQPGRN AAPVAVKLMR DASMVDPEAV
RRFERELRLL EPIRHPNIVP ILGHGKTPAG AIWYAMPLAQ GSLVDFAEEL HGRNALILDL
MRQVGAGLTH IHDRKIYHRD LKPGNILRLA DGVWAIADFG LAVDAERATT ALTSTLRGIG
TAWYTAPEQW RDARNVNHLA DVFGLGKVLQ ELVIGDAPVT NEVPPGPLRP IVQRAIAERP
EHRYASVRDF LAALATAIET PRDGWESAEG TAERLLERVR LPKAAEVDLD ELATWALALD
ESDTDDMTAL ARVLPWISTR SIHYLWAADP AGFQRIFRHY SKHVETTGFG FEYCDVLADF
SRRAVKETDD SDVLREAIRS LVELGHRHSR WRVRGVVTTI LQGIRKPEPA LAAVEALRAA
DVEAVEWTLS EFSIRSLPPI LRNEINMLLS AASR
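
The relationship between the gene sequence and the protein sequence above can be spot-checked with a minimal translation sketch. It is not the tool that produced this record. The codon table is built from the standard amino-acid assignments, which translation table 11 shares (the two tables differ in their permitted start codons), and the function name `translate` is hypothetical. Only the first and last blocks of the gene sequence are used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, with codons enumerated in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence listed above.
first_block = "ATGGGTGGCG TGACCGAGGT CGCCGAGATC"
last_block = "CTCCGCAACG AGATCAACAT GCTGCTCAGC GCCGCCAGCC GCTGA"

print(translate(first_block))  # MGGVTEVAEI
print(translate(last_block))   # LRNEINMLLSAASR
```

Both outputs match the corresponding ends of the listed protein sequence. The final line of the gene sequence begins at base 1501, so it is in frame.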