Gene Franean1_1302 details


Gene Information

Locus tag: Franean1_1302
Symbol:
ID: 5669715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1571428
End bp: 1573197
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 75%
IMG OID: 641240234
Product: serine/threonine protein kinase
Protein accession: YP_001505662
Protein GI: 158313154
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
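
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal Python sketch of that check, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1571428
END_BP = 1573197
GENE_LENGTH_BP = 1770
PROTEIN_LENGTH_AA = 589


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1770
    print(expected_protein_length(GENE_LENGTH_BP))  # 589
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.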


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0592381
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGAGC CGCTCATCGA CTCCGACCCG CGTGCCATCG GGCCGTACCG GCTGTCGGGC 
AGGCTCGGCA CCGGTGGGAT GGGCGCCGTC TACCTGGGGT TTGACCCCGC CCGCCGACCA
GCGGCGGTCA AGGTCGTTCG GCCTGACCTG CTCGGCGACG CCGAGTTCCG CGGCCGGTTC
CGCCAGGAGG TCGCCGCCGC CCGCCGGGTG CGCGGCTCGT TCGTCGCCGC CGTTCTCGAC
GCCGACGTCG ACGCACCGAC GCCGTGGATG GCGACCGAGT ATGTCGACGG GGTGAGCCTG
CAGGCGGCGG TGAGCCGGCG CGGGCACCTC GACGGGCCGA TGCTCGCCGG CCTCGCCGCC
GGTCTGGCGA ACGCGCTGGT GGCCGTGCAC GCCGCCGGCC TGGTGCATCG CGACCTCAAG
CCGTCCAACG TGCTGCTGGC CTGGGACGGC CCGAAGATCA TCGACTTCGG CATCGCCCGG
TCGCTCGACG CCACGAGCCA CACGACCGCC GGGGGCGTGC TCGGGACCGT CGCGTGGATG
GCACCCGAGC AGCTTCGCGG CGAGCGGGTC GGGCCACCCG GAGATATCTT CGCCTGGGCT
CTGTGTGTGG TCTTCGCGGC GCGGGGGCGG CACCCGTTCC CCGCCGACGC GCCGGCGGTG
TCCGCGATGC GCATGCTCGC CGACGACCCC GATCTGACCG GTGTGCCCGA CCACCTGCTC
CCGCTGCTCG CCCGTGCCCT CGCCAAGGAC CCGGGCCTGC GGCCGACGGC GACGCAGGTC
GTGTCCAGCC TCGCCGGGGT CGCGGTGGCC AGCCTCGACG AGGCCGACGC CGCGGTCCGC
GGACTGATCG GAACCGGCTG GGTTCCACCG ACGCCGCCAG GCGCCGGCCC CCCGCCCGAC
GTGTCGGACG TCCAGACCGC GGCGGCGCGG GCGGGCTCGG TGGCACCGGC GGGCTCGGTG
GTGGCGCCGA GCATCGGCAC GGTGGCGGAT CGGACCCTGC GGGCGCGCCG CCGCGGGACC
GGCGGCACCG GTGGCGGCCG CTACGGCCTG CTGACCGGTC TCGCCGCCGC CGGCCTGGCC
CTCGCCGTGC TCGCGGTCAC GCTCGCCGCG ACCGGTGTGG GATGGGACCG GTCGGCGGGG
AGCGGTGACC GGAGCGACCA GGACGCGCGG ATGCGGGTCG GGGCGCTCAA GGACATGGTG
CCCCCGGTGC CGGCGGACAA GGGGGCACGC GCGAGCCGGC CGGGCTTCCC GGCCGCGGAT
CCCGCGAAGC CGGGCGCGAC CCAGACCGCC TCCGGCTCCC CCGGCGTGGC GCCGGGCTCG
AATCCGCCGG GCCCTTCAAC GGGCACACCG GGTGTGCTGC CGTCGGCGAC CCCCGCGCCC
GCGCCGTCCC AGACTCCGCA GCTGCCGCCG GCCACGCCGC TGTACCGCTA CTACAACGGT
CAGGATCACG CCTCGCTCAC GAGCGCGGCG CCGGCGGCCG GCTTCCACCT GGAGCATGTG
CTGGGCAACC TCTTCACCAG CGGTGACGCC CCTGGGACGG TTCCCGTCTA CACGTGCCTC
AACGGAGCGG AGTCGTTCAC CTCGCTGGCG TCCTCATGCG AGAGTCGCAC GCCGGTGGGC
GTGCTGGGCT GGATCTACGC GGCGAAGCCG GCGGAGCCGG CCACCCAGGC GGTGTACCGG
TGCCGTATCG GCGAGGATCA CTTCGAATCA CTCCAGGCCG ACTGCGAGGG CCAGACAGCG
GAGGGGCTCA TCGGCTACGC GCCGCCCTGA
 
Protein sequence
MLEPLIDSDP RAIGPYRLSG RLGTGGMGAV YLGFDPARRP AAVKVVRPDL LGDAEFRGRF 
RQEVAAARRV RGSFVAAVLD ADVDAPTPWM ATEYVDGVSL QAAVSRRGHL DGPMLAGLAA
GLANALVAVH AAGLVHRDLK PSNVLLAWDG PKIIDFGIAR SLDATSHTTA GGVLGTVAWM
APEQLRGERV GPPGDIFAWA LCVVFAARGR HPFPADAPAV SAMRMLADDP DLTGVPDHLL
PLLARALAKD PGLRPTATQV VSSLAGVAVA SLDEADAAVR GLIGTGWVPP TPPGAGPPPD
VSDVQTAAAR AGSVAPAGSV VAPSIGTVAD RTLRARRRGT GGTGGGRYGL LTGLAAAGLA
LAVLAVTLAA TGVGWDRSAG SGDRSDQDAR MRVGALKDMV PPVPADKGAR ASRPGFPAAD
PAKPGATQTA SGSPGVAPGS NPPGPSTGTP GVLPSATPAP APSQTPQLPP ATPLYRYYNG
QDHASLTSAA PAAGFHLEHV LGNLFTSGDA PGTVPVYTCL NGAESFTSLA SSCESRTPVG
VLGWIYAAKP AEPATQAVYR CRIGEDHFES LQADCEGQTA EGLIGYAPP
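
The gene sequence above is printed in blocks of 10 bases, so it needs to be joined before values such as the GC content can be recomputed from it. The following is a minimal Python sketch of how that might be done, not part of the original record. The helper names are hypothetical, and the stop-codon set (TAA, TAG, TGA) is the one used by translation table 11. The short input in the usage lines is a toy string, not the gene sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(text: str) -> bool:
    """True if the cleaned sequence ends with a table 11 stop codon."""
    return clean_sequence(text)[-3:] in STOP_CODONS


if __name__ == "__main__":
    toy = "ATGC GGCC"  # toy input, 6 of 8 bases are G or C
    print(gc_fraction(toy))              # 0.75
    print(ends_with_stop("ATG CCC TGA"))  # True
```

Pasting the full Gene sequence block into `gc_fraction` would give the value to compare with the GC content field of the record.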