Gene Franean1_7287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7287 
Symbol 
ID5675588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8900180 
End bp8902858 
Gene Length2679 bp 
Protein Length892 aa 
Translation table11 
GC content74% 
IMG OID641246124 
Productserine/threonine protein kinase 
Protein accessionYP_001511512 
Protein GI158319004 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0443497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.301808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTTATCG ACAGATCTCT CGTGGAGGCG GCGCTCCCCG GATACGCAGT CGAGGGGGAC 
CTCGGCAAGG GGGGCTACGG CCTCGTTCTC GCCGGGCAGC ACCGGCTGAT CGGCCGCAAG
GTCGCGATCA AGATCCTCCT TGACACGACC GACGACCCCG ACCTGCGCGC CCGTTTCCTC
TCCGAGGCTC GCGTCCTGGC CGAGCTCGAC CATCCGCACA TCGTGCGCAT CCACGACTAC
GTGGAGCACG AGGGAACCTG CCTGCTGGTG ATGGAGCTGC TCTCCGGCGG CACCCTCAAG
CAGCGGATGA CCGCCGGGAA GCTCTCCCCG GAGACGATCT GCGCCGTCGG CGTCGCCGCG
TCGGCCGCAC TCGCCACCGC GCACCGCCAG GGCGTGCTGC ACCGCGACGT CAAACCCGAC
AACATCATGT TCGACGGCGC CGGGCTGCTC AAGGTCACCG ACTTCGGCAT CGCGAAGATC
TTCGACGGCG CGGAGACCAC GGCCAGCGCC ATCCTGGGCA CACCGCGCTA CATGGCGCCC
GAGCAGATCA TGGGCGCTCG GCTGTTCCCC GCCACCGACC TGTACGCGCT GGCCGGCGTC
CTCTACGAGA TGTTCGCCGA GCGGCCGCTG TTCGGACGTC CGATGGCGGT GCAGCCGCTG
ACCCACCACC ACCTGACCGT GATGCCCGAC CCGCTCACGA CGGTGCCGCC GGCGATCTCG
GCGGTCATCA TGCGGACCCT GGCCAAGGAC CCGGGCGCCC GCTTCTCCGA CGCCTCCGAG
TTCGCCCTCG AGCTGGCCCG CGCCGGCGCC GGCGCGTACG GGCCGAACTG GGTGGCGCGC
TCGGACGTCA AGCTGCGGGT GGACGACGAG ATCCGCGGCG TGATCGGCGG CTCGACCACC
TCCGGGAACT CCGGCGGGTT CCCGGCCGCC ACGCGCCCGG GCGGCTATCC GGGTCCTGGC
GGGCCCGGCG GGCCCGGTTA CGCGCCCGGG CAGCCGGGCG GGCCCGGTTA CCGCGGCCCG
GTGCCGGGCT CCGGCCCGCA GCAGCCCGTC GGCGGCGGCT TCGGGCCCGG CGGGCCCCGG
CCCAACCAGC AGATGCGCCC GACCGGCGGG CATCCGTCCG CCTCGGGCTT CCCCGGCGCC
ACGCCGGTCA TGCCCGGGCA GGGTGTGCCG GGCGGGCAGG GCCGGCCCGG GCCGACTCCC
GGAACGCCCC CTGGCGGCTG GCGCGGCTAC GGCCAGCCCG GCGGGGCCAC CCCGCAGCCG
ATGCCGCTCA CGCCACCCGC CTCGACGCCC CGGCCCCCCG GCCCCATGGG CCCGCCCAGC
TCCACGCCGC CGCCGCGGCC CGTCCCCGGC TACGGCGGTC AGCCAGGGCA GCCCAGGCAG
CCCGTCCCGG GCCGGCCGGG GCCGGGCGGC CCGCGACCCA CCGGCGCCCA GAACTTCGGT
GGCCAGGGCT GGCAGGGCTC CGGCCCCCGC CCGCCCGCGC ACCAGGTCCC GCCCCGGCCA
CCCATGGGCG GCCACCGCCC CCGGCCGGGA CCCCCGAACC GCAACAACCG GACGCTGATC
CTCGCCGCCG TCGCCGCCGC CGTGGTCGTG GTGCTGACCA TCGGCCTCAT CGTGGGCCTG
TCGGGCGGCG GCAGCAGCGA CGACAGCGGT TCGAGCCGCG ACCAGGTGTC GACGCTCCCG
GTCCCCGGTG TCGCCTACAC CGGAACGCCG GTGACGGTGC AGGGCCTCAG CCCGTACAGC
GTCGCGATCG ACCCGCAGGG CACGCTGTTC ATCACGAGCC TGTCGTCCGA CCGGATTCAG
AAGGTCACCA GGACCGGGGA GGTCTCCGAC CTCGCCGGCA CCGGCGCGGA CGGATACAGC
GGCGACAACG GCCCCGCCAC GGCCGCCAAG CTCAACGGCC CCGGCTCCGC GGTGCCGGAC
AAGAACGGCA ACATCTACAT CCCGGACGCG CAGAACTACC GCATCCGGAA GATCACCCCG
GACGGGATCA TCACCACGAT CGCCGGCACC GGCACCGCGG GCTTCTCCGG CGACGGCGGC
CCGGCCACGG CCGCCCAGAT CAACAGCGCG GAGAAGGTCG CCATCGGCCC GGACGGCTCG
ATCTACATCG CCGACTACGA CAACCACCGC ATCCGGAAGA TCACCCCGGA CGGGATCATC
AACACGATCG CCGGCACCGG CCTCCAGGGC TACTCCGGCG ACGGCGGCCC CGCCACCGCG
GCCAAGCTGG ACGGCCCGAA CGACGTCGAG CTGGGCGACG ACGGAACGCT CTACATCGCC
AACCTCGGCA GCAACACCAT CCAGAAGATC ACCAAGGACG GGATCGTCAC CACGGTTGCG
GGCAACGGGC AGAAGGGCTT CTCCGGCGAC GGCGGCCCCG CCACCGCGGC CCAGCTCTCC
GTCCCGTCGG TGTCCCTCGG CAACGGCGGG GAGATCTACA TCGCCGACTA CGGGAACAAC
CGGGTCCGCA AGGTGGACCC GAACGGGACG ATCACCACGA TCGCCGGCAC CGGGGCCGAG
GGCTCCGGCG GCGACGGGGG CCAGGCCACG GCCGCCCAGT TCAACGAGCC CAGCTCGGTC
GCCGAGGACG CCGACGGCGC GCTCTACATC GCCGACTCGG GCAACAACCG GCTGCGCCGC
ATCGCTCCGG ACGGGACGAT CACGACGGTC GCGCAGTAG
 
Protein sequence
MLIDRSLVEA ALPGYAVEGD LGKGGYGLVL AGQHRLIGRK VAIKILLDTT DDPDLRARFL 
SEARVLAELD HPHIVRIHDY VEHEGTCLLV MELLSGGTLK QRMTAGKLSP ETICAVGVAA
SAALATAHRQ GVLHRDVKPD NIMFDGAGLL KVTDFGIAKI FDGAETTASA ILGTPRYMAP
EQIMGARLFP ATDLYALAGV LYEMFAERPL FGRPMAVQPL THHHLTVMPD PLTTVPPAIS
AVIMRTLAKD PGARFSDASE FALELARAGA GAYGPNWVAR SDVKLRVDDE IRGVIGGSTT
SGNSGGFPAA TRPGGYPGPG GPGGPGYAPG QPGGPGYRGP VPGSGPQQPV GGGFGPGGPR
PNQQMRPTGG HPSASGFPGA TPVMPGQGVP GGQGRPGPTP GTPPGGWRGY GQPGGATPQP
MPLTPPASTP RPPGPMGPPS STPPPRPVPG YGGQPGQPRQ PVPGRPGPGG PRPTGAQNFG
GQGWQGSGPR PPAHQVPPRP PMGGHRPRPG PPNRNNRTLI LAAVAAAVVV VLTIGLIVGL
SGGGSSDDSG SSRDQVSTLP VPGVAYTGTP VTVQGLSPYS VAIDPQGTLF ITSLSSDRIQ
KVTRTGEVSD LAGTGADGYS GDNGPATAAK LNGPGSAVPD KNGNIYIPDA QNYRIRKITP
DGIITTIAGT GTAGFSGDGG PATAAQINSA EKVAIGPDGS IYIADYDNHR IRKITPDGII
NTIAGTGLQG YSGDGGPATA AKLDGPNDVE LGDDGTLYIA NLGSNTIQKI TKDGIVTTVA
GNGQKGFSGD GGPATAAQLS VPSVSLGNGG EIYIADYGNN RVRKVDPNGT ITTIAGTGAE
GSGGDGGQAT AAQFNEPSSV AEDADGALYI ADSGNNRLRR IAPDGTITTV AQ