Gene Franean1_3742 details

Gene Information

Locus tag: Franean1_3742
Symbol:
ID: 5672107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4429599
End bp: 4432019
Gene Length: 2421 bp
Protein Length: 806 aa
Translation table: 11
GC content: 75%
IMG OID: 641242623
Product: WD-40 repeat-containing serine/threonine protein kinase
Protein accession: YP_001508043
Protein GI: 158315535
COG category:
    [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID:
    [COG0515] Serine/threonine protein kinase
    [COG2319] FOG: WD40 repeat
TIGRFAM ID:
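
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
length = gene_length_bp(4429599, 4432019)
print(length)                     # 2421
print(protein_length_aa(length))  # 806
```

Both results agree with the listed gene length (2421 bp) and protein length (806 aa).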


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0575594
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGTGA GCTCGTCCAG CCTCGGTGGA TCCGATGGGT TCGACGGCGC CGATCCGGTG 
CGGGTCGGGC CGTACCGGCT GCTGCGCCGG CTCGGCGCGG GCGGCATGGG CACCGTCTAC
CTCGGGCTGA ACGGTTCCGG GCGGCAGGTG GCCGTCAAGC TCGTCCGACC CGACCTGGCG
CGGGTCGCCG AGTTCCGCGA GCGGCTGAAG CGGGAGGCGG ACAGCGCCCG GCGGGTCGCC
CGGTTCTGCA CCGCGGCGGT GCTCGACGTG AACGTCACCG CCGACGTGCC CTACCTGGTC
ACCGAGTTCG TGGACGGCCC GACGCTGGCC GAGGTGGTGC GCGAGCGCGG GCCGCTGCAC
CCCGCCGAGC TGCACCAGCT CGCCGCCAGC ATGACCACCG CGCTGATGGC GATCCACCGC
GCCGGCATCG TCCACCGCGA CCTCAAGCCC AGCAACATCG TGCTCTCCCG GCTCGGCCCG
AAGGTCATCG ACTTCGGGAT CGCCCGCGCG CTCGACACAG CGTCCGTCCT GAGCGGGGAG
CACCCCGTCG GCACCCCCGC GCTGATGGCC CCCGAGCAGG CCCGTGGCGA CACGATCACC
TCCGCGGCGG ACGTCTTCGC CTGGGGCGGG GTGCTCGTGT ACGCCGCCAC CGGCCGCTAC
CCGTTCGGCA CCGGCCCGGC CGCCGCGCTG CTCTACCGCA CGGTCAACGA CACCCCCACG
CTCGACGGCT TCGAGGACTC CCTGCGTCCG CTGGTCGAGG AGGCGATGCG CAAGGCCCCG
GCCGATCGGC CCACCGCCGA GCAGCTCTAC GCCCGGCTGC TCGACATGCG CGTGGACGAG
GCCGAACTAC CCCTGGGACC CCCGCTCGCC GAGGTCGCCG CGCTCGTCCA GCCGGTCGCG
CCCGGAACGC CTGGGACCCC TGGGACGCTC GGGAATCCGC CGACGCCGAC GCCGTCGTCA
TCGTCATCGT CATCGGTGGA GACGCAGGCC GTCCCGCCGA CCCCGGTTAC CGTTCTGTCC
CCGCACGGCT CGCCGGCGCC GGCGCCGTCC ACGGCCGGTC GCGCCGGAAC GGATCGTCGC
GCCGGAACGG TCGGGCGGGC CAAAGCGCAC GGGCGGAGCC AGGGCAGGCG GAGCCGACGT
GGGTGGCGCG GGATCGCCCT GCTCGCGGCG ATCCTCGCCG CGGTCATCGC GACGGTCGTG
CTCGTCGTGG TCGACAACCG TGGCGCGCCC AGCCCGACCG GTGTGCCCGA GCAGGTCGCG
GCCAGGGCGC TGCGGCTGCA GTTCCAGGAC CGTCCACTCG CCCGCCGGCT CGCGCTCGCG
GCCTACCACG CCGCGCCGGA GTCGGCGTCC ACCCGCAACG CGGTGATCGG CCTGTTCGCG
GCCGACGTCG ACCCGGTGGT GGCCCCGCTG GGCAGCCGCG TCCTGAGCGC CGCTCTGCGT
CCCGACGGCC GGCTGCTCGC CGCTGGGACG GAGGCCGGGA CCATCGAGCT CTGGGATCTC
ACCGACCTCG CCCATCCCGT CCACGCCGGC ACCATTTCCG GGGTCGGTGA CTGGGTCTAC
TCGGTGGCCT TCAACCCGGG CGGGAACCTG CTCGCGGCCG GTGTCGGCGA CGGAGCCGTC
CGGCTGTGGA ACGTCACCGA CCCCGCCCGC GCCGGCGCGC TCGCCACCAT CGCGTTCCAC
CGCGACCGGG TGCGCTCCGT CGCGTTCGCC CCGGACGGCG GCACCCTCGC CTCCGGCGGC
GACGACGGCC AGGTCGGCCT GTGGGCGGTG ACCGACCCGT CCCATCCGCA GCGGCGGTCG
GCGACCGACG GCGCCGTCGC CGGGATCCGG TCGCTCGCGT TCTCCCCGCG CGGCGGCCTG
CTGGCCCTCG CCGGCAACGA CGGCTCCGTC CGGCTGTGGA ACGTCGCCGA TCCGGCGCGG
CCCGCCACCA GCTCCACCCT GCGTGGCACC GGCCGCACGG TGCAGTCCGT CGCCTTCTCC
GCCGACAGCT CGACCCTCGC CGCCGGCGGC ATCGACGGCT CCGTGCACAC CTGGCGGGTC
GACGGCCCGG GATCGGTCGT CGACCTCGGC TCCACCCCCG GCGGTGTCGG CGGGGTGACC
AGCGTCGGCT TCAGCCCGGA GGGCGCCATC CTGGTCTCGG CCAGCGAGGA CGAGACCGTC
CGGCTCACCG ACATCTCCGC CCCGGCGGAC CCCGTCCTGC TCACCGACCT GCGCGGGCAC
ACGAAGGCGG TGAGCGCGGC GATGTTCGTC CCCGGTGGGC GGACCGTCGT GTCGGCCAGC
GGCGACGGCT CCGTCCGGCT GTGGACGGTC GACCCCGCGG CGCTCACCCG CAACGCCTGC
GCCGACCCGT CAAACCAGAT CGGCTCCAAG GACTGGTACG CCAACTTCCC GAACACGCCC
TACGCCCCAC CCTGCCCCTG A
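
A GC content figure such as the one in the Gene Information section is conventionally the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is supplied as text in the 10-base blocks shown above; the function name `gc_content` is illustrative, and the example input is only the first ten bases of the gene sequence, not the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters in seq.

    Spaces, line breaks, and any other characters are ignored, so the
    block-formatted sequence can be passed in as-is.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First block of the gene sequence: 5 G + 1 C out of 10 bases.
print(gc_content("GTGACGGTGA"))  # 0.6
```

Passing the whole gene sequence to the same function would give the gene-wide value for comparison with the reported percentage.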
 
Protein sequence
MTVSSSSLGG SDGFDGADPV RVGPYRLLRR LGAGGMGTVY LGLNGSGRQV AVKLVRPDLA 
RVAEFRERLK READSARRVA RFCTAAVLDV NVTADVPYLV TEFVDGPTLA EVVRERGPLH
PAELHQLAAS MTTALMAIHR AGIVHRDLKP SNIVLSRLGP KVIDFGIARA LDTASVLSGE
HPVGTPALMA PEQARGDTIT SAADVFAWGG VLVYAATGRY PFGTGPAAAL LYRTVNDTPT
LDGFEDSLRP LVEEAMRKAP ADRPTAEQLY ARLLDMRVDE AELPLGPPLA EVAALVQPVA
PGTPGTPGTL GNPPTPTPSS SSSSSVETQA VPPTPVTVLS PHGSPAPAPS TAGRAGTDRR
AGTVGRAKAH GRSQGRRSRR GWRGIALLAA ILAAVIATVV LVVVDNRGAP SPTGVPEQVA
ARALRLQFQD RPLARRLALA AYHAAPESAS TRNAVIGLFA ADVDPVVAPL GSRVLSAALR
PDGRLLAAGT EAGTIELWDL TDLAHPVHAG TISGVGDWVY SVAFNPGGNL LAAGVGDGAV
RLWNVTDPAR AGALATIAFH RDRVRSVAFA PDGGTLASGG DDGQVGLWAV TDPSHPQRRS
ATDGAVAGIR SLAFSPRGGL LALAGNDGSV RLWNVADPAR PATSSTLRGT GRTVQSVAFS
ADSSTLAAGG IDGSVHTWRV DGPGSVVDLG STPGGVGGVT SVGFSPEGAI LVSASEDETV
RLTDISAPAD PVLLTDLRGH TKAVSAAMFV PGGRTVVSAS GDGSVRLWTV DPAALTRNAC
ADPSNQIGSK DWYANFPNTP YAPPCP