Gene Francci3_3395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3395 
Symbol 
ID3905977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4025043 
End bp4027232 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content72% 
IMG OID637880717 
ProductWD-40 repeat-containing serine/threonin protein kinase 
Protein accessionYP_482478 
Protein GI86742078 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0235207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.820289 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCAGC CTGCCACGGC GCTTCGTATC GACGACCCCA CCCACATCGG GGTCTACCGT 
CTGCGCGGCC GGTTAGGGGT CGGGGGGATG GGCGTGGTCT ACCTCGCCGA GGACCCGCAC
CATCATCCGG TCGCCGTGAA GGTGATCCGC GTAGAGTTCG CCGCCGACCC GGAGTTTCGC
GCCCGGTTCC GCCACGAAGC GGAAGCCGCC CGGCGGGTGC CGCGGTTCTG CACCGCCGCC
TTCCTCGACG CCGACCCCAA CGCCGAACGG CCCTACCTGG TCACCGACTA CGTGCCCGGC
CCGACCCTCG CCCAGGCCGC CCGCCGCCCG TTACGCGGCG CCGAGCTGGA ACAGGTTGCG
GTCCACATCG CAGTGGCGTT GACGGTCATC CACGGCGCCG GAGTCGTCCA CCGCGACCTC
AAACCGTCGA ACGTCATACT CTCGCCCACG GGCGCTCGCG TCATCGACTT CGGCATCGCC
CGCGCCCACG GCATGACCAT GTTCCACATC GACGAGCAGA TCGGCACCCC CGCCTACACG
GCCCCCGAAA ACATCGACGG CGCCCCACCC GATCCGGCCG CCGACATCTT CGCCTGGGGC
GGTGTCGTCC TCTACGCCGC GACCGGCCAA CCCCCCTTCG GGGACCGATC ATCCGAACTA
CTCCTGCACC GCATCCGGTA CGACCATCCC ACCCACCTAA ACCAGCTTCA CGGCCGACTA
CACGACACCG TCACCGCCGC CATGGCCAAA GCCCCCGAGC AGCGGCCGAC CGCCGAGCAG
CTATACCGGA TGCTCACCAG CACCCAACCC CCAACACCGC CCCCCGCCGC ACCCACCCGG
CACGGCAGGC AGGGACCGCA GCCCGCCCCA CCACCACCGG CAATCCCCCC GCACTACGCC
CGGCCCATAC CGCCGGCTCG CCGAACCCGG ATCACGATCC TCGCCACCGC GACCGCCGTG
GTCCTCGCCG CCGCTGTCCT GCTCATCACG GACCGCCCCC GGCACACCCC GACCACGGCG
AACCGCCCCG ACCGGCACGG CGCCGCCGCG ATCCTCGCCG ACCAGGCCAC CACCGATGAT
CCGGATCTCG CGGTCCGTCT CGCCGTCGCC GCCTACCGGC TCAACCCCGA CCCGGCCGCG
CGCATCGCCC TGCTGGCCGC CGCCGCCCGA GGTCTGCCCC CGCTGGCAAC GTTCCGCCAC
AGCGGCAAAA TCCTGTCCGT CGCCATCAGC CCGGACGGGC ACACCCTCGC CACCGGCAGC
GCCGACCACA CCGCACGCCT GTGGAACCCC ACCCACCCCG ACCAGCCGGT AGCCACCCTG
GGCCACGACG ACGGGGTGAA CCACGTGGCG TTCAACCCCG CCGGCACCCT GCTCGCCACC
ACCAGCGACG ACACCACCAT CCGCCTGTGG AACATCACCG ACCCCCACCA CCCCGACCAG
GTCAACACCC TCACCCTGCA CACGGGCGGA ACCCCCTACG GCGCCGCGTT CAACCCCGCC
GGCACCCTGC TCGCGATCAG CACCTCCACC GGCGCCGTGC TCCTAATCGA CATCACCGAC
CCCCGCGCCA CCTCCACCCT CGCCACATTC ACCCCCCACA GCTACATCGC CGGAAACGTG
GCGTTCAGCC CCGACGGCCA CACCCTTGCG ACGGCCAGCC TTGACGGCAC CGCCCGCCTA
TGGGACGTCA CCGACCCGCG CACCCCCCGA CCATTGGCCA CCCTCGCCCC CGGCCCCACC
TTCGACGCCA CCTTCAGCCC CGACGGCACG ATGCTCGCCA CCGCCCAGCA GGACGGCACC
ACCCTCCTGT GGACCCTCAC CACCCCCACC CAGCCGCAGC CCGCCGCCAC CATCCCCGAA
ACCGGCATGA CCACCACCGC CGTCTTCGCC CCCGACGGCA CGACGCTGGC CACCGCCAGC
ACCGACGGCA CCGCCCACCT GTGGGATCTC ACCAACCCGC GCACCCCCCG CCCGCTGGCC
ACCCTCACCG GCCACACCGG ACCCGTGGAA ACCCTCGCCT TCGACGGGAC GATGCTCGCC
ACCGCCGGTG ACGACACCAC CGCCCGCCTG TGGGATCTCA ACCCCACCAG CCTCACCCGC
CGCGCCTGCA CCACCCCCAC CGGCCGACTC AGCGAAGACG AATGGCACCG CTACCTCCCC
ACCTTCCCCT ACCAACCCCC CTGCCCCTAA
 
Protein sequence
MPQPATALRI DDPTHIGVYR LRGRLGVGGM GVVYLAEDPH HHPVAVKVIR VEFAADPEFR 
ARFRHEAEAA RRVPRFCTAA FLDADPNAER PYLVTDYVPG PTLAQAARRP LRGAELEQVA
VHIAVALTVI HGAGVVHRDL KPSNVILSPT GARVIDFGIA RAHGMTMFHI DEQIGTPAYT
APENIDGAPP DPAADIFAWG GVVLYAATGQ PPFGDRSSEL LLHRIRYDHP THLNQLHGRL
HDTVTAAMAK APEQRPTAEQ LYRMLTSTQP PTPPPAAPTR HGRQGPQPAP PPPAIPPHYA
RPIPPARRTR ITILATATAV VLAAAVLLIT DRPRHTPTTA NRPDRHGAAA ILADQATTDD
PDLAVRLAVA AYRLNPDPAA RIALLAAAAR GLPPLATFRH SGKILSVAIS PDGHTLATGS
ADHTARLWNP THPDQPVATL GHDDGVNHVA FNPAGTLLAT TSDDTTIRLW NITDPHHPDQ
VNTLTLHTGG TPYGAAFNPA GTLLAISTST GAVLLIDITD PRATSTLATF TPHSYIAGNV
AFSPDGHTLA TASLDGTARL WDVTDPRTPR PLATLAPGPT FDATFSPDGT MLATAQQDGT
TLLWTLTTPT QPQPAATIPE TGMTTTAVFA PDGTTLATAS TDGTAHLWDL TNPRTPRPLA
TLTGHTGPVE TLAFDGTMLA TAGDDTTARL WDLNPTSLTR RACTTPTGRL SEDEWHRYLP
TFPYQPPCP