Gene Francci3_0044 details

Gene Information

Locus tag: Francci3_0044
Symbol:
ID: 3903523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 53428
End bp: 56019
Gene Length: 2592 bp
Protein Length: 863 aa
Translation table: 11
GC content: 71%
IMG OID: 637877374
Product: serine/threonine protein kinase
Protein accession: YP_479167
Protein GI: 86738767
COG category:
    [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
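
The coordinate and length fields above are consistent with each other, which can be checked with a short Python sketch. It assumes 1-based, inclusive coordinates and that the gene length counts the terminating stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 53428
end_bp = 56019


def gene_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 2592 863
```

Under those assumptions the result matches the listed Gene Length (2592 bp) and Protein Length (863 aa).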


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0701705
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGGGTC AGGGCCGTAG GATCACCAGG AATCCGACGA AGGGCGTCAT TGCAGGCCAG 
GATGGTTCTC GGCGCACGGG TGGCGGTACA GCGGATGCCC GGCGTGGTAG CGGACGGATC
TGGACCTTGT TGATAGATAA ATCTCGGGTG GAGGCGGCGC TTCCCGGTTA CTCCGTCGAG
GGCGACTTGG GCCGGGGTGG CTACGGGCTC GTGCTGGCCG GCCAGCACCG GCTCATCGGC
CGCAAGGTCG CGATCAAGAT CCTGCTGGAC ACCTCCGACG ACCCCGATCT GCGTACCCGC
TTCCTGTCCG AGGCTCGGGT GCTGGCCGAA CTGGATCATC CGCACATCGT GCGTATCCAC
GACTACGTGG AGCATGAGGG CACCTGCTTG CTCGTCATGG AGCTGCTCTC CGGCGGCACG
CTCAAACAGC GGATGAGCTC GGGACCGGTC TCCGCCGAGA CCACCTGCTC GATCGGGCTC
GCGGCCGCGG CCGCCCTCGC CACGGCGCAC GGCCACGGAG TGCTGCACCG GGACATCAAG
CCGGACAACA TCATGTTCGC CGGCGACGGG CTGCTGAAGG TGACCGACTT CGGCATCGCC
AAGATTTTCG ACGGAGCGGA GACCACCGCC AGCGCGATCC TGGGAACGCC GCGTTACATG
GCTCCCGAGC AGATCATGGG GACGCGTCTC TTCCCGTCCA CCGATCTTTA CGCCCTCGCG
GGTGTGCTCT ACGAGATGAT CGCCAACCGG CCGCTGTTCG GCCGTCAGAT GGCGGTGCAA
CCGCTGACCC ACCACCATCT GACGATCATG CCGGAACCGC TCACCATGGT TCCGCCACCC
GTCTCCGCGG TGATCCTGCG GGCGCTGGCG AAGGATCCGA GCATCCGCTT CGCCGACGCC
GCCGACTTCG CCCTGGAACT CGCCCGAGCG AGCAGCCGCG CGTTCGGCCC GACCTGGCTG
TCCCGGTCCG ACGTCAAGGT GCGGATCGAC GACGAGATCC GGGAGGCGGC GCTCGCCACC
TCGACCACCC CGCGCCCGCC GGCCGCCGGT CGACCGGGCT TCCCCGGTAG CCCCGGAGCA
CCCGGTAGTC CGGGCTTCCC CGGTAGCCCC GGAGCACCCG GTAGTCCGGG CTTCCCCGGT
AGCCCCGGAG CACCCGGTAG TCCGGCCGGC CATCCGATGG GGGGATATCC ACCACCGGGT
GGGCCGGGCT GGGGTGGATT CCCGCCGGGC AACACTCCAC CGCCGCGGTC GACCCCACCG
CCGCGGTCGA CCCCACCGCC GCGGTCGCTC GGCCCCGGTT ATGGCGGACC GGATGCCCCA
GGCGGACCGG GTGCCCCCGG CGGTCCCGGC GGCCAGACGT ACCGGCCCGG TCCGGGGGGA
CCGGTGCACG GGATGGCGGG GGTACCGCCG GCCGCCACCC GCCAGGCCGG GCACCAGTCG
AGTCCGAACG CCCGTAACCG GACGCCGCTC ATCATCGGAG CGGTCGCGTT CGTCGTCATC
GTCGCCATCA CCGTCGGCAT CGTGGCAGCG GTGACGAACT CGGGCGGCGG CAGCCGCGGC
GGTGACCGCG GCGGCGGCAC GGCACGCCTC GCAACCGCCT ACCGGGGAAC GGCGCTGTCG
GTGCAGGGCC TGAGCCCCTA CAGCGTCGAT GTCGATCCCG ACGGCTCGCT ACTCGTCTCC
AGCCTCGCGA CGGACCGCAT CCAGAAGATC ACCCCCGCCG GGGCGGTCAG CGACCTCGCG
GGCACCGGAG CCGGCGGGAT CAGCGGCGAC GGCGGCCCGG CCACCGCGGC GCAGCTTGAC
GGGCCCGGAT CGACCGCCCG TGACAAGGCC GGCAACATCT ACATCGGGGA CGCGAAGAAC
AATCGGATCC GCAAGATCAG CCCAGCCGGG ATCATCACCA CGATCGCCGG CACCGGTGAC
GCCGGTTACG GCGGCGACGG CGGCCCGGCC ACCGCGGCGA AGATCAACAG TGCGGAAAAG
GTGACCACCG GGCCGGACGG CAGCGTCTAC CTCTCCGACT ACGAAAACCA CCGGATCCGC
AAGATCAGCC CACAGGGGAT CATTACCACG TACGTGGGCA CCGGAGTCGC GGGTTACACC
GGCGACGGCG GCCCGGCCAC CGCAGCGAAG ATCAACGGCC CGAACGATCT CCAGATGACC
GACGACGGCA CCCTCTACTT CGCCGACCTC GCCAGCGACA CCATTCAGAA GGTGACCCCG
GACGGGATCA TCACCACCGT CGCCGGGACC GGTGAGGGGG GCTTCTCGGG TGACGGCGGC
CCGGCCACCC GGGCCAGGCT GAACGTGCCG TCGCTCACCG TCGGCCCGGA CGGCCGGACG
CTCTACCTCG CTGACTACCG CAACCACCGG ATCCGCCGGG TCGACCCGAA CGGCGTGATC
ACCACGATCG CCGGCACCGG CGGCGAAGGC TCCGGCGGCG ACGGCGGCCC GGCGACCGCG
GCCCAGTTCA AGAACCCGAG CTCGGTCGCG GTCGACGGCA GCGGCGCGCT CTACATCGCG
GACAACGGCA ACGATCGGGT GCGCCGCATC GATCCGAACG GCACCATCAC GACCGTCGCC
CAACCGGGCT AG
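
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following Python sketch strips the whitespace and computes the GC fraction, the quantity reported as "GC content" in the record. The function names are illustrative, and how the database rounds its percentage is not stated here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence above, as a small example.
print(gc_fraction("ATGGGGGGTC AGGGCCGTAG"))
```

Passing the full gene sequence to `gc_fraction` gives the value to compare against the listed GC content.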
 
Protein sequence
MGGQGRRITR NPTKGVIAGQ DGSRRTGGGT ADARRGSGRI WTLLIDKSRV EAALPGYSVE 
GDLGRGGYGL VLAGQHRLIG RKVAIKILLD TSDDPDLRTR FLSEARVLAE LDHPHIVRIH
DYVEHEGTCL LVMELLSGGT LKQRMSSGPV SAETTCSIGL AAAAALATAH GHGVLHRDIK
PDNIMFAGDG LLKVTDFGIA KIFDGAETTA SAILGTPRYM APEQIMGTRL FPSTDLYALA
GVLYEMIANR PLFGRQMAVQ PLTHHHLTIM PEPLTMVPPP VSAVILRALA KDPSIRFADA
ADFALELARA SSRAFGPTWL SRSDVKVRID DEIREAALAT STTPRPPAAG RPGFPGSPGA
PGSPGFPGSP GAPGSPGFPG SPGAPGSPAG HPMGGYPPPG GPGWGGFPPG NTPPPRSTPP
PRSTPPPRSL GPGYGGPDAP GGPGAPGGPG GQTYRPGPGG PVHGMAGVPP AATRQAGHQS
SPNARNRTPL IIGAVAFVVI VAITVGIVAA VTNSGGGSRG GDRGGGTARL ATAYRGTALS
VQGLSPYSVD VDPDGSLLVS SLATDRIQKI TPAGAVSDLA GTGAGGISGD GGPATAAQLD
GPGSTARDKA GNIYIGDAKN NRIRKISPAG IITTIAGTGD AGYGGDGGPA TAAKINSAEK
VTTGPDGSVY LSDYENHRIR KISPQGIITT YVGTGVAGYT GDGGPATAAK INGPNDLQMT
DDGTLYFADL ASDTIQKVTP DGIITTVAGT GEGGFSGDGG PATRARLNVP SLTVGPDGRT
LYLADYRNHR IRRVDPNGVI TTIAGTGGEG SGGDGGPATA AQFKNPSSVA VDGSGALYIA
DNGNDRVRRI DPNGTITTVA QPG
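
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. A minimal Python sketch follows. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code (the two differ in which codons may act as start codons), and it does not handle alternative start codons or ambiguous bases. All names are illustrative.

```python
# Codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw_dna: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGGGGGGTC AGGGCCGTAG GATCACCAGG"))  # MGGQGRRITR
```

The output matches the first ten residues of the protein sequence. Likewise, the final gene block `CAACCGGGCT AG` translates to `QPG` followed by a stop codon, which agrees with the end of the protein.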