Gene Francci3_3002 details


Gene Information

Locus tag: Francci3_3002
Symbol:
ID: 3905499
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3556292
End bp: 3558799
Gene Length: 2508 bp
Protein Length: 835 aa
Translation table: 11
GC content: 75%
IMG OID: 637880322
Product: WD-40 repeat-containing serine/threonine protein kinase
Protein accession: YP_482088
Protein GI: 86741688
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
        [COG2319] FOG: WD40 repeat
TIGRFAM ID:
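
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon. Both are assumptions, though they are consistent with the figures listed above. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3556292
END_BP = 3558799


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 2508 835
```

Under these assumptions the result agrees with the listed Gene Length (2508 bp) and Protein Length (835 aa).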


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.774494
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.214944
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGGGA CGACACCACC TCCGCTGCCC GGCGACGCCG TGCGCCCGCT TCAGCTGACC 
GATCCGCGGC GGCTCGGGGT CTACCAGGTG ATCGGCCGGC TCGGGCAGGG CGGCATGGGC
ACCGTCTTCC TCGGCCGGGC ACCCGACGGA AGCGCCGTCG CGATCAAGAT GATCAGGCCG
GAGCTCGCGC AACGGCCCGA ATTCCGCGCC CGGTTTGCCC GCGAGGCCGA GAGTGCTCGT
CGGGTCCGCC GGTTCACCAC GGCCGCCGTG CTGGACGCTG ATCCGTACGG GCCCCAGCCC
TATCTCGTGA CCGAGTTCGT CGAGGGTCCG ACGCTGTCGA GGCGCGTCTC CGTACGGGGG
CCGCTGCGGC CCGCCGATCT CGAACAGCTC GCGGTCAGCG TGACGACCGC CCTGAGCGCC
ATCCACGCGG CCGGGATCGT GCACCGCGAC CTCACCCCCG GCAACGTCCT GCTGTCCCCG
GTCGGCCCCA AGGTGATCGA CTTCGGCCTG GCCCGCGAGT TCAACGCGGA CACCGACCTG
AGCCACAACG TCCGGCACGC CATCGGTACG CCCGGTTACA TGTCCCCGGA GCAGATCCTC
GACGCACCGA TCACCTCGGC GGTCGACATC TTCGCCTGGG GCGCCGTCAT AATCTTCGCC
GCCACCGGGC ACGCGCCGTT CGGCACCGGT CGCATCGACG CCATTCTCTA CCGCATCGTC
AACGAGCCGC CGCGGCTCGA CGGAGTTGGC GGCGAGTTGC GCAACCTCGT CGAGATCGCG
ATGGCGAAGG ACCCGGCGGC CCGGCCGAGC GCGGAGGAGC TGCGGACGGC GCTGATCGGC
GGGGGTACCC TCCCCGCGCG CCCGGAGCCC GGCTCCATAC CGTCCGGACC GCCCGGCGCC
GGGCCCACCG GACGTGCCCG CCGCTGGGCT CGGCGCCGGC CGTCAGGCGC CGCGGGGTCG
GCGCCCAGGT CGGAGCCGCC GCCCCGCGGC ACGGGCAGCA CCGGCAGCAC CGGCGGTACC
GCCGCGAACG TGGCGGACAT GCCCGTCACC CAGCTGTCCC CGCCGCCTGT CGCCTCGCCG
CCGCCGATCC CAGCCCGGAC CCCGCCGCCG ATACCGGCCC GGACCTCCCC GCCGCGTGGA
CAGCCCGGGT CGGTACCCCC GCATCCAGCA GGAACCCCAC ACCCGCCGGC ACCCAGCCCA
TCACCGGCCC CATCCAGGCT GCGGTGGTCT CGGACGGCAC TGCTCGTCGC CGGACTCGCC
ATCGCGATCA CCGCCGCAAC GGTGCTGATC ATCGTGCCGC GCGGCGGTGG CCCGTCGCCG
GTGTCGGCGG CCGACCGCGC GTCCATCTCG TCGCGCCTGG CCGCGGACGC GGCGGCCCAG
CGGGCCCGGC AACCGGACCT GGCCGGCCGG CTGAGCCTCG CGGCCTACCG GATCGCCCCG
ACGGAGGCCG CGCGCGCGGC CGTCCTCGCG TCCTTCGCCC AGTCCACCGC CGCGCGGATC
CCCGCCGGCC CCGCGGCCTT CTCCGACATC GCGCTCAGCC CGGACGGCAC GACCCTCGCC
GGCACCGACG ACACCGGCAG CCTCCACCTC TGGAAGGTCG ACGCTGCCGG CCGGCCCACC
GCCACCACCG GGGGCTCCGC GAACGACCAT GCGCACGGCG TCGTGTTCGA CCGCTCGGGC
ACCCGACTCG CGACGGGTGG GGAGACCGAC GCCGGCCGGC TATGGGACAT CGCCGATCCG
GCGCGCCCCC GGCCGCTCAG CACGCTCGAC CCCCAGGCCA CGCCGGTCCA TCGGCTCGCG
TTGTCGAGCA GCGCGCATCT GCTCGTCACC GCCGGCGAGG ACTGGTCGGT GGGGCTGTGG
GACGTCGCCG ACCCGGCGCG GCCCGTCTCG ATCCAGTTGC TGATCGGCCG CGCCGGGCCG
GTGACGGATG TGGCACTGCG GCCGGACGGC GCCGTCCTCG CCATCGCCGG CGCCGGCGGC
CCCGTGCAAC TGTGGAACGT CCGCGACCCC CGCAGGCCCG TCCAGACGGC CTCGGTGCCC
GGCCACACCG GTGCGGTGAA CACCGTCGCG TTCAGTCCGG ACGGACGGCG GCTGGCCACC
GGCGGCGACG ACCGGATCCT GCAGGTCTCC GACGTCGGCG ATCCCGACCA TCCGCGGGTC
CTGCGCCGGC TGTCCGGCCA CACCGCCCCC GTCGCCGCCG TCGCCTTCAC CACCGACGAT
CATCTGGTGA GCGCGGACGG CGGTGGCGCG GTCGCCTACT GGGACCTCTC CGCCCCCACA
CCCCCGATGA CCCCCCTGGG CGTCCTGGAC GCACCCGCCC GGGCGGTGGC CGGGACCGGC
ACCGAGACCG TCGCGCTGAC CACCGACAAG GGGTCGGTCC TCCTCGGGAC GCTCGACCCG
GCCAGGCTGC GCCGGCTGGC CTGCGCCAAG CCGGGCGCGG CCCTGAGCCC GGCCGAATGG
AGCCGGCTCG TTCCCCGCCT CCCCTACACC GACTCCTGCT CCGGTTAG
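
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, not necessarily how the database derives its value. The function name `gc_content` is hypothetical, and whitespace and any non-ACGT characters are simply ignored.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases in seq.

    Spaces and line breaks (as in the blocked layout above) are skipped.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence above.
    print(gc_content("GTGACGGGGA CGACACCACC"))
```

Passing the full gene sequence as a single string gives the value for the whole gene.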
 
Protein sequence
MTGTTPPPLP GDAVRPLQLT DPRRLGVYQV IGRLGQGGMG TVFLGRAPDG SAVAIKMIRP 
ELAQRPEFRA RFAREAESAR RVRRFTTAAV LDADPYGPQP YLVTEFVEGP TLSRRVSVRG
PLRPADLEQL AVSVTTALSA IHAAGIVHRD LTPGNVLLSP VGPKVIDFGL AREFNADTDL
SHNVRHAIGT PGYMSPEQIL DAPITSAVDI FAWGAVIIFA ATGHAPFGTG RIDAILYRIV
NEPPRLDGVG GELRNLVEIA MAKDPAARPS AEELRTALIG GGTLPARPEP GSIPSGPPGA
GPTGRARRWA RRRPSGAAGS APRSEPPPRG TGSTGSTGGT AANVADMPVT QLSPPPVASP
PPIPARTPPP IPARTSPPRG QPGSVPPHPA GTPHPPAPSP SPAPSRLRWS RTALLVAGLA
IAITAATVLI IVPRGGGPSP VSAADRASIS SRLAADAAAQ RARQPDLAGR LSLAAYRIAP
TEAARAAVLA SFAQSTAARI PAGPAAFSDI ALSPDGTTLA GTDDTGSLHL WKVDAAGRPT
ATTGGSANDH AHGVVFDRSG TRLATGGETD AGRLWDIADP ARPRPLSTLD PQATPVHRLA
LSSSAHLLVT AGEDWSVGLW DVADPARPVS IQLLIGRAGP VTDVALRPDG AVLAIAGAGG
PVQLWNVRDP RRPVQTASVP GHTGAVNTVA FSPDGRRLAT GGDDRILQVS DVGDPDHPRV
LRRLSGHTAP VAAVAFTTDD HLVSADGGGA VAYWDLSAPT PPMTPLGVLD APARAVAGTG
TETVALTTDK GSVLLGTLDP ARLRRLACAK PGAALSPAEW SRLVPRLPYT DSCSG
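
The protein sequence corresponds to the gene sequence translated with translation table 11. In that table the codon assignments are the same as in the standard code, and alternative start codons such as GTG are read as methionine when they initiate the protein. The sketch below illustrates this under two assumptions: the input is a complete CDS starting at its initiator codon, and it contains only A, C, G and T. The name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is a valid initiator, so it is always
    written as M (this is what turns the leading GTG into methionine).
    """
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[index:index + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if index == 0 else amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate_cds("GTGACGGGGA CGACACCACC TCCGCTGCCC"))  # prints: MTGTTPPPLP
```

The printed residues match the first block of the protein sequence above.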