Gene Francci3_3419 details


Gene Information

Locus tag: Francci3_3419
Symbol:
ID: 3905659
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4063071
End bp: 4065767
Gene Length: 2697 bp
Protein Length: 898 aa
Translation table: 11
GC content: 72%
IMG OID: 637880742
Product: WD-40 repeat-containing serine/threonine protein kinase
Protein accession: YP_482502
Protein GI: 86742102
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
        [COG2319] FOG: WD40 repeat
TIGRFAM ID: [TIGR02019] bacteriochlorophyll 4-vinyl reductase
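
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 4063071
END_BP = 4065767
GENE_LENGTH_BP = 2697
PROTEIN_LENGTH_AA = 898


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4065767 - 4063071 + 1 gives 2697 bp, and 2697 / 3 - 1 gives 898 aa, matching the listed values.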


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCACC CACCCGGCCG GAGTGGACCC GCGGACGGAC CCGCGGATCG TGCTGCCTCC 
GAACCGACCG AGGTCGGTCC CTACCGGCTG GTGCGGCGTC TCGGCGCGGG TGGGATGGGG
ACCGTCTACC TGGGGGAGAA CGCCGCCGGT GGTCTGGTCG CGGTCAAACT GATCCGGGCG
GATCTCGCCC GGTTGGCCGA GTTCCGCAGC CGGCTCAAGC AGGAGGCCGA CAACGCCCGG
CGGGTCGCTC GTTTCTGCAC CGCGGCGGTC CTCGACGTCG ACATCACCGC CGATCCGCCT
TATCTGGTGA CCGAGTACGT CGACGGGCCG ACGCTGTCCG AGGCGGTTGG CACGCGCGGC
CCGTTGACCC CGGCGGAGCT TCATCAGCTG GCGGTGAGCA TGACCACCGC GCTGATGGCG
ATCCACCGGG CCGGACTCGT GCACCGGGAT CTGAAGCCGT CCAACATCCT GCTGTCCCGG
CTCGGGCCGA AGGTGATCGA TTTCGGTATC GCGCGCGCCC TGGACTCCGC GACGGTACTC
AGCGGGGATC GCCAGCTCGG GACGCCGGCG TTCATGGCGC CGGAACAGGC CCTGGGCGAG
CAGGTCACCT CGGCGGCGGA CGTGTTCGCC TGGGGCGGTG TCCTCATCTT CGCCGGGACC
GGTCGTTACC CGTTCGGTAA CGGCCCCGCT CCCAGCGTGC TGTACCGGAC GGTCAACGAT
CCTCCGACGC TTGACGGCTT CGAGGACAGC CTGCGGCCCC TGGTCTCCGA TGCGATGCGC
AAGGCAGCGG CGGAGCGACC CACCGCCGAG AAGCTCTACG CGCGCCTTCT GGACCTACGT
GTCGAAGCCC CGATGGCGGT CAAGGGGCTG TCCCTGTCCG AGGTGACCGC GCTGATCCGG
CCGCTGAACA CGTCCCCCGG CGGCGCCTCC GGAACGTCGG CGAGCGACGC CGACCTCGCG
GGTCCGATCA CGCCCGTCAC CGGCCATCAG CCGATCGGAC CCCCGCCGCC GGCGAACCTG
ATCTCGTTCC CGGGGCCGGG CATTCCCGGG GCACCAGCCC GGTCCGGCCC GTCGTCGGTT
CCGGGCCCGT CGTCGGTTCC GGGCCCGTCG TCGGTTCCGG GCCCGACCTC TTCTCCCGGG
CTCGGGTCGT CCTCGTCGGG GAGCGAGGCG TCACCGTCCG GGCCGGATGC CCGCTGGCTG
CCGGGCGCAG GCTCGTCGTC CCGGTCCCGG GACTTCGCCG ATGCTCTGGT GACGGTCTGG
ACCGAGGACG AACACAGCCA GGACCCCGGG CGAGGAGCGT CCCCGTCGTC CTCGTCGTCC
TCGTCTCAGC GGTCGTCCTC GTCTCAGCGG TCGTCCTCGT CGTCGCGGAG GTCCTCGTCC
GCCGCGCCAC CGGGCGACCG GGCCCGCAAC CGACGTCCGC TGATCATCGT TGCCCTGGTG
GCCGCCGTCA CCGTGCTCGT GACGACGGTC GCCATCGTGT CCACCAGGGG CGGGGGCTCG
CGGACCGAGA TGAGCGTGCC TGAGGCGGTC GCCGAGCGGG CGCTGCTCCT CCAGGACGGC
GACACGGGAC TCGCTCGCCG GCTCGCGCTC GCGGCCTACC GCGCCGAGCC GCACTCGGCC
AGGACCCGGA GCGCGATGAT CGCGCTGTTC GGTGCCGGGA TCACCCCGAC CACCATCCCC
GTCGGGACCG GTGCCCTGCT GGCGCTCGCG GTGAGCCCGG ACGGGCACTG GATCGCGGCG
GGGAGCAACA ACGGAACGGT CACGTTATGG GAAGTCGTCG GTCGGACCGA GCTGGTCCGG
CGCACCTCCG TCTCGGTGCC GAGCCGCAGC TGGATCGAGT CGCTGGCGTT CAACCGGGAC
GGTGGCCTGC TCGCCGCGGG CCATTCCGAC GGCACGATCC GACTGTGGAA CCTGCACGAC
CCCGACCAGA TGGTCCGATG GTCAACCATC CAGGCCCACA CCGACGCGGT GCAGTCGGTG
GCCTTCAGCC CGGACAGCAA CACTCTCGGG TCCGCGAGTG CGGACGGCAT CGTCGCGCTG
TGGGACGTCA CGGACCCGGC GCGTCCGAAG CAGCGCGTCC GGGCGGACGG TCAGACCGGG
GGAGTGCGCT CGATGGCCTT CGCGCCTAAC GGCACCCTGC TCGCGTTCGC TGGTGAGGAC
GGCACCGTCC ATCTGTGGAA CATCCGGGAC GCGGCGCGGC CGACCGCGGG TGGAATCCTG
CGCGGGCACA GCCGTGGGGT GCGGTCCGTG GTCTTCACCG GCGACGGAGG CGTTCTGGTC
TCCGGCGGCG TCGACGCCAC GGTGCGCCTG TGGGAGGTGC GGTATCCGGA CAATCCCGCC
CGTGGCGTCG CCACCGGCTC GCTGGGCGGG ATCCAGAGCG TGGCCTTCGA GCCGGGTGCC
GACGTCGTCG CCTCCGCCGG CGATGACGAG ACGGTCCGGC TGACCGACAT CTCCCGTCTG
GACACGCCGA TCCTGCTGAC CCAGTGGCAC GGGCACACCC AGCCGATCAG CGCGATCGCC
TTCGTCTCCG GCACGGGCGT CGTCGTCTCG GCTGGTCACG ACGGGACATT GCGGCTGTGG
GACGCCGAGC CCGGCCGGCT CGCGGACACC GCGTGCGCCG ATCCGGCGAA CCGGATCACC
GCCGGTGAAT GGAGCACTGC CTTCAGGGAT ATGGGCTACC GGGCTCCCTG CGGCTGA
 
Protein sequence
MSHPPGRSGP ADGPADRAAS EPTEVGPYRL VRRLGAGGMG TVYLGENAAG GLVAVKLIRA 
DLARLAEFRS RLKQEADNAR RVARFCTAAV LDVDITADPP YLVTEYVDGP TLSEAVGTRG
PLTPAELHQL AVSMTTALMA IHRAGLVHRD LKPSNILLSR LGPKVIDFGI ARALDSATVL
SGDRQLGTPA FMAPEQALGE QVTSAADVFA WGGVLIFAGT GRYPFGNGPA PSVLYRTVND
PPTLDGFEDS LRPLVSDAMR KAAAERPTAE KLYARLLDLR VEAPMAVKGL SLSEVTALIR
PLNTSPGGAS GTSASDADLA GPITPVTGHQ PIGPPPPANL ISFPGPGIPG APARSGPSSV
PGPSSVPGPS SVPGPTSSPG LGSSSSGSEA SPSGPDARWL PGAGSSSRSR DFADALVTVW
TEDEHSQDPG RGASPSSSSS SSQRSSSSQR SSSSSRRSSS AAPPGDRARN RRPLIIVALV
AAVTVLVTTV AIVSTRGGGS RTEMSVPEAV AERALLLQDG DTGLARRLAL AAYRAEPHSA
RTRSAMIALF GAGITPTTIP VGTGALLALA VSPDGHWIAA GSNNGTVTLW EVVGRTELVR
RTSVSVPSRS WIESLAFNRD GGLLAAGHSD GTIRLWNLHD PDQMVRWSTI QAHTDAVQSV
AFSPDSNTLG SASADGIVAL WDVTDPARPK QRVRADGQTG GVRSMAFAPN GTLLAFAGED
GTVHLWNIRD AARPTAGGIL RGHSRGVRSV VFTGDGGVLV SGGVDATVRL WEVRYPDNPA
RGVATGSLGG IQSVAFEPGA DVVASAGDDE TVRLTDISRL DTPILLTQWH GHTQPISAIA
FVSGTGVVVS AGHDGTLRLW DAEPGRLADT ACADPANRIT AGEWSTAFRD MGYRAPCG
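
Both sequences above are printed in 10-character groups separated by spaces, so they need to be cleaned before any computation. The following is a minimal sketch, using hypothetical helper names, of how the grouping could be removed and the GC percentage of a nucleotide sequence computed. The demo input is only the first two and the last group of the gene sequence; pasting the full gene sequence instead should give a value close to the GC content listed in the record, if the record is internally consistent.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Small demo input: first two groups and last group of the gene sequence.
demo = clean_sequence("ATGTCCCACC CACCCGGCCG\nCGGCTGA")
print(len(demo), round(gc_percent(demo), 1))
```

The same `clean_sequence` helper works for the protein sequence, whose cleaned length can then be compared with the Protein Length field.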