Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3419 |
Symbol | |
ID | 3905659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4063071 |
End bp | 4065767 |
Gene Length | 2697 bp |
Protein Length | 898 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880742 |
Product | WD-40 repeat-containing serine/threonin protein kinase |
Protein accession | YP_482502 |
Protein GI | 86742102 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase [COG2319] FOG: WD40 repeat |
TIGRFAM ID | [TIGR02019] bacteriochlorophyll 4-vinyl reductase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCACC CACCCGGCCG GAGTGGACCC GCGGACGGAC CCGCGGATCG TGCTGCCTCC GAACCGACCG AGGTCGGTCC CTACCGGCTG GTGCGGCGTC TCGGCGCGGG TGGGATGGGG ACCGTCTACC TGGGGGAGAA CGCCGCCGGT GGTCTGGTCG CGGTCAAACT GATCCGGGCG GATCTCGCCC GGTTGGCCGA GTTCCGCAGC CGGCTCAAGC AGGAGGCCGA CAACGCCCGG CGGGTCGCTC GTTTCTGCAC CGCGGCGGTC CTCGACGTCG ACATCACCGC CGATCCGCCT TATCTGGTGA CCGAGTACGT CGACGGGCCG ACGCTGTCCG AGGCGGTTGG CACGCGCGGC CCGTTGACCC CGGCGGAGCT TCATCAGCTG GCGGTGAGCA TGACCACCGC GCTGATGGCG ATCCACCGGG CCGGACTCGT GCACCGGGAT CTGAAGCCGT CCAACATCCT GCTGTCCCGG CTCGGGCCGA AGGTGATCGA TTTCGGTATC GCGCGCGCCC TGGACTCCGC GACGGTACTC AGCGGGGATC GCCAGCTCGG GACGCCGGCG TTCATGGCGC CGGAACAGGC CCTGGGCGAG CAGGTCACCT CGGCGGCGGA CGTGTTCGCC TGGGGCGGTG TCCTCATCTT CGCCGGGACC GGTCGTTACC CGTTCGGTAA CGGCCCCGCT CCCAGCGTGC TGTACCGGAC GGTCAACGAT CCTCCGACGC TTGACGGCTT CGAGGACAGC CTGCGGCCCC TGGTCTCCGA TGCGATGCGC AAGGCAGCGG CGGAGCGACC CACCGCCGAG AAGCTCTACG CGCGCCTTCT GGACCTACGT GTCGAAGCCC CGATGGCGGT CAAGGGGCTG TCCCTGTCCG AGGTGACCGC GCTGATCCGG CCGCTGAACA CGTCCCCCGG CGGCGCCTCC GGAACGTCGG CGAGCGACGC CGACCTCGCG GGTCCGATCA CGCCCGTCAC CGGCCATCAG CCGATCGGAC CCCCGCCGCC GGCGAACCTG ATCTCGTTCC CGGGGCCGGG CATTCCCGGG GCACCAGCCC GGTCCGGCCC GTCGTCGGTT CCGGGCCCGT CGTCGGTTCC GGGCCCGTCG TCGGTTCCGG GCCCGACCTC TTCTCCCGGG CTCGGGTCGT CCTCGTCGGG GAGCGAGGCG TCACCGTCCG GGCCGGATGC CCGCTGGCTG CCGGGCGCAG GCTCGTCGTC CCGGTCCCGG GACTTCGCCG ATGCTCTGGT GACGGTCTGG ACCGAGGACG AACACAGCCA GGACCCCGGG CGAGGAGCGT CCCCGTCGTC CTCGTCGTCC TCGTCTCAGC GGTCGTCCTC GTCTCAGCGG TCGTCCTCGT CGTCGCGGAG GTCCTCGTCC GCCGCGCCAC CGGGCGACCG GGCCCGCAAC CGACGTCCGC TGATCATCGT TGCCCTGGTG GCCGCCGTCA CCGTGCTCGT GACGACGGTC GCCATCGTGT CCACCAGGGG CGGGGGCTCG CGGACCGAGA TGAGCGTGCC TGAGGCGGTC GCCGAGCGGG CGCTGCTCCT CCAGGACGGC GACACGGGAC TCGCTCGCCG GCTCGCGCTC GCGGCCTACC GCGCCGAGCC GCACTCGGCC AGGACCCGGA GCGCGATGAT CGCGCTGTTC GGTGCCGGGA TCACCCCGAC CACCATCCCC GTCGGGACCG GTGCCCTGCT GGCGCTCGCG GTGAGCCCGG ACGGGCACTG GATCGCGGCG GGGAGCAACA ACGGAACGGT CACGTTATGG GAAGTCGTCG GTCGGACCGA GCTGGTCCGG CGCACCTCCG TCTCGGTGCC GAGCCGCAGC TGGATCGAGT CGCTGGCGTT CAACCGGGAC GGTGGCCTGC TCGCCGCGGG CCATTCCGAC GGCACGATCC GACTGTGGAA CCTGCACGAC CCCGACCAGA TGGTCCGATG GTCAACCATC CAGGCCCACA CCGACGCGGT GCAGTCGGTG GCCTTCAGCC CGGACAGCAA CACTCTCGGG TCCGCGAGTG CGGACGGCAT CGTCGCGCTG TGGGACGTCA CGGACCCGGC GCGTCCGAAG CAGCGCGTCC GGGCGGACGG TCAGACCGGG GGAGTGCGCT CGATGGCCTT CGCGCCTAAC GGCACCCTGC TCGCGTTCGC TGGTGAGGAC GGCACCGTCC ATCTGTGGAA CATCCGGGAC GCGGCGCGGC CGACCGCGGG TGGAATCCTG CGCGGGCACA GCCGTGGGGT GCGGTCCGTG GTCTTCACCG GCGACGGAGG CGTTCTGGTC TCCGGCGGCG TCGACGCCAC GGTGCGCCTG TGGGAGGTGC GGTATCCGGA CAATCCCGCC CGTGGCGTCG CCACCGGCTC GCTGGGCGGG ATCCAGAGCG TGGCCTTCGA GCCGGGTGCC GACGTCGTCG CCTCCGCCGG CGATGACGAG ACGGTCCGGC TGACCGACAT CTCCCGTCTG GACACGCCGA TCCTGCTGAC CCAGTGGCAC GGGCACACCC AGCCGATCAG CGCGATCGCC TTCGTCTCCG GCACGGGCGT CGTCGTCTCG GCTGGTCACG ACGGGACATT GCGGCTGTGG GACGCCGAGC CCGGCCGGCT CGCGGACACC GCGTGCGCCG ATCCGGCGAA CCGGATCACC GCCGGTGAAT GGAGCACTGC CTTCAGGGAT ATGGGCTACC GGGCTCCCTG CGGCTGA
|
Protein sequence | MSHPPGRSGP ADGPADRAAS EPTEVGPYRL VRRLGAGGMG TVYLGENAAG GLVAVKLIRA DLARLAEFRS RLKQEADNAR RVARFCTAAV LDVDITADPP YLVTEYVDGP TLSEAVGTRG PLTPAELHQL AVSMTTALMA IHRAGLVHRD LKPSNILLSR LGPKVIDFGI ARALDSATVL SGDRQLGTPA FMAPEQALGE QVTSAADVFA WGGVLIFAGT GRYPFGNGPA PSVLYRTVND PPTLDGFEDS LRPLVSDAMR KAAAERPTAE KLYARLLDLR VEAPMAVKGL SLSEVTALIR PLNTSPGGAS GTSASDADLA GPITPVTGHQ PIGPPPPANL ISFPGPGIPG APARSGPSSV PGPSSVPGPS SVPGPTSSPG LGSSSSGSEA SPSGPDARWL PGAGSSSRSR DFADALVTVW TEDEHSQDPG RGASPSSSSS SSQRSSSSQR SSSSSRRSSS AAPPGDRARN RRPLIIVALV AAVTVLVTTV AIVSTRGGGS RTEMSVPEAV AERALLLQDG DTGLARRLAL AAYRAEPHSA RTRSAMIALF GAGITPTTIP VGTGALLALA VSPDGHWIAA GSNNGTVTLW EVVGRTELVR RTSVSVPSRS WIESLAFNRD GGLLAAGHSD GTIRLWNLHD PDQMVRWSTI QAHTDAVQSV AFSPDSNTLG SASADGIVAL WDVTDPARPK QRVRADGQTG GVRSMAFAPN GTLLAFAGED GTVHLWNIRD AARPTAGGIL RGHSRGVRSV VFTGDGGVLV SGGVDATVRL WEVRYPDNPA RGVATGSLGG IQSVAFEPGA DVVASAGDDE TVRLTDISRL DTPILLTQWH GHTQPISAIA FVSGTGVVVS AGHDGTLRLW DAEPGRLADT ACADPANRIT AGEWSTAFRD MGYRAPCG
|
| |