Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7125 |
Symbol | |
ID | 5675801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8696761 |
End bp | 8700966 |
Gene Length | 4206 bp |
Protein Length | 1401 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245963 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001511352 |
Protein GI | 158318844 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCCG AGCGTTCGGA GCGCGGCGGA CCGCACGGTT CCGGTGAGGA TTCGCCGGGC GCCGCGGCCC GGCCGGACCA GCGGACCTCT CGGACTGTGG CCGGGAACGC GGCGGAGATG CGTACGCCGC CGCTGGTCGA GTCCCAGGCG GCGCCCACCC AGATCATCGA CGGATACCCG TCGAAGCCAC CGACCAGCGT CCCGCCGGCC GGTGCGTACT CGACGGCCTG CCCGTACCCG GGGCTGCGCG GCTTCGACGA GTCGAGTGAG CAGTGGTTCT TCGGGCGTGA GCGGATGGTC GCCGACCTGG TGGCGAGGGT CTCCTCGGAG GCGTCGAGGG TGGGGCCGCT GGTGCTCGTC GGCGCCTCGG GGTCGGGGAA GTCGTCACTG CTGCGTGCCG GGCTCCTGCC CGCCCTCGCG CGCGGAGCGC TTCCCGGCTC GCGTGGCTGG CCACGGCTGG TGATGACACC GGGCGAGCAC CCGCTCGAGG CGCTCGTCCA GCGTGTTGTC GAGGCGACCG GGATGCCGAC GATGGCGCGG ATGCTGGGTG ACAGCCTGCG CCGGGAGCCG GAGCGGCTCA GCGAGATCGT CCGTGAGCTG CTCGCCGCGC GCGCCACGCC TCGGTCTGCC CCAGAGGGGG GCGAGACCAC GATCCGCCGC GGCCCACGAA ACGACGCCAC CACGCTCTAC AGGCCAGGCA ACCCCCACGA GCCAGCCGAT CCCCGCGAGC CGCGTGAATC CGCCGCGCCG CACGAACCCC GCGAGCTCTC CGCGTCACGC GCGCCCCTCA AACCCGCCGT GCCCCACGAA CCCCCCACAC CGCGCAAAGC CCACGAACCA CGCAAAGCTC ACGAACCACA CGCTCCGCAC GAGCCGCGGA TGGTGATCGC GGTCGACCAG TTCGAGGAGG TTTTCACGCT CTGCGCGGAC CAGGCCGAGC GGGAGGCTTT CGTCCGGGCC CTGTGCGCGA CCGCGGCGGG CAGTGCGGTC GTCGTGATCG GCCTGCGGGC GGACTTCTAC GGGGCGTGCG CCTCCTTCCC CGAGCTCGTC GAGGTCCTGC AGGTCAATCA GGTGGTGGTC GGGCCTATGG CGGCCGCCGA CATCCGCGAG ATCGTGGTGA ATCCGGCTCG TGCCGCGGGT GGGGACGTTG AGCCCGGGCT CGTCGAGCTC GTCCTGCGCG ATCTCGGCGC CGCCGCTGGC ACGGGGATCA CCGGAAGCGA CCAGGGCACC GGCCACGGCA GCAGTCGCAG CCCCGAGCTC GTCTCCAACC TCAGCTCCGG GTTCGTGGCT GATCCGGGCT CGCTGCCACT GCTCGCCCAC GCGCTGCGCG CGACCTGGTT CGCCCGTGCG GGCGAGGCGC TCACCGTCGC CGATTATCTG CGCGTCGGCG GGCTGACCGG GGCCATCGCC CAGACCGCCG AGGCGGCCTA CACGAGTCTG GACGGGGCCG CCCAGCAGGC CGTCCGACCA CTACTGATGC GCATGATCCG GCTTGGTGAG AACGGCGCGG ACACCCGGCG GCGGGTACGG CGGGCGGCGC TGCTCGCGGA GGTGCCGGGG CCGGAGTCGG CGACCGTCCT GGATGCGCTG GTAGCCGCGC GTCTCGTCGA GGCCGATGCC GATGGGGATC AGGACGGTCT GCAGATCGCG CACGAGGCGT TGCTGCGTTC CTGGCCACGG CTGCGCGAGT GGATGGACAT GGACCGCGCC GGCGCCGTGG CGTTGCAGCA GCTGCGCGAC GCCGCTGAGG TGTGGGAGCG GGGCGGGCGG GATCCGTCCT ACCTTTTCGC CGGTTCGCGT CTCGCCGCGG CCCGCGACTG GATGGACGAC GACCTGGACG TCGATGCGAC GACCAGGCAG TTCTTCGACG AGAGTGTCCG CGCAGAGGCC GAGCAGCAGC GGGCGGCGGC GCGCCGGACC CGTAGGCTGC GCCAGCTCGT GGCGGCGCTG GCCGTCCTGC TGCTCGTCGC CGCGTCGCTG GCGGGCCTGA CGTTCCAGCA GAGCGCCTCG TCGGGGCGGG CTCGGGACCA GGCGCTCTCG CAGCGGATCG CGACGCAGGC GGAGGCTGCC AGGCGTAACA ATCCGGCGCT GGCCGCCCAG CTCAGCCTGG TCGCGCTGCG CACGGCGGAC ACGCCGGAGG CACGCGGGGC CGTCCTGTCG TCGTTCAACG GCGGTAGCGG GGTGCCGACT CGGTACCAGG CGCATACCAA GTCGGTCGGG ACGGTCGCCT ACAGCAGGGA CGGCCGCCTG CTCGCGACGG GCAGCGACGA CTGGACAGCC GCGGTCTGGG ACGCGGCCGA TCCGCGGCGG CTCACGCCAC TCGCACGCAT CCCGGACGAA CGCGGTGGCG GGCATGGCAG GGCGGTGAAG GCGGTGGCCT TCAGCCGGGA CGGGACGGTG CTGGCGACCG GCGGCGCCGA CGGCCTGGCC AAGCTGTGGA ACATCACCGA CCGGGCCCGA CCGCGCCTGC TGGCGACCCT GCCGAAGGCG GATTCGGAGG TCTACGGCCT GGCGTTCGAT CCGACGTCTG ATCGTCTCGC CGTCGGTGGT TACGGCAAGT CGGCGTACAT CTACGACGTC TCCGATCCGG CGCGCCCGGA GAAGAAGGGG CAGCTCTTCC TGCACCTGGC GCAGGTGGTG GCGCTGGAGT TCAGCCCGGA CGGCGCGTTC CTGATCGCCG GGGACGAGGG CGGTTCCGCC CTGTTGTGGT CGGTCAGTGA TCCGAGCAAC CCGAGGCCGC TCAAGGTTCT CGTCGAGGAC GGCGGCCCGA GCTCGGACGG CGCGGGCGCG ATCCGGTCGA TCACCTTCGG CGGCGACGGC CACACGGTCT ACACCGCCGG GGACGGCGGC TACGTCCGCA AGTTCTCCGG GCCGGACCTG CCGCGTCTGG AGTACGACGG CCGGGCGGGG ACCGGGGACG CGCCGATGAC CGGCCTCGCC GTGGACCCGG TCAGCGGCCT GGTCGGCGTG GGCGGGTTCC GGTACGTCGG AGTTCCGATT TTCGATGTGG ACGTCGACCA GTACAGCCTG ACCTTCCTCG ACGAGGGCGC CACGGTCTGG GATGTGGCCT TCAGCCCGGA CGGCCGCCGG CTCGCGTCGG TCTCGGTGGA CGGCTCGCTG CGGGTCTGGG AGATGCCCGG CCCGGCGCTG ATCGGGCGCA ACGGCGCGCA GGAGGACGCC GTCGTCAACC CGGTCACCGG CATCGTAGCG ATCACCACCG ACAAGGCGGT CGAGCTGTGG GACGTCGATG ATCCTTACGC GCCCCGCCGA CTGCACGTGC TCACCGACGT GATCGTGGAC GAGTACGACC CGACGGGCTC GTCGGCCTTC AGCCCGGACG GGAACGTGCT CGCGGTGGGC ACGGGCAAGA ACATCGTCTT CTACGACGTC CGCGACCCTG CGAAGCCGTC CCGGATCTCG GACGTGCCGG GGCCGGCCGG GGGCACCGCG GAGCTGTTGT TCAGCCCGGA CGGGAGGACC CTGGCGCTCG GCGGCCTGAA CTCCCCGCCG GAGCCGGCAT TCCAGGCCAG GGTCGAGACC TGGGACGTCA CCGATCTCTC CCGGCCGCGG CGGCTCGCGT CGCTGATCGC GCACCGCTCA TCCGTCCGTG ACCTGACCTT CTCACCGGAC GGCCGGACCC TGGTGAGCGC GGCCGAACGA TCGGTGAAGC TCTGGGATGT GACGGACCCG CGACGGCTCC GGCTGGTCTC GGAGCTGCCC GAGTTCCCCG GCGGGGTCTG GGAGGTGCGC TTCTCCCCCG ATGGCAGGAC GCTCGCGGCC GGCGGGGCCA ACCCGTTCGC GACGCTGTGG GACGTGACCC GCATGGACGC GCCGCGCCAG ATCGCGGACC TGCCGGGCCA TTCGGCGTCC GTGACCAGCG TCGCGTTCAG CCCGGACGGC ACGCAGCTCG CCACTGGCAG CAACGACAAC ACCGTACGGA TATGGGACGT TACGGAACAC GACTCCCCGA CGCTCATCGA GAAGCTCGCA CGCTCGGCGG GCAGCGAGGC CGGTATCGAG GAGATCCTCT ATACCCGGGA CGGCGAGAAG CTGGTCGGCG TGATCTTCAC CGTGCCCGCG GTGGTGTGGG ACCTCGATGT CAACCGGGTG CGGGCCCGGA TCTGCGAGCG GGCGGGCGTG GGCATCACGG CCGCGGAGTG GCGCCGTTTC CTACCGGATC TGCCCTACGA TCCGGTCTGT GACTGA
|
Protein sequence | MTSERSERGG PHGSGEDSPG AAARPDQRTS RTVAGNAAEM RTPPLVESQA APTQIIDGYP SKPPTSVPPA GAYSTACPYP GLRGFDESSE QWFFGRERMV ADLVARVSSE ASRVGPLVLV GASGSGKSSL LRAGLLPALA RGALPGSRGW PRLVMTPGEH PLEALVQRVV EATGMPTMAR MLGDSLRREP ERLSEIVREL LAARATPRSA PEGGETTIRR GPRNDATTLY RPGNPHEPAD PREPRESAAP HEPRELSASR APLKPAVPHE PPTPRKAHEP RKAHEPHAPH EPRMVIAVDQ FEEVFTLCAD QAEREAFVRA LCATAAGSAV VVIGLRADFY GACASFPELV EVLQVNQVVV GPMAAADIRE IVVNPARAAG GDVEPGLVEL VLRDLGAAAG TGITGSDQGT GHGSSRSPEL VSNLSSGFVA DPGSLPLLAH ALRATWFARA GEALTVADYL RVGGLTGAIA QTAEAAYTSL DGAAQQAVRP LLMRMIRLGE NGADTRRRVR RAALLAEVPG PESATVLDAL VAARLVEADA DGDQDGLQIA HEALLRSWPR LREWMDMDRA GAVALQQLRD AAEVWERGGR DPSYLFAGSR LAAARDWMDD DLDVDATTRQ FFDESVRAEA EQQRAAARRT RRLRQLVAAL AVLLLVAASL AGLTFQQSAS SGRARDQALS QRIATQAEAA RRNNPALAAQ LSLVALRTAD TPEARGAVLS SFNGGSGVPT RYQAHTKSVG TVAYSRDGRL LATGSDDWTA AVWDAADPRR LTPLARIPDE RGGGHGRAVK AVAFSRDGTV LATGGADGLA KLWNITDRAR PRLLATLPKA DSEVYGLAFD PTSDRLAVGG YGKSAYIYDV SDPARPEKKG QLFLHLAQVV ALEFSPDGAF LIAGDEGGSA LLWSVSDPSN PRPLKVLVED GGPSSDGAGA IRSITFGGDG HTVYTAGDGG YVRKFSGPDL PRLEYDGRAG TGDAPMTGLA VDPVSGLVGV GGFRYVGVPI FDVDVDQYSL TFLDEGATVW DVAFSPDGRR LASVSVDGSL RVWEMPGPAL IGRNGAQEDA VVNPVTGIVA ITTDKAVELW DVDDPYAPRR LHVLTDVIVD EYDPTGSSAF SPDGNVLAVG TGKNIVFYDV RDPAKPSRIS DVPGPAGGTA ELLFSPDGRT LALGGLNSPP EPAFQARVET WDVTDLSRPR RLASLIAHRS SVRDLTFSPD GRTLVSAAER SVKLWDVTDP RRLRLVSELP EFPGGVWEVR FSPDGRTLAA GGANPFATLW DVTRMDAPRQ IADLPGHSAS VTSVAFSPDG TQLATGSNDN TVRIWDVTEH DSPTLIEKLA RSAGSEAGIE EILYTRDGEK LVGVIFTVPA VVWDLDVNRV RARICERAGV GITAAEWRRF LPDLPYDPVC D
|
| |