Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0276 |
Symbol | |
ID | 3905718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 317381 |
End bp | 320365 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637877604 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_479392 |
Protein GI | 86738992 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.741402 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCACGAG TCTTCGTCAG CCATGCGGGC TCCGACGCGG AGCTGGCCGA CAGAGTGCAT CAGACGCTGA CCCTTCTTGG CCACCGGGTG TTCCTCGCTC AGAACATCCG GGTGGGCGTC CAGCCCGGCG ACATGTGGAA GGATCGGTTG CACCGCGAGC TGCGAGCCGC CGACGCCGTG GTCTGTGTCA TCACCAACGC CTACAACGCC TCCCCGTGGT GCGCCTACGA ACTGGGAATC GCCTACGAGG TCGGCAGCCT GCTGCTGCCG CTGCGCGTGG AGCCGACGGT CACCTCCCCG CTTCTGGAGG ACCGGCAGCA CGTTCACCTG CACGAGAATC CGGGGTGGGC CGACAACCTG AACATTGGGC TGCGGCGCGT CGAGGCCGGC GGCGGGCTGG GCTGGGCGGA CGACCGGTCC CCGTTCCCGG GTCTGCGGCC GTTCACCCGG GACATGGCCA AGGTGTTCTA CGGGCGCGGC AACGAGCTGC GGCGACTCAC CCACCGGCTA CGCTCCCTCG GTGAACAGCG TCTGCTACTG GTGGTCGGCG CCTCCGGATG CGGCAAGTCG TCGCTGGTCA CCGCCGGCCT GCCGGCCCAG CTGGCGGGCG AGCCCGGCTG GGAGGTGGCC GACCCCTTCT GGCCGGGCCG GGACCCGGTC CGCAAGCTCG CCCACACCCT GACCCAGGCT GCGAGGCGTG TCGGCCAGCA GTGGGCGGTC ACCGACATCG ACCGGCAGCT CAGGGAGGAC GACCTGGCCC TGATGGGGCT GGCCGAGGAA CTTCTGACGG CCGGCTCCGG CCCGGTGCGG GACAGCCTGC TGCTACTGAT CATCGATCAG GGCGAGGAGC TGTTCACCCG ATCGGACGAA GAGGAACGGG CCCGGTTGGC GACCCTGTTG CGGCATGGGG TCACCGGTCC CGTGCGGGTG GTCATGGCGC TTCGCTCCGA GTTCCAGGAC CAGGTCTTCG CCCTGCCCGA ACTGAGCGGC CCCCCCGAAC AGGGCGGGGT CAACCTGCAC GTCTTCCCCC TCCGCCCGCT GGCCCGGGAC ATGCTGCCGG TCGTCATCAC CGAGCCGGCC CGGGTAGCCG GCCTGCACGT GGAGCCGGAG CTGGTCGCCA GGATGGTCGC GGACACCGAG GACGGCGAGG CGCTGCCGCT GCTGGCGTTC ACGCTCCATG AGCTGGCCAA GGACCAGACC CGCGGCGGCA CCCTGTCGAT GGACCGGTAC CACGAGCTGG GTGGCGTGCG GGGGGCGCTG GCCCGCCGCG CCGACGCCGT CCTCGACGAC GCCGTGGAGA CGAGCGGGCT GGGCCGGGAG GAGATCCTGG TCGGCATGGC CGAGCTCGCC ACGCTCGACG AGGCCGGTCG GCGGACCCGC CGACGCATCG ACCTCAAGGA CCTGCCGTCC GACGCGTTGC GGAGAACCTT CCAGGTCTTC GTGAACCAAT GGCTGCTGTC CACCCCCAGC GACAGCCTGG GGACCGCCGT CGGGGTCGTC CACGAGGCGT TGCTGACCGC CTGGAAGCCA CTGGATGCCG TCATCCGGGA ACGGGAAGCC GCCCTGCTCA GTGAGGGCAA GGTCGAGCGC GCCGCCGCCG AGTGGGAGCA CAACGACGGC CCGCTCTGGG ACGAGGACCG GCTGACCACC GCCATCACGA CGCTGTGGCA GTCCTATGAG CGTTTCCGCG CCGGTGAGGA GCCGGTGGTC CATCTCAAAC CGGCCGGGCG GAGGTTCATT GACGCGTGCG ACCAGCGGAT CGAGGAAGCC CGTCGCCAGG AACGTACGCG CGTCGAGGCG GCCCGCCGCC GGGAACGTCA CGAACGTCAA CGCCGCCAGC AGGTCATCGC TGTCCTTTCC GTGCTCCTCG TCTTCACCGT CGTCGGGGCC GCGGTCGCCA CCTGGCAGTC CCTCACCGCC CGGCAGGCGC AGCGCACCGC GGAACGGGCG CAGCGCACCA CCGAACGGGC GCGGCACGCC CTGATCACTC ACACCCTGCT CGCCGACGCC GACGCCGCGC GGGCGACGGA CCAGGTGCGC GCGTTACGAC TCGCCATCGC CGCCGAGGAC ATCGACCCCG GTACCGGTGG TGCGGCCGGC GGTAAGGCCG CCAGTGGTGA GGCCGCCAGT GGTGAGGCCG GCGGTAAGGC CGACAGGGGC GCCGACAGCC GGGCGAACTT GCTGCAAACC CTCGCCGAGG ACCCGAACCC ACTCGCCAGC CGCGGCGATC CCGTCACCTC GGTGGCGTTT TCCCCGGATG GACGCATCCT GGCCGTCGGC GGCGCGAACG CAACGGTCGT GCTGTGGGAT GTGACGGAGG GTCAACATCG TCGGATCGAC CCGCCCCTCA CCGGCCAGCA CAGCGGGATC ACGTCGGTGA CGTTCTCGCC GGACAGCCGG AACCTCGACG CGACCGGCGT CGACGGGACG GTGCTGCGGT GGGACGTCAC CGACCCAGAC CGGCCACAAT TGCGGGGGCG TGTGAACGCC GCCGCGGCCC TCCCCACGCC GTGGGATGCG TTCTCCCCGG ACGGGCGCCT CAGAGCCACC GGCCAGGACG GATCAATCAT GCTGGCGGAT GTCACCGCCC CGGATCGGCC CCGGCCGCCC AGCATCCTTC TCGCCGACCC CGGACCGGTC CGGGCCGTGG CCTTCTCCCC GCACGGGCAG GTCCTGGCCG CGGTCGGATC TGGCCGGAGG GTATGGCTCT GGGATACCAG CGTCACCCCG CCGCGGCAGA TCGGGCAGCC GCTGACCGGG CACACCCGCT CCGTCCTGTC GCTGGCGTTT TCCCCGGACG GCGGGACGCT GGCCTCCGGC GGCAACGATG GCACCGTCCG GCTGTGGCGG CTCGCCGCGA TCGACGCGTT CCGCCGCAAC GGCGTCGTCG AGTACGCCTG CCACGAGGCC CAGGGCGGGC TCGACCGGTC GGCGTGGAGC TTCCACATTC CCGGCCTGCC CTACCAGGAC ACCTGCGCGG GGTGA
|
Protein sequence | MARVFVSHAG SDAELADRVH QTLTLLGHRV FLAQNIRVGV QPGDMWKDRL HRELRAADAV VCVITNAYNA SPWCAYELGI AYEVGSLLLP LRVEPTVTSP LLEDRQHVHL HENPGWADNL NIGLRRVEAG GGLGWADDRS PFPGLRPFTR DMAKVFYGRG NELRRLTHRL RSLGEQRLLL VVGASGCGKS SLVTAGLPAQ LAGEPGWEVA DPFWPGRDPV RKLAHTLTQA ARRVGQQWAV TDIDRQLRED DLALMGLAEE LLTAGSGPVR DSLLLLIIDQ GEELFTRSDE EERARLATLL RHGVTGPVRV VMALRSEFQD QVFALPELSG PPEQGGVNLH VFPLRPLARD MLPVVITEPA RVAGLHVEPE LVARMVADTE DGEALPLLAF TLHELAKDQT RGGTLSMDRY HELGGVRGAL ARRADAVLDD AVETSGLGRE EILVGMAELA TLDEAGRRTR RRIDLKDLPS DALRRTFQVF VNQWLLSTPS DSLGTAVGVV HEALLTAWKP LDAVIREREA ALLSEGKVER AAAEWEHNDG PLWDEDRLTT AITTLWQSYE RFRAGEEPVV HLKPAGRRFI DACDQRIEEA RRQERTRVEA ARRRERHERQ RRQQVIAVLS VLLVFTVVGA AVATWQSLTA RQAQRTAERA QRTTERARHA LITHTLLADA DAARATDQVR ALRLAIAAED IDPGTGGAAG GKAASGEAAS GEAGGKADRG ADSRANLLQT LAEDPNPLAS RGDPVTSVAF SPDGRILAVG GANATVVLWD VTEGQHRRID PPLTGQHSGI TSVTFSPDSR NLDATGVDGT VLRWDVTDPD RPQLRGRVNA AAALPTPWDA FSPDGRLRAT GQDGSIMLAD VTAPDRPRPP SILLADPGPV RAVAFSPHGQ VLAAVGSGRR VWLWDTSVTP PRQIGQPLTG HTRSVLSLAF SPDGGTLASG GNDGTVRLWR LAAIDAFRRN GVVEYACHEA QGGLDRSAWS FHIPGLPYQD TCAG
|
| |