Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4246 |
Symbol | |
ID | 5672601 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5056197 |
End bp | 5059073 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641243119 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001508536 |
Protein GI | 158316028 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.713375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTTT CATCGAACTG GGGGCGAGCC GGCTCCGCAC ATCACTACGA TGCTTTCATC TCCTACACCG AGGCAGACGT CGCGATCGCC CGGGAACTAG AACGCGCAAT CGCCTCGCAT GTTTCTCCGG ACCCCCGACT GAAGAGCCCG AGAGTCTTCC GGGACCGTAC CAAACTAACT ACCGCTCCAC GGCTCGCCGC GGCCATCGAG CAGGCTCTCG ATCGCTCGGA ATACTTCGTG CTGGTGGCCT CACCGGACGT CGTCCATTCC AGGTGGGTAC GGCTGGAGAT CGAGGCGTGG CAGCGGAGGG ACTACCGGTT CGAGCGACTC CTTGTCGTCC TGGCCGGGGA GGATCTTCGC ACGTCGCTCC CCGGCCCACT CCGTGCGTTC TACTTTCCCG AGCGCGTCGA CAGCACCGGG GACGCCCGAC ACACACCCCT GGAGCTCTCC CCCAACTACC TGTCCGTCAA GCCAGTGATC CGCTCACTGC ACAAAAAGCA CGGCAGACGC CCCAGCCGCC CGGGCCGACT GTCCCAGAAC GCGACCGAAG AAGAGATAGG AGAGTTCGAG AGCCAGAGAC AGAAATGCGA GGAATGGGAC CAGAATGCCT CGTCCCTAAT CACCAGCCAT GACGAATTCG CCGCGAGCGC GGGCGAGATC ACGGCACGCC TTTATAATGT CGACAAGGAT GAGCTCGACC AGGCAAAGAT ACGCGAGGAG AGGAAGAAGA AGCGGAAGCG GCGCTGGCTG AGAGTCATTT CGGCGACGGC CACGGCAGGT ATTCTGAGCC TCGTCGTGAT CCTCTTGGCG CTACGGCAGG AGGCGGGCCG GCAGCGTTCG GACGCGGACC AGAAGCGTGC GGTAGCACGC TCCGAGCAGC TTGCCCAGAC TGCCCGCGCC CTGCTTGAGA CCTCGCCGAA CACGGCGAGA CAGATCAGCC TGGCCGCCTA CGACGTCGCT CCGACAGCCA CCGCGCGGAG CGCCATGATG GCGGCCGGCC GGGAGCCGGG AACAATCGAT GCCCGGTCGG TGGGGGGACT GCGCCTCAGT CCAGACGGTC GCCTCCTCGC GATCGTCGAG ACCGCCACGA CCGACAACGT CGTCAGGCTC TGGAATCTTG CCACCCATCG TGTCGCCGCG ACGGTCACCG GCCACCGGGG CAAGGTGAAC GCGGTCGCCT TCAGCCCCGA CGGCCGGCTT TTCGCGACTG CGGGCGACGA TCATACGGCC CGGATCTGGT CGGTTGCCGA CGCCGTCGCC GCGCGTGAGG TCAACGTTCT CCGGCCGGGC CTCGGGCCGC TCACGGCGCT GGATTTCGGA GCCGCGAACC AGTTGGTGCT TGGTAGCGCC CAGATTCGCT CGGTCGCCTC CGATCCAACG AGCCCCGGCA GAGAGATCGA CAGTGCTACG AGTTCGCTGC AGGCCTGGGT TGGCGTCGAG GGGCGAAACC CCGTTGGGGG GCCTGTGCTG ACAATGGACG GCGGGAATGT GACCTCAGTC GATGTCGCAC CCGAGGGCGG GCGTCTGGTC GTTCACGCAG GCGCCACCAT GCTGTGGGAC GTCGGTCCGA CGAGCATGAT CACAGGCGGC CCGCTCACCC TGCCAGGGAC GCGCGACACC GACATCTACG CCGATGACCG CCAGGATCTC GACGGGGACG GCTTCGCCGA CCGTGTCGCC TGTTTCACCA CCGGCGGCCG GGTCGTCGTC TCGGGCCCGC AGGTCTTCGA TGTCGGTGCC GGCACCCTGG TTCGGGCCGC GTCTCCGCTG GGTGCATCAC AGGACAGCCC GCTGGCAGCG GCCTGTGACA GCGGAGTCGT CAGCGCCGCG CCGGACATCC GGGGTGTCCG TGTCTGGTCA ACGCCCGCGA ACGACGCAGG TGCGCTGGTC GCCCGGCCGA TACCCGATCT GGGCGAGGCA CACGGCGCGA CCGCAGTCCT GAGTACGGAG GGCGATCTGC TCGCTTTCGC GGAACCGGGC GGCGCGGTCC GCCTTATCGA CATCCGTGAC CCGGCCCGGC TTGGCCGGGT CCAGATCGAG CAGGCCGAGC CACGGAAGCC GGACCCGGGC ATGTCCTACC GATCGGTCAG ATCGGGTGAG CGCCATTCGA GCCGGAACGG GACGCTGGCG GTGATAATGG ACGACAGTTC CACGACGGTG AGTTTCGGGG TGTGGGACTA CTCGGACTCG GCGCATCCGC GCCGCACGGG AACGATCAAT CTCCCGTCCA GGTCATCACC GGTGCCACCA TCGATCAGCG ATGACGGCAG GCGGCTGGCC ATGTCGAACA CCGTCAACGC GATCGACATC TTCGGGTTGG ACGACCAGGA TGCACTGGTC CACATCGGCC GATACGAGGC GTTCGGCGCG CAGATCCCGG GGGCCCCGGG CGCACGCTAC GCGCCTATTC CCGGAACGTT CAGCGCGAAC GGGCGCATCT ACGCCGCACC GGTCACGGAC GGCCCGCGGT CCATCCAGCT GTGGGAGGTG CACGAGGATC GCCTCACGGC AGTCTCCACC GTCCCGACGG CCGGAGAGGT ACTAGCGCTC GACGACGAGG GCGACATGCT GGCCGTCGCG GATACGGACG GCACCGTGCG GCTCTGGGAC CTGCGGGACA TCCAGGCTCC TGCCCTCTAC GTTGAGTTCA CTGCCCTGCC CGACAGGCCT GTCACCGAGC TACGGTTCAC CGGCGGTGAC GACGCCGGAA TGCGTGTCGG TGTCACGCAG GACGACACGG TCTCGTGGTG GGCTGTCAGT CAGGATGCTC TGGTTGACGA TCTGTGCGCC GAGGTCGGTG ACCTGCTGAG CGCCGACGAA TGGCGCCGAC TCATCGCCGA CGTGCCTGCG CGCCGGCCCT GCGCAGCGCC CGGCTGA
|
Protein sequence | MSVSSNWGRA GSAHHYDAFI SYTEADVAIA RELERAIASH VSPDPRLKSP RVFRDRTKLT TAPRLAAAIE QALDRSEYFV LVASPDVVHS RWVRLEIEAW QRRDYRFERL LVVLAGEDLR TSLPGPLRAF YFPERVDSTG DARHTPLELS PNYLSVKPVI RSLHKKHGRR PSRPGRLSQN ATEEEIGEFE SQRQKCEEWD QNASSLITSH DEFAASAGEI TARLYNVDKD ELDQAKIREE RKKKRKRRWL RVISATATAG ILSLVVILLA LRQEAGRQRS DADQKRAVAR SEQLAQTARA LLETSPNTAR QISLAAYDVA PTATARSAMM AAGREPGTID ARSVGGLRLS PDGRLLAIVE TATTDNVVRL WNLATHRVAA TVTGHRGKVN AVAFSPDGRL FATAGDDHTA RIWSVADAVA AREVNVLRPG LGPLTALDFG AANQLVLGSA QIRSVASDPT SPGREIDSAT SSLQAWVGVE GRNPVGGPVL TMDGGNVTSV DVAPEGGRLV VHAGATMLWD VGPTSMITGG PLTLPGTRDT DIYADDRQDL DGDGFADRVA CFTTGGRVVV SGPQVFDVGA GTLVRAASPL GASQDSPLAA ACDSGVVSAA PDIRGVRVWS TPANDAGALV ARPIPDLGEA HGATAVLSTE GDLLAFAEPG GAVRLIDIRD PARLGRVQIE QAEPRKPDPG MSYRSVRSGE RHSSRNGTLA VIMDDSSTTV SFGVWDYSDS AHPRRTGTIN LPSRSSPVPP SISDDGRRLA MSNTVNAIDI FGLDDQDALV HIGRYEAFGA QIPGAPGARY APIPGTFSAN GRIYAAPVTD GPRSIQLWEV HEDRLTAVST VPTAGEVLAL DDEGDMLAVA DTDGTVRLWD LRDIQAPALY VEFTALPDRP VTELRFTGGD DAGMRVGVTQ DDTVSWWAVS QDALVDDLCA EVGDLLSADE WRRLIADVPA RRPCAAPG
|
| |