Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4252 |
Symbol | |
ID | 5672607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5068674 |
End bp | 5072948 |
Gene Length | 4275 bp |
Protein Length | 1424 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641243125 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001508542 |
Protein GI | 158316034 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.501692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.871737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGGA TTTTTGTCAG CTTTCACGCG GCCGAGGGCG GGGCGCTGGC TGACCGGGTA GTGCGTGCGC TCGAAGCGGC CGGCTACACC GACTACTACA ACTATCGGGT GCCGGGGCAC CGGACTGCTC CGGGGGCGGC ATGGGCGCCG GACATCAGGA GAAACCTTCT GGTGAGCGAG GCGTTCATCG CCATCACGAC CCCGAAAGCG CTTGACGAGT GGTGTTCAAC CGAGGTCGCG ATCTTCCGTG AACGCAAGCC GCGTGCCCCG TTCCTCGAGA TCGTCGTGGG CGAGGCCTCC CGACGTGCGC TGACCGGAGC CCTGCAGGGA ATCCGGGTCC GCCCGGACAA CGAGGAGTCC GTCGCGGCCG CGTTAGCCAC GGTGATCGCG TTCCTGCAGG CGCGCGGGGT CGGGACCGGG CCCGGTCGAA CCTCGCCCTA CCCCGGCCTT GCGGCATTCG GGGAGCGGGA CGCCGCGGTC TTCTTCGGCC GGGATGACGA CATCGGCAGG CTGGCGGCCG CCATCGAGCC GCGTGCGCAC GGCGCCATGG CGGTCGTCGG CCCTTCCGGG GTTGGAAAGT CGTCGCTGGT TCTGGCCGGC CTGGTGCCTC GGCTGCGTGG TATGCCCTCG TCCGCCGCAC GGACAGCGCG CTGGCATGTC CTCGGCCCGG TCACGCCGAG CACGGGCGCG GTTCGCGGGC TCGCCGAGGC GCTGTCGATC CAGCGCGCCG AGGTCGGCCT GCCCCCGGTC CCCGCGGCGA CACTCGAGGC GCGCATCCGC GCCGACCGCG TTGCTCTGGC CGCCATGCTG GACGAACTCA TAGCCGCGGC GTCCGCCCGA CCGCCCGGCG GACAGATACG TGTGCTCATT GTCCTCGACC AGGCCGAGGA GCTCGTGTTG GCGGCATCCG AGCCGGTCCG GGGGCGGCCC GAGAAAAGGC CGGATGAGCG GTGGAACGAG TCGTCCGGCG GACGGTCCGA CGACCTGTCA GTCCAGCAGG CGGAGGCCGT GGTGCTGACC GAGGCGCTCG TCGAGGCTTC GTACCGCAAC GCCTGGCTGG TATACACGGT GCGGTCCGAC TATCTCGACG ACCTGCTTCG CGGGACGACG TTTCGTGCGC TGATGCAGCA GGCCCATCTG GTCCGTCCTC TGGATCGGCG CGAGTTGCTC GCTGTGGTCA GTGAGCCCGT CCATCAGCTG GGCTGGTCGT ATGAGACGGA TGCGCTGAGC GCGATCGTCG AGGACGGTGC CCGCGGCTCG CTACCAATGC TGGCGTACGC CTTGGCGCGA CTGTGGGACC GGGTCAACCC GGGCGGGGAC CGGGAGCCGC GCGCGATCCG CCGGGGCGAG TACGACCGCG CCGGCCGGGT GCAGGACGTG CTGCGCGAAC AGGCCGAAGC GGCGCTGGCG GAGGCGCTCG AGCAGATCGG CCGGGCCGTC CTCGGCCGTT CAGGCGCGAC GGTCGACCGG CGCGCCGCAG AGCATCGCAT GCTCGACCTC CTCAGCAGGC TGGCGTCGGT CGGCCCGGAC CGGGGCTTTA CCCGGCGGCC GTTGATGATC ACGGAAATTG GCGACGAGGA GCTTGGACTG CTAAGGCCGT TCGTCGAGCG CCGAGTGCTG ACCACCACCC GGTCGCTGTT TGTCGCCCAG GGCACGGAAC AACGGGCTGG GGAGCTCGCG GAACGGGTCC TCGGCATCGC GGACGTCGGC GACAACGCTC TGGAGGTCGC ACACGAGAGC CTGTTCGGCG ACTGGCCGCG GCTGCGTGCC CACCTCGGTC GGCAGCGTGA GGCCCTGCGG GCCCGCGGTG AGCTGGAGGA ACTCGCGGCC GCCTGGCTGG CGGGTGGTAA GCGGGACGAG GACCTCATAG GCCAGTCCCG TGTCCAGTTC CTGCTCGCTG ACCTCCTGCC GTCCCGGGAC GTGCGGGGCG ATCAGACGGT GACGGCGTGG GGATGGCCGA ACCTGGCGGA CGCACTGGTC ACGGCTGGCA TCTCCGCTGA CGCCGTCCAG CTGGCTGGGC TGTCAATCCA GCGGGTTCTG GGCCGGCTCG TCCAGGAGGC GGTGCTGCTG GAGCGGCCGG AGGACGGCCT GCGTCTTCTA GTCGGCGCGG GAACCAACTC GGACCCCGAG ACACTGGCTG TTCAGCTGTC CGCGCCGAGC CTCGAGGGCT GGCGGGCAGC GGTGCACCGA GCCGAGGCTC AGCTGCGGAG CACCAGGCTG CTGACAGACC GTTCCGTCGG CCGTTCCGCG GGTATGTGGG GAGTCGCCTG GTCGCCGGAC GGATCGCGGG TGGCCACGGG CTCACGCGAT GGCGTCATCC GGATCTGGGA TCTGAGCGCC GGCGACCTCG TCAACGCGTT CGCGCACGGC GAGGACCAGA TTGAGCAGGT CGATGGATGG GTGCGTTCGG TTGCCTGGTC ACCGTCCGCG GATCTGGTGG CATCCGCCTC GACCGACGAG ACGACACGAG TGTGGTCACT TGACACGGGG CGAGAGGTCC GGTCGTTCCG CCTGCCGGAC CGTCCCTGGG CAGTCCGGTT CAGCCCGGCC GGCGACCAGG TGCTGACCGC CTGTGCCTCG GGCCATGCGG TGCTCTGGTC CACCGAGCGG CGCAGTCGTC GCCCGGACTA CGAGCTTCTC AGCAAGCCAT CCGAGGCGGC CACGGCGGAG GCCGGTCCGA CCGAGGGGGG CCAGTCTGAT GGAGGCTCTC GGGGTGGGGG CCCTCGGAGC CGGTCGGACC CGGAACGCGG CATCCGAATC TGGGACGCCG ACATCGTGAC CGGGGACGTC ACGCGGATCG CGACGGCCTG CGAGGACGGG GTGGTCGATC TGTGGACCCT GCCCTCGGAT CGGGACGCCG ATGTCCCGGC GCCCCGCCGC ATCGCGGCGC ATCCCGGTGT AACCGTTCGG TCTGTGCGCT TCAGCCCGGG CGGCGGGCGG ATTGCGACCG CGGGGCAGGA CAACATCATC AAGGTTTTCG AGATCGAAGG CGGAGTCGAG CTGCGCCGGA TGTCCGGCCA CACGGACCAG GTCCGCAGGG TCGCGTGGTC ACCGCTCGGC AATCGGGTTG CCAGCGCGTC GGCCGATACC TCCATCGGAG TGTGGGACGT CCATTCCGGC CGGAGGGTCC TCACTCTGCG CGGTCACCGG CAGGGCGTCT GCGACGTGGC CTGGTCCGCC GCCGGCGACC AGCTGATCTC CGTCTCGGAC GACGGGACCG CTCGGGTGTG GAAAATCGGC GCGGTCGAGA CGGCGTTGGT CCTGGTCGGG GCGCCGGCCA GTGCGATGGA CCGGTCACGG GTCACGGGTG AAATCGCGGT GGCCGTTGCC GCGGGCCCGA CCAGCGCTAC CGGCAACAGT TTTGATGTCG TCGTGCTGCG GCCGGACGGC ACGCGGACGC GGACGCTCTC CGCCGCGCAC CGGAACACGA TTCGGTCTGT GTCGTGGTCT CCGTCAGGCA CCCGGCTGCT GACCGCGTCA CGGGACAACA CCGCCCGGAT CTGGCGCGTC GACAACGTGG ACTACGCGCT GGAAAACACG CTGTACGCCC GCGAGGGCGT GGAGGACGCC CGGTGGTCGC CGGACGGCGT CGAACTGGTC ACCGCTGCCC GCGACCGGGT GGTCCGTAGA TACCGTGCCG ACGGTGAGTG GCTCGACCGG GACGAGCGGC GCCCGCACCC GAACTTCCTG CGAGCGGTGG CATGGCACCC GAGCCGTCCC GAGTTCGCAT TGGCGGCGGA GGACAACCGG CTCACCATCA ACACCGCGGA GGCGACCGTG GACGAGCATC GCACTGACAA GGTGCTCACC TCCGTTGCCT GGAGCCCGAC CGGGGAGAAG CTCGCCCTTG GCGCGACAGA CGGCACGGTG CTGGTGCTCG GCGCCGCGGA CGGAAGGATA GCCGGGGAGC TGCACCGACT CGTCGGTCAT GAAGGTGAGA TCAGTACGGT CGACTGGTCC ACGGACGGCG AGCGCATCGT CACCTCGTCC GCGGACCACA CGGCGTCGGT ATGGGACTCA CACACCGGAG CACGGTTGAC CTCGCTGGTC GGGCATACTG GCCCGGTTGT CGGGGCGGTG TGGGCCGTCG ATGACGCCAG CACCACGAAC ATGGTCGTCA CCACCGCGTC CGCCGACGGG ACGATTCGGA CCTGGGACGT CTCCGATATC AGGCGGACGC CGCTCAGCGG CCTGCCGGAC ACGGAGCCCG GCGAGCCCTC GGCCCACGCT ACGGACCAGC TGATACGCGA CGCGCGCGTG CGCCTCGGTC GATAG
|
Protein sequence | MGRIFVSFHA AEGGALADRV VRALEAAGYT DYYNYRVPGH RTAPGAAWAP DIRRNLLVSE AFIAITTPKA LDEWCSTEVA IFRERKPRAP FLEIVVGEAS RRALTGALQG IRVRPDNEES VAAALATVIA FLQARGVGTG PGRTSPYPGL AAFGERDAAV FFGRDDDIGR LAAAIEPRAH GAMAVVGPSG VGKSSLVLAG LVPRLRGMPS SAARTARWHV LGPVTPSTGA VRGLAEALSI QRAEVGLPPV PAATLEARIR ADRVALAAML DELIAAASAR PPGGQIRVLI VLDQAEELVL AASEPVRGRP EKRPDERWNE SSGGRSDDLS VQQAEAVVLT EALVEASYRN AWLVYTVRSD YLDDLLRGTT FRALMQQAHL VRPLDRRELL AVVSEPVHQL GWSYETDALS AIVEDGARGS LPMLAYALAR LWDRVNPGGD REPRAIRRGE YDRAGRVQDV LREQAEAALA EALEQIGRAV LGRSGATVDR RAAEHRMLDL LSRLASVGPD RGFTRRPLMI TEIGDEELGL LRPFVERRVL TTTRSLFVAQ GTEQRAGELA ERVLGIADVG DNALEVAHES LFGDWPRLRA HLGRQREALR ARGELEELAA AWLAGGKRDE DLIGQSRVQF LLADLLPSRD VRGDQTVTAW GWPNLADALV TAGISADAVQ LAGLSIQRVL GRLVQEAVLL ERPEDGLRLL VGAGTNSDPE TLAVQLSAPS LEGWRAAVHR AEAQLRSTRL LTDRSVGRSA GMWGVAWSPD GSRVATGSRD GVIRIWDLSA GDLVNAFAHG EDQIEQVDGW VRSVAWSPSA DLVASASTDE TTRVWSLDTG REVRSFRLPD RPWAVRFSPA GDQVLTACAS GHAVLWSTER RSRRPDYELL SKPSEAATAE AGPTEGGQSD GGSRGGGPRS RSDPERGIRI WDADIVTGDV TRIATACEDG VVDLWTLPSD RDADVPAPRR IAAHPGVTVR SVRFSPGGGR IATAGQDNII KVFEIEGGVE LRRMSGHTDQ VRRVAWSPLG NRVASASADT SIGVWDVHSG RRVLTLRGHR QGVCDVAWSA AGDQLISVSD DGTARVWKIG AVETALVLVG APASAMDRSR VTGEIAVAVA AGPTSATGNS FDVVVLRPDG TRTRTLSAAH RNTIRSVSWS PSGTRLLTAS RDNTARIWRV DNVDYALENT LYAREGVEDA RWSPDGVELV TAARDRVVRR YRADGEWLDR DERRPHPNFL RAVAWHPSRP EFALAAEDNR LTINTAEATV DEHRTDKVLT SVAWSPTGEK LALGATDGTV LVLGAADGRI AGELHRLVGH EGEISTVDWS TDGERIVTSS ADHTASVWDS HTGARLTSLV GHTGPVVGAV WAVDDASTTN MVVTTASADG TIRTWDVSDI RRTPLSGLPD TEPGEPSAHA TDQLIRDARV RLGR
|
| |