Gene Information |
Locus tag | Franean1_6600 |
Symbol | |
ID | 5674915 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8029777 |
End bp | 8034978 |
Gene Length | 5202 bp |
Protein Length | 1733 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641245451 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001510843 |
Protein GI | 158318335 |
COG category | [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG2319] FOG: WD40 repeat; [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
| |
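The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one stop codon, for an unspliced CDS (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(8029777, 8034978)
print(length)                     # 5202
print(protein_length_aa(length))  # 1733
```

Both results match the "Gene Length" and "Protein Length" fields of the record.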
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGGG AGGATGCCGA CGCGCCGGCC GGGCGGACCT GGCGGGCTAC GGAGCCGGGC GGGTTGACGG TCGTGCCACG CCAACCCGCC CCAGACACCG GCAGTGTCGC TTTCCCCGGT CTGTGGTTCG GCGTGCTCGG GCCGTTGGAG GTGCGTAGCG ACGGTCGCCC ACTGGCGCTG CGGGGGCCGC GAGAACGTGC GTTGCTGGGC CTGTTGCTGG CGGAACGTAA TCGGGCTGTT TCGGTCTCGC GGCTGGTTGA CGGAATTTGG GGGGCGGTGC CGCCGCCGAC GGCCGAAAAG ACGCTGCAGA GCCATATTTC CCGATTGCGT CGTCTTCTGG AGCCGGAACG CCCGCCAGCG GGTTGGACAG TACTGGTGAC GGTGCCTTCC GGGTACAGCC TGCGGGCCGA CTTGACTGCC CTTGACTCGG CATTGTTCGA ACAACTCGTG GAACAGGCGC GGCGGGCTTT TGCGGTCGGT GCTCCTGAAC TTGCCCGCGC TCGGTTCCGG CGGGCGCTGG GATTGTGGCG GGGGGACGCG TACGAGGATC TTCTCGACCT CGAGTTCGCG GCGGCGGAAA GGGACCGGCT TGTCGAACTG CGGCTGGCCG CGGTAGCGGA TCAGATCGAC GCCGAGTTCG CACTCGGCCG GTCGGCGGCA GTATTCGGTG AGCTGCAGCG GTTGGTTGCC GCCCATCCGT TGCGGGAGCG ATTCCGTAGC CAGCTGATGA TCGCGCTGTA CGAATCGGGC CGCCAGGCCG ACGCGCTGGA GGCCTACCGG TCGGCGCGGG CCCGGCTGGT GGCGGAGATC GGCGTGGAGC CGGGCCGGCA GCTTCGAGCG TTGAACGCGG CCATTCTCGA GGAGCAGCCC AGCCTGCTCC CGGACATCGT GCAACCGCAG CGCCTTCCGG CGGCGTTGGG TGTCGATCGC CGTGCATTGA TCGGTCGGGA CGTGGAGATG CGCTGGCTCG AGCGCGTCTG GGACTGCGTC CTCGACGACC GCGGTGAGGT TGTCGTCGTC CGCGGCGAGC CAGGAATGGG TGTCCGCCGC CTCGTCGCCG AGTTCGCCCG CCGGGTCGCG GGCCGGGGTG CCGCGGTCGT CCTCGGACCG ATCGACACCG GGACTCTGAC GGGGATCGCA CAGGAGCGCC CCGTGCTCAT GGTCGTCGAC GTCGACGACG ACCAGCCGGA CACCAGCCCG GGCTGGGCCG CGGTGCAGGC CTCCGCGGTC GCGCGCCTGC CGGTGCTCGT TGTCGTCACC GCCCGACCCG GCGACGGCCC GCCGGAGGTA CGGCCCGGCG GGTTCGACGT CGGCGCGCAG TCGGTCCTGC AGCTCGGCCC GCTCGGGTCG GCTCAGGTCA CCGAGATAGT GGCTGGCTAC GTCGGTGCAG GCGAGGTGCC GGCCGCGACC GCGGCTGTGA TCGAATCCTC CGGCGGCGTC CCGGCCCGGG TGCACGCGGA TGCGGCCGAC TGGGCGGCCG TCCGAGCCGG CGAACGGGTC GGTCAGCGGG CGGCGCGGGT CGCGACCGAT CGATACGACC TGGCCGAGTC GGAACGCGCG ATGGTGTCTG ACGTGCTCGA CCTGCAGCGG GTTAGGCGTC GTCGTGGCGT CGCCGGTCCG GCCGGATCCG TCGGCGGCAC AGGAGCCTCG GGCGGCCCGA GCGCGTCGGG CAGCTCAGCT GGCGGCGCCG GCTCGGGTGA TGCGGCCGAC TTCGGCGCTC CGGTCGGCGC GCCGGGGACG GTTGTCTGCC CGTACAAGGG GCTGGCGCGT TTTGACGAGC AGGACGCTGC CTATTTCTTT 
GGCCGGGAGC GGCTGATCGC GCAGCTCGTG ACCCGCTGCG TCGCGGCACC GCTGCTGGCG GTGGTGGGTG CGTCGGGAAG CGGGAAATCG TCGGTCGTGC GCGCTGGACT GGTGCCGGCG ATCCGCTCCG GCGTACTGCC GGGAAGCGAC CGATGGCGGA TCCGGCTGCT GCGGGCCGGC ACCGTCGAGG CGACCTCGCT GGACGTCGAC ATGGACCTCG ATGGCGATGG CGACGGCGCC GGTGGCAACG AGCTGCTCAT TCTGGATCAG TTCGAGGAAG TGTTCACCAC ACGGTCGCAG GACAGCCAGA CCCGTTTGAT CGACCAGCTG GTCGACACGC TGGAACGGGG CGACGGCCGG CTGCGCGTCG TTCTCACCGT GCGGGCCGAC TACTACGGCC GGTTCGCCGC CCACCCGACG CTTGCCCGCC TGGTCGCTGA CAACAGCGTG CTCGTCGGTC CGATGGTCGT CGACGAGTTG CGGCGCGCGA TCGAGGAACC CGCGCTGGTC GCCGGGCTGG AACTCGAGGA GGGGCTCACG GCGGCGGTAC TGGCCGACGC CCGTCAGGAG CCTGGCGTGT TGCCGCTGTT GTCGACGGCG CTGCTGGCGA CGTGGGAAGG GCGCGACGGC CGGACGTTGC GCGTAGCCAC CTACCGTGAG GCCGGCGGTG TGGCCGGAAC GCTGACCAGG CTTGCCGACG GTATGTACGA GAGCCTCGAC GCCGCTGGTC AGGCGATCGT GCGGCAGCTG TTTCTGCGCC TCGCGACGCT GGGCGAGGAC GGCGACGATC TGCGGCGTCG CGCGCCACGC TCCGAGCTGG TAGGCAGCGC GCCGGTCGAG GCCGTTCTCA CGGTGCTGAT CGCGCGACGG CTGGTCATCG CCGCGGACGA CACGGTCGAG GTCGCGCACG AGGCGTTGCT AAGGGAGTGG CCCCGGCTGC GGGGTTGGCT CGAGACAGAC CGCGACGGCC GCCGGGTGCA CCGCGAACTG AGCGGGGCAG CCGTGGCGTG GGACGCCGGC GGGCGGAACC CGGCCGACCT TTACGGCGGT CCCCGGCTGG CCGCCGCACA GGACTGGGCC GCGGCCCACC CACGCGACGC CAACCCCCTG GAGGAGGAGT TCCTCGCGGC CGCTGGTGCC GCCCGGGATC GGGCGGCACA GGTCGCTCGA CGGACCACCC AGTGGCTGCG GACGCTCGCC GCCGGGTTGG CCGTGCTCCT GGTGGTCGCG CTGATCGCGA CAGGGGTGGC CGTCGGGCAG CGGCACAGGG CCACGCTGCA GGCCGATCGG GCACACAGCG AAGCGGAGCT GGCGCGGGCG AGCCGGCTCG CGGCGGTCAC CCGCACCCTT GGCCCCGACC AGATCGACCT TGCCCTGCTG CTGGGCGTCG AGAGCTACCG GCAACAGCCC ACGGTCGAGA CCGAGGGAAA TCTCGAGACC GCGCTCGTCC ACACCCCGCC CGGCCTCACT CAGCTGATCC GTTTCAGGTC GCCGAGTTTC TACCCGTCGG TGAGCCCGGA CGGACGACTG CTGGCCTCGC CCGGCCAGGA CGGCACGGTC GACCTGTGGG ATCTGCAGGC CGGCCGGCTG CTGCGCACGT TCACCTGGCC CACCGCACGC CAGGTCGCCA TGTTCAGCTC GGACGGGAAG CTGCTCACCG CCGGGGGGAA CGACGGGACC GTCGTCGTCT GGGACGCCGT CACCGGACGG CAGGTCGGGG CACCGGTCAA GGTCGCCGGC GGCTTCGTGT ACGGCGAGTT CGACCCCACT GACCCATCTC GCCTTTTCGC TGTCAGCGAC ACCGGCGAGG 
TCGCCATCTG GGACCGGGCC ATCCCGGACC GGCCGGTGCA GCTCGGGCCG CCGCTGCGGT TCACGCCGCG GGCCGGCGCG CTCCCGGTGG CGGTCGTCAG CGCTGACGGC CGTCGCCTGG CGGCCGGGAC GTCGTCCCCC GACAGCTCGA CGCGGGTGTG GGATCTGGGC TCCCGCACCG TGATACGGGA CCTTCCCGGC CTCACCGGAT CCTTCAGCCC GGACGGCCGA TCGCTGGCTA CCGCGTCCGA CGGTCGGGTC GAGGTGTGGG ACGTCGCCTC CGGAGACCGG CGGGGCGTGC CGCTGACGGG ACTGACCGGA GCGTTGCCAG GAATCCTCTT CAGTCCGGAT GGGCAGCTCG TGGCCGCCGG CGACGGAGGC AACGTCGTCC GGGTGTTCGA CCTGGCCTCG GGACTGCAGA TCGGCGCTTC CCTCGCCCTG CACACTGGCG GTCCGGCGCT CCCCACCGCC TTTCTTCCCG ACGGCCGGGT CGTCACCAGT GGGCCGCACG AGGCCGCGGT CTGGCAGGTG GGTCGTACGA CGCCGCCGAT CGGCCTGGTG CTCGGAGGCC ATCAGGGTCT GACGTTCGGC ACGTTCGTCC CGGGTGGCCA GGAGGTTGTC ACCCGCGGCA TCTCCGACAG CGGGCTACGG CGATGGGACG CCGGCACCGG AAGGGAACTC GGCCGGCTGC TCGACGGCCG GGTCCAGGCA CCCGTGACGT TCAGCCCCGA CGGCCGTCTG CTCGCCACGG CCGGTGCCGA CGACAACCAG CTCGCGCTGT GGGACGCGCG CACCGGCGAA CGACTCGCCG GACTCGACGG AACCCGCGGC GGACGACGCA GCTACGCGGA CTGGAGCCCG GCCGGAGACC GGATCGTGAC CGGCGCCGAC GGGTCGGTGC GGATCTGGGA CGTCAGCGAC CCCCGCCATC CGGCATCGGT CGAGCAGCTG GACCCGGACG GCGAGCCCCA TCCGGGCACC CCGGACAGCC AGTGGCCGGT TTTCAGCCAT GACGGGCAGT GGATCGCGGT CCATGACTTT CCGGGTCACA CGATGACGGT GTTCGACGCG GCGTCAGGTC GCAAACAGTG GTCGACATCG GCGCCCGTGG CGGACCTGGC GGGAATCGCC TTCGCGCCGG ACGACAAGAC TCTCGCGATC AGCTACGGCG CCCGGGTCGG TGGTCGAGTC TCGTTCCACG ACACCGGGAC CGGGCACGTG GAGCGGACCC TGCAGATCCC GAGTGGAGGC GGAATTGAGT TCCTCCGGGG TGGCAGTGTC GTGATGACAA CGGGCGATTC GGCTGGTAGC AGCGCTGCGC AGTTGTGGGA CGCGGCCACC TTCGCACCGA TCGGGGAGCC GCTGCCGCAA CCCGAACCGG GCGCGTACTT CATCTCCCGT GACGCCGAGG GCACCCGAGC GGTGGCTGGC TCGCAGAACT CGTTCGCGGT GATCTGGGAC GTCGACACCG CACGGTGGCA GACGGTCGCC TGCGGAATCG CTGGACGCAA CCTCACCCGG GCGGAATGGG ATCAGTACTT TCCCAACCGG CCGTACAGGC AGACCTGTCC GCAGTGGCCG GCGGGTTCAT GA
|
Protein sequence | MAREDADAPA GRTWRATEPG GLTVVPRQPA PDTGSVAFPG LWFGVLGPLE VRSDGRPLAL RGPRERALLG LLLAERNRAV SVSRLVDGIW GAVPPPTAEK TLQSHISRLR RLLEPERPPA GWTVLVTVPS GYSLRADLTA LDSALFEQLV EQARRAFAVG APELARARFR RALGLWRGDA YEDLLDLEFA AAERDRLVEL RLAAVADQID AEFALGRSAA VFGELQRLVA AHPLRERFRS QLMIALYESG RQADALEAYR SARARLVAEI GVEPGRQLRA LNAAILEEQP SLLPDIVQPQ RLPAALGVDR RALIGRDVEM RWLERVWDCV LDDRGEVVVV RGEPGMGVRR LVAEFARRVA GRGAAVVLGP IDTGTLTGIA QERPVLMVVD VDDDQPDTSP GWAAVQASAV ARLPVLVVVT ARPGDGPPEV RPGGFDVGAQ SVLQLGPLGS AQVTEIVAGY VGAGEVPAAT AAVIESSGGV PARVHADAAD WAAVRAGERV GQRAARVATD RYDLAESERA MVSDVLDLQR VRRRRGVAGP AGSVGGTGAS GGPSASGSSA GGAGSGDAAD FGAPVGAPGT VVCPYKGLAR FDEQDAAYFF GRERLIAQLV TRCVAAPLLA VVGASGSGKS SVVRAGLVPA IRSGVLPGSD RWRIRLLRAG TVEATSLDVD MDLDGDGDGA GGNELLILDQ FEEVFTTRSQ DSQTRLIDQL VDTLERGDGR LRVVLTVRAD YYGRFAAHPT LARLVADNSV LVGPMVVDEL RRAIEEPALV AGLELEEGLT AAVLADARQE PGVLPLLSTA LLATWEGRDG RTLRVATYRE AGGVAGTLTR LADGMYESLD AAGQAIVRQL FLRLATLGED GDDLRRRAPR SELVGSAPVE AVLTVLIARR LVIAADDTVE VAHEALLREW PRLRGWLETD RDGRRVHREL SGAAVAWDAG GRNPADLYGG PRLAAAQDWA AAHPRDANPL EEEFLAAAGA ARDRAAQVAR RTTQWLRTLA AGLAVLLVVA LIATGVAVGQ RHRATLQADR AHSEAELARA SRLAAVTRTL GPDQIDLALL LGVESYRQQP TVETEGNLET ALVHTPPGLT QLIRFRSPSF YPSVSPDGRL LASPGQDGTV DLWDLQAGRL LRTFTWPTAR QVAMFSSDGK LLTAGGNDGT VVVWDAVTGR QVGAPVKVAG GFVYGEFDPT DPSRLFAVSD TGEVAIWDRA IPDRPVQLGP PLRFTPRAGA LPVAVVSADG RRLAAGTSSP DSSTRVWDLG SRTVIRDLPG LTGSFSPDGR SLATASDGRV EVWDVASGDR RGVPLTGLTG ALPGILFSPD GQLVAAGDGG NVVRVFDLAS GLQIGASLAL HTGGPALPTA FLPDGRVVTS GPHEAAVWQV GRTTPPIGLV LGGHQGLTFG TFVPGGQEVV TRGISDSGLR RWDAGTGREL GRLLDGRVQA PVTFSPDGRL LATAGADDNQ LALWDARTGE RLAGLDGTRG GRRSYADWSP AGDRIVTGAD GSVRIWDVSD PRHPASVEQL DPDGEPHPGT PDSQWPVFSH DGQWIAVHDF PGHTMTVFDA ASGRKQWSTS APVADLAGIA FAPDDKTLAI SYGARVGGRV SFHDTGTGHV ERTLQIPSGG GIEFLRGGSV VMTTGDSAGS SAAQLWDAAT FAPIGEPLPQ PEPGAYFISR DAEGTRAVAG SQNSFAVIWD VDTARWQTVA CGIAGRNLTR AEWDQYFPNR PYRQTCPQWP AGS
|
| |
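The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence in the record.
blocks = "ATGGCCCGGG AGGATGCCGA"
seq = clean_sequence(blocks)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.7
```

Applying the same two helpers to the full gene sequence would allow the reported "GC content" and "Gene Length" fields to be compared against the sequence text itself.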