Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0573 |
Symbol | |
ID | 5668990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 660451 |
End bp | 665550 |
Gene Length | 5100 bp |
Protein Length | 1699 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239500 |
Product | transcriptional activator domain-containing protein |
Protein accession | YP_001504938 |
Protein GI | 158312430 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG2319] FOG: WD40 repeat [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.836093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACG AGGGCATCGG CGCGGCGGGC GACGAACCGA GCGTGCAGAA CCGAACCCCG CCGGCCGGTG GCCGGCCGGC GGGGCTGTCG TTCCAGGTGC TCGGGCCGTT GGAAGTTCGC CGGGGCGGCA CACCCGTCCA GCTGCGCGGT ACCCGCGAAC GCACACTCCT GGCCATCCTG CTGTCCGAGC GAAACCGTGC GGTTCCGTTG TCCCGGTTGG TCACCGGTCT GTGGGGAGCA CAGCCGCCGG CCAGCGCCGA ACGGACGGTG CAGGCCTACG TCGCCCGGCT CCGCCCGATG CTCGAACCAG AGCGGCCCGA GGCCGGCTGG TCGGTGCTGG TCACCTCACC GACGGGGTAC TGCCTGCGGG TCGACGACGC CGCGTTCGAC GCGGCGTCGT TCGCGCGGCT GGTCGACCGC GCGCAGTCGG CTCTGGTGGC CGGGGCACCG GACCTCGCAC TGACCCGGTT CCGTCGGGCG GAGGCGATGT GGCGGGGCGA TGAGCCCTAC CAGGACCTGG CAGACGTCGC CTACCTCACC CCGGAGCGGG ACCGGCTCAT CGAACTCCGG CTCACCGCGG CTGCTGACCG GCTCGACGCC GAACGCGCCC TCGGCCGTGC CGCGGGTACG GTCGGCGACC TACGGCGCCT GGTCGCGGAC CATCCGACCA ACGAGCGGTT CTGGGGCCAG CTGATGGTGG CGCTCTACGA GTCGCAGCGG CAGAAGGAGG CACTCGACAC CTATCGCGCC GCGTGGACGC GGCTGGTCGA CGACGGTGGC ATCGAGCCAG GGCCGAAGCT GCGAGCGTTG CACGACACGA TCGTTTCGCA GCGGCCCGTC GAGGTCGCCT GGCCGGTACG ACCGGGCGAT GTCCCGGCTC CACTGAGTGC CGACCGCACC CCGTTGATCG GCAGGGACGG GGCCCAACGC TGGCTGCGGG ACGTGTGGGA CCAGGTACTC GATGACGTGG GCAGGCTGGT CGTCGTCGAG GCCGAACCGG GCGGCGGAGC CAGCCGCCTG CTGGCGGAGT TCGCACGGAC GGTGGCGGCG CGCGGCGCCG TCGTCGAGAC GACCTTGACG CCCGCGCTCA CGACGCAGGT GGCGCAACGT CCCGTGCTTG TCGTCGTAGA CCAGCCAGCC GGCCCGGAAC CCGCGGAGGC TGTGCCAGCC GGGCTGGGGC CAGCCGGGGC GGAGTCCGCG GGGGTCGGGC CGGCTGGGAG GGGGCCCGCG TGGGTGAAGC GGGTGGCTGC CGCGGTCGGG GGGAGCGGCC GCCCGGTACT GGTTGTCGTG GTGGTGGCCG CCTCCGACCG GGCCGCCCAC CCGGACGAGT GGGCTGACGC GGTCCCCGCC GAGCGGGTGC TGCGACTCGG GCCCCTAGGG GGCGACGCGG TCGGCAACAT CGTGGCGGGC TACGTCCAAC CCGCCGACCT GGACGAGGCG ACCTCGGTGA TCCTCGACAC CGCGGGTGGC AACCCGGCCA GGGTCCACGA GGCGGCGGCG CGATGGGCCG CCGAGCGGGC GAGAGCCCAG GCCGGCCGCT CAGCCGAGCG GATGGCCGCG GTCCACCGGA CACTGGCGCA GGCCGAGCAG GAGATGCTCG ACGACGTCAC CGACCTGCAG CGGGTGAGAG CCTCCCGAGC CCCGGGGCCT GACACCGTCG TGTGCCCGTA CAAGGGCCTG GCCCGCTTCG ACGAGGCGGA CGCCGCCTAC TTCTTCGGCC GGCAACGGCT GATCGCCCAG CTGGTCACCG GGTGCGTCGC CGCGCCGCTG CTCACGGTCG TCGGCCCCTC CGGAAGCGGG AAGTCGTCCG TGGTCCGCGC TGGACTGCTC CCCGCGCTGC GTGCGGGTGC GCTGCCCGGC AGCGAGCGAT GGCGGTATAC GCCGCTGCGG ACCGGCGCGA CCGATGAGGC GGCACTGCGC GCGGTGCTGC GGGACGGCGG CGCAGAAGAC CCCGACGGCA CCGACCTGCT GGTGTTGGAC CAGTTCGAGG AGGCCTTCAC CGCCTGGTCG CCGGCGACGC GCACCGCTGT GGTGGACTGG CTGGTCGGTG AGCTCGAGCT GCGGGACGGG CGGCTGCGCG TCGTCATCAC CGTGCGGGCC GACTACTACG GCCGTTTCGC CCATCATCCG ACGCTCGCTC GCCTCATCGG GCGTAACACT CTGCTGGTCG GCCCGATGAC CGACGACGAA CTCTCCCAGG CGATCGAGCA GCCCGCCCGG GTCGCTGGCC TGGGCGTGGA GGACGGGTTC GTGGCGGCGG TGCTCGGCGA TGCCAAGCAC GAACCGGGCG CGCTGCCGTT GCTGTCAACC GCCCTGCTGG CGACCTGGGA ACGCCGTGAC GGCCGGATGC TGCGGACCGC GGCGTACCGG GAGGCTGGTG GCGTCGCCGG AGCGGTCACC CGGCTTGCCG AAGACGTCTA CAAGGGCCTG GACCCCGGCG AACAGGCGAT CGCCCGCCGG CTGCTCCTCA GGCTGGTCAG CCCCGGCGAG GACGGCCTGG ATGTGAGGCG GCGGGCCGCC CGCGACGAGC TCGTGGACAG CGACGCCGCC GACCGCATCC TCACCCTGCT GGTCGAGCGG CGGCTGGTCA CCGCTGACGA CGACACCGTC GAGGTGGCGC ACGAGGCGCT GCTGCGCGCG TGGCCACGCC TGCGGGGCTG GCTGGAGGCT GACCGCGATG GCCGTCGTCT GCACCGCCAG CTCACCGAGG CGGCGGCCGC CTGGCAGCGC AGCGACCAGG ACCCGGGCTA CCTGTACCAC GGCACCCGGC TGCACGCCCT GCAGGAATGG GCGCAGGCCA ATCCTGGCGA TGCCAACGCA CTGGAACGCC TGTTCCTGGC CGCTTCGGTC GCCGTCGAGG AGCGTCAGCT GCGGGACGCC CGCCGCTCGG CGCGTCGATC ACGTTCCTGG GCGTCAGTAC TGGCCATTCT GCTCGTCGTC GCGCTGACCA TGACAGTGCT GGCGGTGGTG CAGTGGAGTG CAGCGAACCA GCAGGCCAAC GTCGCCCGCG AAGCCACCAC GCTGTCGCAG GCCGGCCGGT TGGCGACGCT GGCCGCGAAT CTGGGGCCGG ACCAGGTGGA CCTGGCACTG CTGCTGGGGG TGCAGGGCTA CCAGCTCGCG CCGTCGCGAG ACACCGAGGG CGGGCTGCAG GCCGCCCTCG CCCGCACCCC GGCCAACCTC GACCAGATCA TCCGGTTTCC GTCGTCGAGT TTCCTGCCGG CTGTGGTGCC GCCCTCGGTG AGCCCGGACG GTCGACTGGT CGCCGCCCCC GGTCAGGACG GCACGGTCCG GCTGTGGGAC CTGCACGCAG CTCGTATCGT CCGCGAGCTG CACTGGCCCA CCGGCCGCCA GCTCGCGGTG TTCAGCGCGG ACGCGTCGAT GCTCGCGGTC GGAGGAAGCG ACGGCAAGGT CGTGGTGTGG GAGGTCGCCA CCGGCCGGCA GGTGGGCGCT CCGATCCCGG CCGGCACCGG GCTCGCCTAC GGCCAGTTCG ACCCGCGTGA CCCCGACCGG TTCTTCGCGG TCGACAACAG CGGACAGATC GTGGCGTGGG ACCGGTCGGT CCCCGAGCGG CCACGCCAGC TGGGGCAGCC GCTGCGGTTC CCCGCGGCGC CGGGCGAGAT ACCGATCTTC GTCCTGAACG CGACCGGGAC GCGGATGGCC GCCGGCGCAT ACGGCAGGCC GACGACGCGG GTGTGGGACA TCGACTCGGG TGCTGTCCTG CGCGACCTGG CAGGAGCACC GGGATTCTTC GGGGCCGACG GCGTCACTCT GCCCACCTCG CTGCGGGACC GGGTGACGTT GTGGGACGTC AACACCGGGC TGGCGAAACA GGAACTGACG GGCCTGTCGG GAGCGGCTCC GGGCATTGTC CTCAGCCAGA ACCTACGACG GCTCGCCGTG AACGACGGAG GCAACGCCAT CCGGATCTTC GACGTTGGCT CGGGGCGGGA GCAGGTCACG CTGTCGGCTG CCGGCCGGGC GTCCACTCCC GTCGCGTTCC TCGCTGACGG GCGGCTGATG ACCAGCGGAG CCGCCGAGGC GGACATCTGG CGCCTCGACC GGGGCGGCTC ACCGCTGGGG TTCACGCTGG CAGGCCACGA CGGCAGAGTT ACCGGCAGCT TCACGGCGAG CGGGACCGAG GTCATCACGC AGGGCTTGGA CGACCACCGG GTCCTGGTGT GGGACGCGGC TGACGGACGC GAGCTGGGCC CTCTGCTCAA CGGCGCGGTG TCCGCTCCAG TCGCGCTCAG CCCCGACGGC GCCCGGATCG TCGGGGTGGG TGCCAACGGC GTGCTGCGGC TCTGGGACCG GGCCGGGAAA ACCGAACTGG CGGTGCTCGC CCCGGCGGAC CATCCGGCGG CAGTGGAGTG GAGCCCGGCG GGTGACCGGG TCGCCGCCGT TCACGCGGGC GGGGTGCTGC TGTGGGACGT CGGCGACCCA CACCGGCCAC GGCTGGTCGC GGACCTCGAC ACCGCCGGCT CCACCGGCTC CACCGGCCAG CCGCGGCCGG CGTTCAGCCC CGACGGGCAG CGGCTCGCTG TCACGGACCA ACAGGGCCAC CGGATCACGA TGTTCGACGC CGCGACTGGC CGCTCGGTGT GGTTGCGGCA ACTGGAGACC GCGGACCAGG CGACGTTGGC GTTCTCGCCG GACAGCACGA AGATCGCCAT CGGCTTCGGC ACCATAGCTT CTGGTTTCGT CGAGTTCCTG GACACCTCCG ACGGTACGGT GCGCCGGCAT CTCAACACCA CAAGCGTGGG AGGCGTGGCC TTCCTGCGCG ACGGCGATCT GGTCATGACG ACGAGTGACA CCGGTGACCA GAGCTCCGTG CAGCTGTGGG ACGCCACGAC GACGGCCTCC GTCGGGGAAC CGGCGACGCA GGCCCATGGC GCGGGCATTC TGGCCCGCAG CCCTGACGGG ATGTCGGTGG TGGCCGGCTC CGACAGGGGC ATCGCGCAGG TGTGGCACGT CGACCTGCCG GAATGGATGG CGACCGCGTG CCGGATCGCC GGCCGGAACC TGACGAGGGT GGAATGGGAG CGCTACCTGC CCGGTGAGCC GTACCGGGCC AGCTGCGCTC AATGGCCGCC CACCCCCTGA
|
Protein sequence | MLDEGIGAAG DEPSVQNRTP PAGGRPAGLS FQVLGPLEVR RGGTPVQLRG TRERTLLAIL LSERNRAVPL SRLVTGLWGA QPPASAERTV QAYVARLRPM LEPERPEAGW SVLVTSPTGY CLRVDDAAFD AASFARLVDR AQSALVAGAP DLALTRFRRA EAMWRGDEPY QDLADVAYLT PERDRLIELR LTAAADRLDA ERALGRAAGT VGDLRRLVAD HPTNERFWGQ LMVALYESQR QKEALDTYRA AWTRLVDDGG IEPGPKLRAL HDTIVSQRPV EVAWPVRPGD VPAPLSADRT PLIGRDGAQR WLRDVWDQVL DDVGRLVVVE AEPGGGASRL LAEFARTVAA RGAVVETTLT PALTTQVAQR PVLVVVDQPA GPEPAEAVPA GLGPAGAESA GVGPAGRGPA WVKRVAAAVG GSGRPVLVVV VVAASDRAAH PDEWADAVPA ERVLRLGPLG GDAVGNIVAG YVQPADLDEA TSVILDTAGG NPARVHEAAA RWAAERARAQ AGRSAERMAA VHRTLAQAEQ EMLDDVTDLQ RVRASRAPGP DTVVCPYKGL ARFDEADAAY FFGRQRLIAQ LVTGCVAAPL LTVVGPSGSG KSSVVRAGLL PALRAGALPG SERWRYTPLR TGATDEAALR AVLRDGGAED PDGTDLLVLD QFEEAFTAWS PATRTAVVDW LVGELELRDG RLRVVITVRA DYYGRFAHHP TLARLIGRNT LLVGPMTDDE LSQAIEQPAR VAGLGVEDGF VAAVLGDAKH EPGALPLLST ALLATWERRD GRMLRTAAYR EAGGVAGAVT RLAEDVYKGL DPGEQAIARR LLLRLVSPGE DGLDVRRRAA RDELVDSDAA DRILTLLVER RLVTADDDTV EVAHEALLRA WPRLRGWLEA DRDGRRLHRQ LTEAAAAWQR SDQDPGYLYH GTRLHALQEW AQANPGDANA LERLFLAASV AVEERQLRDA RRSARRSRSW ASVLAILLVV ALTMTVLAVV QWSAANQQAN VAREATTLSQ AGRLATLAAN LGPDQVDLAL LLGVQGYQLA PSRDTEGGLQ AALARTPANL DQIIRFPSSS FLPAVVPPSV SPDGRLVAAP GQDGTVRLWD LHAARIVREL HWPTGRQLAV FSADASMLAV GGSDGKVVVW EVATGRQVGA PIPAGTGLAY GQFDPRDPDR FFAVDNSGQI VAWDRSVPER PRQLGQPLRF PAAPGEIPIF VLNATGTRMA AGAYGRPTTR VWDIDSGAVL RDLAGAPGFF GADGVTLPTS LRDRVTLWDV NTGLAKQELT GLSGAAPGIV LSQNLRRLAV NDGGNAIRIF DVGSGREQVT LSAAGRASTP VAFLADGRLM TSGAAEADIW RLDRGGSPLG FTLAGHDGRV TGSFTASGTE VITQGLDDHR VLVWDAADGR ELGPLLNGAV SAPVALSPDG ARIVGVGANG VLRLWDRAGK TELAVLAPAD HPAAVEWSPA GDRVAAVHAG GVLLWDVGDP HRPRLVADLD TAGSTGSTGQ PRPAFSPDGQ RLAVTDQQGH RITMFDAATG RSVWLRQLET ADQATLAFSP DSTKIAIGFG TIASGFVEFL DTSDGTVRRH LNTTSVGGVA FLRDGDLVMT TSDTGDQSSV QLWDATTTAS VGEPATQAHG AGILARSPDG MSVVAGSDRG IAQVWHVDLP EWMATACRIA GRNLTRVEWE RYLPGEPYRA SCAQWPPTP
|
| |