Gene Franean1_0573 details

Gene Information

Locus tag: Franean1_0573
Symbol:
ID: 5668990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 660451
End bp: 665550
Gene Length: 5100 bp
Protein Length: 1699 aa
Translation table: 11
GC content: 73%
IMG OID: 641239500
Product: transcriptional activator domain-containing protein
Protein accession: YP_001504938
Protein GI: 158312430
COG category: [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG2319] FOG: WD40 repeat; [COG3629] DNA-binding transcriptional activator of the SARP family
TIGRFAM ID:
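
The coordinate and length fields above agree with each other, which a little arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based inclusive coordinates and that the gene length includes a single stop codon; both are assumptions rather than statements made by this record, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 660451
end_bp = 665550

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the results are 5100 bp and 1699 aa, matching the Gene Length and Protein Length fields.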


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.836093
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACG AGGGCATCGG CGCGGCGGGC GACGAACCGA GCGTGCAGAA CCGAACCCCG 
CCGGCCGGTG GCCGGCCGGC GGGGCTGTCG TTCCAGGTGC TCGGGCCGTT GGAAGTTCGC
CGGGGCGGCA CACCCGTCCA GCTGCGCGGT ACCCGCGAAC GCACACTCCT GGCCATCCTG
CTGTCCGAGC GAAACCGTGC GGTTCCGTTG TCCCGGTTGG TCACCGGTCT GTGGGGAGCA
CAGCCGCCGG CCAGCGCCGA ACGGACGGTG CAGGCCTACG TCGCCCGGCT CCGCCCGATG
CTCGAACCAG AGCGGCCCGA GGCCGGCTGG TCGGTGCTGG TCACCTCACC GACGGGGTAC
TGCCTGCGGG TCGACGACGC CGCGTTCGAC GCGGCGTCGT TCGCGCGGCT GGTCGACCGC
GCGCAGTCGG CTCTGGTGGC CGGGGCACCG GACCTCGCAC TGACCCGGTT CCGTCGGGCG
GAGGCGATGT GGCGGGGCGA TGAGCCCTAC CAGGACCTGG CAGACGTCGC CTACCTCACC
CCGGAGCGGG ACCGGCTCAT CGAACTCCGG CTCACCGCGG CTGCTGACCG GCTCGACGCC
GAACGCGCCC TCGGCCGTGC CGCGGGTACG GTCGGCGACC TACGGCGCCT GGTCGCGGAC
CATCCGACCA ACGAGCGGTT CTGGGGCCAG CTGATGGTGG CGCTCTACGA GTCGCAGCGG
CAGAAGGAGG CACTCGACAC CTATCGCGCC GCGTGGACGC GGCTGGTCGA CGACGGTGGC
ATCGAGCCAG GGCCGAAGCT GCGAGCGTTG CACGACACGA TCGTTTCGCA GCGGCCCGTC
GAGGTCGCCT GGCCGGTACG ACCGGGCGAT GTCCCGGCTC CACTGAGTGC CGACCGCACC
CCGTTGATCG GCAGGGACGG GGCCCAACGC TGGCTGCGGG ACGTGTGGGA CCAGGTACTC
GATGACGTGG GCAGGCTGGT CGTCGTCGAG GCCGAACCGG GCGGCGGAGC CAGCCGCCTG
CTGGCGGAGT TCGCACGGAC GGTGGCGGCG CGCGGCGCCG TCGTCGAGAC GACCTTGACG
CCCGCGCTCA CGACGCAGGT GGCGCAACGT CCCGTGCTTG TCGTCGTAGA CCAGCCAGCC
GGCCCGGAAC CCGCGGAGGC TGTGCCAGCC GGGCTGGGGC CAGCCGGGGC GGAGTCCGCG
GGGGTCGGGC CGGCTGGGAG GGGGCCCGCG TGGGTGAAGC GGGTGGCTGC CGCGGTCGGG
GGGAGCGGCC GCCCGGTACT GGTTGTCGTG GTGGTGGCCG CCTCCGACCG GGCCGCCCAC
CCGGACGAGT GGGCTGACGC GGTCCCCGCC GAGCGGGTGC TGCGACTCGG GCCCCTAGGG
GGCGACGCGG TCGGCAACAT CGTGGCGGGC TACGTCCAAC CCGCCGACCT GGACGAGGCG
ACCTCGGTGA TCCTCGACAC CGCGGGTGGC AACCCGGCCA GGGTCCACGA GGCGGCGGCG
CGATGGGCCG CCGAGCGGGC GAGAGCCCAG GCCGGCCGCT CAGCCGAGCG GATGGCCGCG
GTCCACCGGA CACTGGCGCA GGCCGAGCAG GAGATGCTCG ACGACGTCAC CGACCTGCAG
CGGGTGAGAG CCTCCCGAGC CCCGGGGCCT GACACCGTCG TGTGCCCGTA CAAGGGCCTG
GCCCGCTTCG ACGAGGCGGA CGCCGCCTAC TTCTTCGGCC GGCAACGGCT GATCGCCCAG
CTGGTCACCG GGTGCGTCGC CGCGCCGCTG CTCACGGTCG TCGGCCCCTC CGGAAGCGGG
AAGTCGTCCG TGGTCCGCGC TGGACTGCTC CCCGCGCTGC GTGCGGGTGC GCTGCCCGGC
AGCGAGCGAT GGCGGTATAC GCCGCTGCGG ACCGGCGCGA CCGATGAGGC GGCACTGCGC
GCGGTGCTGC GGGACGGCGG CGCAGAAGAC CCCGACGGCA CCGACCTGCT GGTGTTGGAC
CAGTTCGAGG AGGCCTTCAC CGCCTGGTCG CCGGCGACGC GCACCGCTGT GGTGGACTGG
CTGGTCGGTG AGCTCGAGCT GCGGGACGGG CGGCTGCGCG TCGTCATCAC CGTGCGGGCC
GACTACTACG GCCGTTTCGC CCATCATCCG ACGCTCGCTC GCCTCATCGG GCGTAACACT
CTGCTGGTCG GCCCGATGAC CGACGACGAA CTCTCCCAGG CGATCGAGCA GCCCGCCCGG
GTCGCTGGCC TGGGCGTGGA GGACGGGTTC GTGGCGGCGG TGCTCGGCGA TGCCAAGCAC
GAACCGGGCG CGCTGCCGTT GCTGTCAACC GCCCTGCTGG CGACCTGGGA ACGCCGTGAC
GGCCGGATGC TGCGGACCGC GGCGTACCGG GAGGCTGGTG GCGTCGCCGG AGCGGTCACC
CGGCTTGCCG AAGACGTCTA CAAGGGCCTG GACCCCGGCG AACAGGCGAT CGCCCGCCGG
CTGCTCCTCA GGCTGGTCAG CCCCGGCGAG GACGGCCTGG ATGTGAGGCG GCGGGCCGCC
CGCGACGAGC TCGTGGACAG CGACGCCGCC GACCGCATCC TCACCCTGCT GGTCGAGCGG
CGGCTGGTCA CCGCTGACGA CGACACCGTC GAGGTGGCGC ACGAGGCGCT GCTGCGCGCG
TGGCCACGCC TGCGGGGCTG GCTGGAGGCT GACCGCGATG GCCGTCGTCT GCACCGCCAG
CTCACCGAGG CGGCGGCCGC CTGGCAGCGC AGCGACCAGG ACCCGGGCTA CCTGTACCAC
GGCACCCGGC TGCACGCCCT GCAGGAATGG GCGCAGGCCA ATCCTGGCGA TGCCAACGCA
CTGGAACGCC TGTTCCTGGC CGCTTCGGTC GCCGTCGAGG AGCGTCAGCT GCGGGACGCC
CGCCGCTCGG CGCGTCGATC ACGTTCCTGG GCGTCAGTAC TGGCCATTCT GCTCGTCGTC
GCGCTGACCA TGACAGTGCT GGCGGTGGTG CAGTGGAGTG CAGCGAACCA GCAGGCCAAC
GTCGCCCGCG AAGCCACCAC GCTGTCGCAG GCCGGCCGGT TGGCGACGCT GGCCGCGAAT
CTGGGGCCGG ACCAGGTGGA CCTGGCACTG CTGCTGGGGG TGCAGGGCTA CCAGCTCGCG
CCGTCGCGAG ACACCGAGGG CGGGCTGCAG GCCGCCCTCG CCCGCACCCC GGCCAACCTC
GACCAGATCA TCCGGTTTCC GTCGTCGAGT TTCCTGCCGG CTGTGGTGCC GCCCTCGGTG
AGCCCGGACG GTCGACTGGT CGCCGCCCCC GGTCAGGACG GCACGGTCCG GCTGTGGGAC
CTGCACGCAG CTCGTATCGT CCGCGAGCTG CACTGGCCCA CCGGCCGCCA GCTCGCGGTG
TTCAGCGCGG ACGCGTCGAT GCTCGCGGTC GGAGGAAGCG ACGGCAAGGT CGTGGTGTGG
GAGGTCGCCA CCGGCCGGCA GGTGGGCGCT CCGATCCCGG CCGGCACCGG GCTCGCCTAC
GGCCAGTTCG ACCCGCGTGA CCCCGACCGG TTCTTCGCGG TCGACAACAG CGGACAGATC
GTGGCGTGGG ACCGGTCGGT CCCCGAGCGG CCACGCCAGC TGGGGCAGCC GCTGCGGTTC
CCCGCGGCGC CGGGCGAGAT ACCGATCTTC GTCCTGAACG CGACCGGGAC GCGGATGGCC
GCCGGCGCAT ACGGCAGGCC GACGACGCGG GTGTGGGACA TCGACTCGGG TGCTGTCCTG
CGCGACCTGG CAGGAGCACC GGGATTCTTC GGGGCCGACG GCGTCACTCT GCCCACCTCG
CTGCGGGACC GGGTGACGTT GTGGGACGTC AACACCGGGC TGGCGAAACA GGAACTGACG
GGCCTGTCGG GAGCGGCTCC GGGCATTGTC CTCAGCCAGA ACCTACGACG GCTCGCCGTG
AACGACGGAG GCAACGCCAT CCGGATCTTC GACGTTGGCT CGGGGCGGGA GCAGGTCACG
CTGTCGGCTG CCGGCCGGGC GTCCACTCCC GTCGCGTTCC TCGCTGACGG GCGGCTGATG
ACCAGCGGAG CCGCCGAGGC GGACATCTGG CGCCTCGACC GGGGCGGCTC ACCGCTGGGG
TTCACGCTGG CAGGCCACGA CGGCAGAGTT ACCGGCAGCT TCACGGCGAG CGGGACCGAG
GTCATCACGC AGGGCTTGGA CGACCACCGG GTCCTGGTGT GGGACGCGGC TGACGGACGC
GAGCTGGGCC CTCTGCTCAA CGGCGCGGTG TCCGCTCCAG TCGCGCTCAG CCCCGACGGC
GCCCGGATCG TCGGGGTGGG TGCCAACGGC GTGCTGCGGC TCTGGGACCG GGCCGGGAAA
ACCGAACTGG CGGTGCTCGC CCCGGCGGAC CATCCGGCGG CAGTGGAGTG GAGCCCGGCG
GGTGACCGGG TCGCCGCCGT TCACGCGGGC GGGGTGCTGC TGTGGGACGT CGGCGACCCA
CACCGGCCAC GGCTGGTCGC GGACCTCGAC ACCGCCGGCT CCACCGGCTC CACCGGCCAG
CCGCGGCCGG CGTTCAGCCC CGACGGGCAG CGGCTCGCTG TCACGGACCA ACAGGGCCAC
CGGATCACGA TGTTCGACGC CGCGACTGGC CGCTCGGTGT GGTTGCGGCA ACTGGAGACC
GCGGACCAGG CGACGTTGGC GTTCTCGCCG GACAGCACGA AGATCGCCAT CGGCTTCGGC
ACCATAGCTT CTGGTTTCGT CGAGTTCCTG GACACCTCCG ACGGTACGGT GCGCCGGCAT
CTCAACACCA CAAGCGTGGG AGGCGTGGCC TTCCTGCGCG ACGGCGATCT GGTCATGACG
ACGAGTGACA CCGGTGACCA GAGCTCCGTG CAGCTGTGGG ACGCCACGAC GACGGCCTCC
GTCGGGGAAC CGGCGACGCA GGCCCATGGC GCGGGCATTC TGGCCCGCAG CCCTGACGGG
ATGTCGGTGG TGGCCGGCTC CGACAGGGGC ATCGCGCAGG TGTGGCACGT CGACCTGCCG
GAATGGATGG CGACCGCGTG CCGGATCGCC GGCCGGAACC TGACGAGGGT GGAATGGGAG
CGCTACCTGC CCGGTGAGCC GTACCGGGCC AGCTGCGCTC AATGGCCGCC CACCCCCTGA
 
Protein sequence
MLDEGIGAAG DEPSVQNRTP PAGGRPAGLS FQVLGPLEVR RGGTPVQLRG TRERTLLAIL 
LSERNRAVPL SRLVTGLWGA QPPASAERTV QAYVARLRPM LEPERPEAGW SVLVTSPTGY
CLRVDDAAFD AASFARLVDR AQSALVAGAP DLALTRFRRA EAMWRGDEPY QDLADVAYLT
PERDRLIELR LTAAADRLDA ERALGRAAGT VGDLRRLVAD HPTNERFWGQ LMVALYESQR
QKEALDTYRA AWTRLVDDGG IEPGPKLRAL HDTIVSQRPV EVAWPVRPGD VPAPLSADRT
PLIGRDGAQR WLRDVWDQVL DDVGRLVVVE AEPGGGASRL LAEFARTVAA RGAVVETTLT
PALTTQVAQR PVLVVVDQPA GPEPAEAVPA GLGPAGAESA GVGPAGRGPA WVKRVAAAVG
GSGRPVLVVV VVAASDRAAH PDEWADAVPA ERVLRLGPLG GDAVGNIVAG YVQPADLDEA
TSVILDTAGG NPARVHEAAA RWAAERARAQ AGRSAERMAA VHRTLAQAEQ EMLDDVTDLQ
RVRASRAPGP DTVVCPYKGL ARFDEADAAY FFGRQRLIAQ LVTGCVAAPL LTVVGPSGSG
KSSVVRAGLL PALRAGALPG SERWRYTPLR TGATDEAALR AVLRDGGAED PDGTDLLVLD
QFEEAFTAWS PATRTAVVDW LVGELELRDG RLRVVITVRA DYYGRFAHHP TLARLIGRNT
LLVGPMTDDE LSQAIEQPAR VAGLGVEDGF VAAVLGDAKH EPGALPLLST ALLATWERRD
GRMLRTAAYR EAGGVAGAVT RLAEDVYKGL DPGEQAIARR LLLRLVSPGE DGLDVRRRAA
RDELVDSDAA DRILTLLVER RLVTADDDTV EVAHEALLRA WPRLRGWLEA DRDGRRLHRQ
LTEAAAAWQR SDQDPGYLYH GTRLHALQEW AQANPGDANA LERLFLAASV AVEERQLRDA
RRSARRSRSW ASVLAILLVV ALTMTVLAVV QWSAANQQAN VAREATTLSQ AGRLATLAAN
LGPDQVDLAL LLGVQGYQLA PSRDTEGGLQ AALARTPANL DQIIRFPSSS FLPAVVPPSV
SPDGRLVAAP GQDGTVRLWD LHAARIVREL HWPTGRQLAV FSADASMLAV GGSDGKVVVW
EVATGRQVGA PIPAGTGLAY GQFDPRDPDR FFAVDNSGQI VAWDRSVPER PRQLGQPLRF
PAAPGEIPIF VLNATGTRMA AGAYGRPTTR VWDIDSGAVL RDLAGAPGFF GADGVTLPTS
LRDRVTLWDV NTGLAKQELT GLSGAAPGIV LSQNLRRLAV NDGGNAIRIF DVGSGREQVT
LSAAGRASTP VAFLADGRLM TSGAAEADIW RLDRGGSPLG FTLAGHDGRV TGSFTASGTE
VITQGLDDHR VLVWDAADGR ELGPLLNGAV SAPVALSPDG ARIVGVGANG VLRLWDRAGK
TELAVLAPAD HPAAVEWSPA GDRVAAVHAG GVLLWDVGDP HRPRLVADLD TAGSTGSTGQ
PRPAFSPDGQ RLAVTDQQGH RITMFDAATG RSVWLRQLET ADQATLAFSP DSTKIAIGFG
TIASGFVEFL DTSDGTVRRH LNTTSVGGVA FLRDGDLVMT TSDTGDQSSV QLWDATTTAS
VGEPATQAHG AGILARSPDG MSVVAGSDRG IAQVWHVDLP EWMATACRIA GRNLTRVEWE
RYLPGEPYRA SCAQWPPTP
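
The sequence blocks above are printed in groups of ten characters, so whitespace has to be stripped before they can be used. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein sequence. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table holds the standard amino-acid assignments, which translation table 11 shares; alternative start codons and ambiguous bases are not handled.

```python
from itertools import product

# Standard amino-acid assignments in NCBI codon order (bases T, C, A, G).
# Translation table 11 uses the same assignments; only start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty DNA sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "ATGCTCGACG AGGGCATCGG CGCGGCGGGC GACGAACCGA GCGTGCAGAA CCGAACCCCG"
last_line = "CGCTACCTGC CCGGTGAGCC GTACCGGGCC AGCTGCGCTC AATGGCCGCC CACCCCCTGA"

print(translate(first_line))
print(translate(last_line))
```

The two calls reproduce the first 20 and the last 19 residues of the protein sequence above (MLDEGIGAAGDEPSVQNRTP and RYLPGEPYRASCAQWPPTP). Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.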