Gene Franean1_6600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6600 
Symbol 
ID5674915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8029777 
End bp8034978 
Gene Length5202 bp 
Protein Length1733 aa 
Translation table11 
GC content72% 
IMG OID641245451 
Producttranscriptional activator domain-containing protein 
Protein accessionYP_001510843 
Protein GI158318335 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG2319] FOG: WD40 repeat
[COG3629] DNA-binding transcriptional activator of the SARP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGGG AGGATGCCGA CGCGCCGGCC GGGCGGACCT GGCGGGCTAC GGAGCCGGGC 
GGGTTGACGG TCGTGCCACG CCAACCCGCC CCAGACACCG GCAGTGTCGC TTTCCCCGGT
CTGTGGTTCG GCGTGCTCGG GCCGTTGGAG GTGCGTAGCG ACGGTCGCCC ACTGGCGCTG
CGGGGGCCGC GAGAACGTGC GTTGCTGGGC CTGTTGCTGG CGGAACGTAA TCGGGCTGTT
TCGGTCTCGC GGCTGGTTGA CGGAATTTGG GGGGCGGTGC CGCCGCCGAC GGCCGAAAAG
ACGCTGCAGA GCCATATTTC CCGATTGCGT CGTCTTCTGG AGCCGGAACG CCCGCCAGCG
GGTTGGACAG TACTGGTGAC GGTGCCTTCC GGGTACAGCC TGCGGGCCGA CTTGACTGCC
CTTGACTCGG CATTGTTCGA ACAACTCGTG GAACAGGCGC GGCGGGCTTT TGCGGTCGGT
GCTCCTGAAC TTGCCCGCGC TCGGTTCCGG CGGGCGCTGG GATTGTGGCG GGGGGACGCG
TACGAGGATC TTCTCGACCT CGAGTTCGCG GCGGCGGAAA GGGACCGGCT TGTCGAACTG
CGGCTGGCCG CGGTAGCGGA TCAGATCGAC GCCGAGTTCG CACTCGGCCG GTCGGCGGCA
GTATTCGGTG AGCTGCAGCG GTTGGTTGCC GCCCATCCGT TGCGGGAGCG ATTCCGTAGC
CAGCTGATGA TCGCGCTGTA CGAATCGGGC CGCCAGGCCG ACGCGCTGGA GGCCTACCGG
TCGGCGCGGG CCCGGCTGGT GGCGGAGATC GGCGTGGAGC CGGGCCGGCA GCTTCGAGCG
TTGAACGCGG CCATTCTCGA GGAGCAGCCC AGCCTGCTCC CGGACATCGT GCAACCGCAG
CGCCTTCCGG CGGCGTTGGG TGTCGATCGC CGTGCATTGA TCGGTCGGGA CGTGGAGATG
CGCTGGCTCG AGCGCGTCTG GGACTGCGTC CTCGACGACC GCGGTGAGGT TGTCGTCGTC
CGCGGCGAGC CAGGAATGGG TGTCCGCCGC CTCGTCGCCG AGTTCGCCCG CCGGGTCGCG
GGCCGGGGTG CCGCGGTCGT CCTCGGACCG ATCGACACCG GGACTCTGAC GGGGATCGCA
CAGGAGCGCC CCGTGCTCAT GGTCGTCGAC GTCGACGACG ACCAGCCGGA CACCAGCCCG
GGCTGGGCCG CGGTGCAGGC CTCCGCGGTC GCGCGCCTGC CGGTGCTCGT TGTCGTCACC
GCCCGACCCG GCGACGGCCC GCCGGAGGTA CGGCCCGGCG GGTTCGACGT CGGCGCGCAG
TCGGTCCTGC AGCTCGGCCC GCTCGGGTCG GCTCAGGTCA CCGAGATAGT GGCTGGCTAC
GTCGGTGCAG GCGAGGTGCC GGCCGCGACC GCGGCTGTGA TCGAATCCTC CGGCGGCGTC
CCGGCCCGGG TGCACGCGGA TGCGGCCGAC TGGGCGGCCG TCCGAGCCGG CGAACGGGTC
GGTCAGCGGG CGGCGCGGGT CGCGACCGAT CGATACGACC TGGCCGAGTC GGAACGCGCG
ATGGTGTCTG ACGTGCTCGA CCTGCAGCGG GTTAGGCGTC GTCGTGGCGT CGCCGGTCCG
GCCGGATCCG TCGGCGGCAC AGGAGCCTCG GGCGGCCCGA GCGCGTCGGG CAGCTCAGCT
GGCGGCGCCG GCTCGGGTGA TGCGGCCGAC TTCGGCGCTC CGGTCGGCGC GCCGGGGACG
GTTGTCTGCC CGTACAAGGG GCTGGCGCGT TTTGACGAGC AGGACGCTGC CTATTTCTTT
GGCCGGGAGC GGCTGATCGC GCAGCTCGTG ACCCGCTGCG TCGCGGCACC GCTGCTGGCG
GTGGTGGGTG CGTCGGGAAG CGGGAAATCG TCGGTCGTGC GCGCTGGACT GGTGCCGGCG
ATCCGCTCCG GCGTACTGCC GGGAAGCGAC CGATGGCGGA TCCGGCTGCT GCGGGCCGGC
ACCGTCGAGG CGACCTCGCT GGACGTCGAC ATGGACCTCG ATGGCGATGG CGACGGCGCC
GGTGGCAACG AGCTGCTCAT TCTGGATCAG TTCGAGGAAG TGTTCACCAC ACGGTCGCAG
GACAGCCAGA CCCGTTTGAT CGACCAGCTG GTCGACACGC TGGAACGGGG CGACGGCCGG
CTGCGCGTCG TTCTCACCGT GCGGGCCGAC TACTACGGCC GGTTCGCCGC CCACCCGACG
CTTGCCCGCC TGGTCGCTGA CAACAGCGTG CTCGTCGGTC CGATGGTCGT CGACGAGTTG
CGGCGCGCGA TCGAGGAACC CGCGCTGGTC GCCGGGCTGG AACTCGAGGA GGGGCTCACG
GCGGCGGTAC TGGCCGACGC CCGTCAGGAG CCTGGCGTGT TGCCGCTGTT GTCGACGGCG
CTGCTGGCGA CGTGGGAAGG GCGCGACGGC CGGACGTTGC GCGTAGCCAC CTACCGTGAG
GCCGGCGGTG TGGCCGGAAC GCTGACCAGG CTTGCCGACG GTATGTACGA GAGCCTCGAC
GCCGCTGGTC AGGCGATCGT GCGGCAGCTG TTTCTGCGCC TCGCGACGCT GGGCGAGGAC
GGCGACGATC TGCGGCGTCG CGCGCCACGC TCCGAGCTGG TAGGCAGCGC GCCGGTCGAG
GCCGTTCTCA CGGTGCTGAT CGCGCGACGG CTGGTCATCG CCGCGGACGA CACGGTCGAG
GTCGCGCACG AGGCGTTGCT AAGGGAGTGG CCCCGGCTGC GGGGTTGGCT CGAGACAGAC
CGCGACGGCC GCCGGGTGCA CCGCGAACTG AGCGGGGCAG CCGTGGCGTG GGACGCCGGC
GGGCGGAACC CGGCCGACCT TTACGGCGGT CCCCGGCTGG CCGCCGCACA GGACTGGGCC
GCGGCCCACC CACGCGACGC CAACCCCCTG GAGGAGGAGT TCCTCGCGGC CGCTGGTGCC
GCCCGGGATC GGGCGGCACA GGTCGCTCGA CGGACCACCC AGTGGCTGCG GACGCTCGCC
GCCGGGTTGG CCGTGCTCCT GGTGGTCGCG CTGATCGCGA CAGGGGTGGC CGTCGGGCAG
CGGCACAGGG CCACGCTGCA GGCCGATCGG GCACACAGCG AAGCGGAGCT GGCGCGGGCG
AGCCGGCTCG CGGCGGTCAC CCGCACCCTT GGCCCCGACC AGATCGACCT TGCCCTGCTG
CTGGGCGTCG AGAGCTACCG GCAACAGCCC ACGGTCGAGA CCGAGGGAAA TCTCGAGACC
GCGCTCGTCC ACACCCCGCC CGGCCTCACT CAGCTGATCC GTTTCAGGTC GCCGAGTTTC
TACCCGTCGG TGAGCCCGGA CGGACGACTG CTGGCCTCGC CCGGCCAGGA CGGCACGGTC
GACCTGTGGG ATCTGCAGGC CGGCCGGCTG CTGCGCACGT TCACCTGGCC CACCGCACGC
CAGGTCGCCA TGTTCAGCTC GGACGGGAAG CTGCTCACCG CCGGGGGGAA CGACGGGACC
GTCGTCGTCT GGGACGCCGT CACCGGACGG CAGGTCGGGG CACCGGTCAA GGTCGCCGGC
GGCTTCGTGT ACGGCGAGTT CGACCCCACT GACCCATCTC GCCTTTTCGC TGTCAGCGAC
ACCGGCGAGG TCGCCATCTG GGACCGGGCC ATCCCGGACC GGCCGGTGCA GCTCGGGCCG
CCGCTGCGGT TCACGCCGCG GGCCGGCGCG CTCCCGGTGG CGGTCGTCAG CGCTGACGGC
CGTCGCCTGG CGGCCGGGAC GTCGTCCCCC GACAGCTCGA CGCGGGTGTG GGATCTGGGC
TCCCGCACCG TGATACGGGA CCTTCCCGGC CTCACCGGAT CCTTCAGCCC GGACGGCCGA
TCGCTGGCTA CCGCGTCCGA CGGTCGGGTC GAGGTGTGGG ACGTCGCCTC CGGAGACCGG
CGGGGCGTGC CGCTGACGGG ACTGACCGGA GCGTTGCCAG GAATCCTCTT CAGTCCGGAT
GGGCAGCTCG TGGCCGCCGG CGACGGAGGC AACGTCGTCC GGGTGTTCGA CCTGGCCTCG
GGACTGCAGA TCGGCGCTTC CCTCGCCCTG CACACTGGCG GTCCGGCGCT CCCCACCGCC
TTTCTTCCCG ACGGCCGGGT CGTCACCAGT GGGCCGCACG AGGCCGCGGT CTGGCAGGTG
GGTCGTACGA CGCCGCCGAT CGGCCTGGTG CTCGGAGGCC ATCAGGGTCT GACGTTCGGC
ACGTTCGTCC CGGGTGGCCA GGAGGTTGTC ACCCGCGGCA TCTCCGACAG CGGGCTACGG
CGATGGGACG CCGGCACCGG AAGGGAACTC GGCCGGCTGC TCGACGGCCG GGTCCAGGCA
CCCGTGACGT TCAGCCCCGA CGGCCGTCTG CTCGCCACGG CCGGTGCCGA CGACAACCAG
CTCGCGCTGT GGGACGCGCG CACCGGCGAA CGACTCGCCG GACTCGACGG AACCCGCGGC
GGACGACGCA GCTACGCGGA CTGGAGCCCG GCCGGAGACC GGATCGTGAC CGGCGCCGAC
GGGTCGGTGC GGATCTGGGA CGTCAGCGAC CCCCGCCATC CGGCATCGGT CGAGCAGCTG
GACCCGGACG GCGAGCCCCA TCCGGGCACC CCGGACAGCC AGTGGCCGGT TTTCAGCCAT
GACGGGCAGT GGATCGCGGT CCATGACTTT CCGGGTCACA CGATGACGGT GTTCGACGCG
GCGTCAGGTC GCAAACAGTG GTCGACATCG GCGCCCGTGG CGGACCTGGC GGGAATCGCC
TTCGCGCCGG ACGACAAGAC TCTCGCGATC AGCTACGGCG CCCGGGTCGG TGGTCGAGTC
TCGTTCCACG ACACCGGGAC CGGGCACGTG GAGCGGACCC TGCAGATCCC GAGTGGAGGC
GGAATTGAGT TCCTCCGGGG TGGCAGTGTC GTGATGACAA CGGGCGATTC GGCTGGTAGC
AGCGCTGCGC AGTTGTGGGA CGCGGCCACC TTCGCACCGA TCGGGGAGCC GCTGCCGCAA
CCCGAACCGG GCGCGTACTT CATCTCCCGT GACGCCGAGG GCACCCGAGC GGTGGCTGGC
TCGCAGAACT CGTTCGCGGT GATCTGGGAC GTCGACACCG CACGGTGGCA GACGGTCGCC
TGCGGAATCG CTGGACGCAA CCTCACCCGG GCGGAATGGG ATCAGTACTT TCCCAACCGG
CCGTACAGGC AGACCTGTCC GCAGTGGCCG GCGGGTTCAT GA
 
Protein sequence
MAREDADAPA GRTWRATEPG GLTVVPRQPA PDTGSVAFPG LWFGVLGPLE VRSDGRPLAL 
RGPRERALLG LLLAERNRAV SVSRLVDGIW GAVPPPTAEK TLQSHISRLR RLLEPERPPA
GWTVLVTVPS GYSLRADLTA LDSALFEQLV EQARRAFAVG APELARARFR RALGLWRGDA
YEDLLDLEFA AAERDRLVEL RLAAVADQID AEFALGRSAA VFGELQRLVA AHPLRERFRS
QLMIALYESG RQADALEAYR SARARLVAEI GVEPGRQLRA LNAAILEEQP SLLPDIVQPQ
RLPAALGVDR RALIGRDVEM RWLERVWDCV LDDRGEVVVV RGEPGMGVRR LVAEFARRVA
GRGAAVVLGP IDTGTLTGIA QERPVLMVVD VDDDQPDTSP GWAAVQASAV ARLPVLVVVT
ARPGDGPPEV RPGGFDVGAQ SVLQLGPLGS AQVTEIVAGY VGAGEVPAAT AAVIESSGGV
PARVHADAAD WAAVRAGERV GQRAARVATD RYDLAESERA MVSDVLDLQR VRRRRGVAGP
AGSVGGTGAS GGPSASGSSA GGAGSGDAAD FGAPVGAPGT VVCPYKGLAR FDEQDAAYFF
GRERLIAQLV TRCVAAPLLA VVGASGSGKS SVVRAGLVPA IRSGVLPGSD RWRIRLLRAG
TVEATSLDVD MDLDGDGDGA GGNELLILDQ FEEVFTTRSQ DSQTRLIDQL VDTLERGDGR
LRVVLTVRAD YYGRFAAHPT LARLVADNSV LVGPMVVDEL RRAIEEPALV AGLELEEGLT
AAVLADARQE PGVLPLLSTA LLATWEGRDG RTLRVATYRE AGGVAGTLTR LADGMYESLD
AAGQAIVRQL FLRLATLGED GDDLRRRAPR SELVGSAPVE AVLTVLIARR LVIAADDTVE
VAHEALLREW PRLRGWLETD RDGRRVHREL SGAAVAWDAG GRNPADLYGG PRLAAAQDWA
AAHPRDANPL EEEFLAAAGA ARDRAAQVAR RTTQWLRTLA AGLAVLLVVA LIATGVAVGQ
RHRATLQADR AHSEAELARA SRLAAVTRTL GPDQIDLALL LGVESYRQQP TVETEGNLET
ALVHTPPGLT QLIRFRSPSF YPSVSPDGRL LASPGQDGTV DLWDLQAGRL LRTFTWPTAR
QVAMFSSDGK LLTAGGNDGT VVVWDAVTGR QVGAPVKVAG GFVYGEFDPT DPSRLFAVSD
TGEVAIWDRA IPDRPVQLGP PLRFTPRAGA LPVAVVSADG RRLAAGTSSP DSSTRVWDLG
SRTVIRDLPG LTGSFSPDGR SLATASDGRV EVWDVASGDR RGVPLTGLTG ALPGILFSPD
GQLVAAGDGG NVVRVFDLAS GLQIGASLAL HTGGPALPTA FLPDGRVVTS GPHEAAVWQV
GRTTPPIGLV LGGHQGLTFG TFVPGGQEVV TRGISDSGLR RWDAGTGREL GRLLDGRVQA
PVTFSPDGRL LATAGADDNQ LALWDARTGE RLAGLDGTRG GRRSYADWSP AGDRIVTGAD
GSVRIWDVSD PRHPASVEQL DPDGEPHPGT PDSQWPVFSH DGQWIAVHDF PGHTMTVFDA
ASGRKQWSTS APVADLAGIA FAPDDKTLAI SYGARVGGRV SFHDTGTGHV ERTLQIPSGG
GIEFLRGGSV VMTTGDSAGS SAAQLWDAAT FAPIGEPLPQ PEPGAYFISR DAEGTRAVAG
SQNSFAVIWD VDTARWQTVA CGIAGRNLTR AEWDQYFPNR PYRQTCPQWP AGS