Gene Franean1_6249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6249 
Symbol 
ID5674568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7587151 
End bp7590390 
Gene Length3240 bp 
Protein Length1079 aa 
Translation table11 
GC content75% 
IMG OID641245101 
Producttranscriptional regulator 
Protein accessionYP_001510497 
Protein GI158317989 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG3629] DNA-binding transcriptional activator of the SARP family
[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCATCG GCCTCACTCT CCTGGACGGG GTGCACTGGA ACGGCATCCC GGTGGTCGGC 
GACCGGCCGC GGGCGCTGCT GGCGGCGCTG GCGGCGGAGC GCGGGCGTCC GGTGCGGGCC
GAGCGGCTGG TGGAGCTGAT CTGGCGCGGG GAGCCGCTGG CCAATGCGAC CAAGGGTCTT
CAGGTGGTCG TCTCCCGAAC GCGGGCGGCC TGCGGCGGCG CCGCGGTGGT GCGCGACGGC
GAGGGCTACC GGCTCGACCT GCCCCCGACC GAGGTCGACA GCTGCCTGCT CAGCTGGCTC
GTCGCCGAGG CTGACGCCCT CCTCGCGGCT GATCCGCCCG CCGCCGCCGA GCGGGCCCGG
GAAGCGCTGG CGCTCGGCGC GTCGCTGTTA CCGGTGCCGG GGGGCGACCA GGGGCCGCTC
GCCGACCTGC GCCGGGCGGC CGGCCGCGAT CTGGCGACGG CCCGGCTGCT GCTGGCCCGA
GCGGCGAGCC GAACAAACCG GCACGCCGAG GCACTGGGGC TCCTGGAGAC GGCGCACGCC
GAGCGGCCCG ACGACGAGGC CCTGCTCGCC GATCTGCTGC GCAGCGAGGC CGCCGCCCGC
GGGCCGGGCG CGGCTCTGGA GCGCTTCGAG CGCTACCGGC GGGACCTGCG GGAACGGCTC
GGTATCAGCC CCGGGGAGGC GCTCACCCGG CAGCAGCGCG ACCTGCTCGC CGCCGACCAG
CCCGTCCGCG ACGGCATCCG CTACGACGCC ACTCCCCTGC TGGGCCGGCA GCGCGACGTC
GACCGGCTGC GGGCGTTGCT GGACCGGTCA CGGGTGGTGT CGATCGTCGG GCCGGGTGGT
CTCGGCAAGA CCCGGCTGGC GCATGTGCTC GCCCGGGAGA GCACCTTGCC GGTGGTGCAC
GTCGTCCACC TGGTCGGGGT GACGGCGCCC GAGGACCTGC TCGGCGAGGT CGGTTCCGCG
CTCGGTGTCC GCGACTCGAT CGGCGAGCGC CGGGTGCTGT CCCCGGGCCA GCGCGCCGAC
CTGCGCACAC GCATCGCCGC GCAGCTCTCG CGCGGGTCGA GCCTGCTGCT GCTGGACAAC
TGCGAGCATC TCGTCGAGGA GGTCGCCGAG CTGGTCGCCT TCCTCGTCAC GGCCACCGCG
GACGTCCGGG TGCTCACCAC GAGCCGGGCG CCGTTGGCGA TCAGCGCCGA GCGGGTCTAC
CTGCTCGGGG AGCTGGCCCG TACCGACGCG ATCGAGCTTT TCCGCCAGCG CGCCACCGCG
GCCCGGCCCA CCGTCCGGCT CGACCCCGAG GTGGTCGACC GGATCGTCCA CCGGCTCGAC
GGTCTTCCGC TGGCGATCGA GCTGGCGGCG GCGCGGGTCC GCGCGATGTC GGTGGAGGAC
GTCGACCGGC GGCTGGCGGA CCGCTTCGCA CTGCTGCGCG GCGGCGACCG CGGCGCGCCG
GACCGTCACC GCACGCTGTT CGCGGTGATC GACTGGTCGT GGAACCTGCT GGCCGAGCCG
GAACGGCGGT CGCTGCGTCG GCTGGCGCTC TTTCCGGACG GGTTCACCCT CGACGCCGCG
GAGGAGGTGC TCGGCCCCGC TGGTGAGGGG GCCGGCCCGG AGCCCTTCGA CGCGGTCGAC
GCCGTGCAGA ACCTCGTCGA CCAGTCGCTG CTGAGCGTGC GTGAGTCGGC CGACGGGGTC
CGCTACCGGA TGTTGGAGAC CGTCCGCGAG TTCGGTCGGC TGCGGCTGGC CGAGGCCGGC
GAGCAGGTGT CGGCGCGGCG GGCCCAGCGG GCCTGGGCGG TCGGTTACGT CACCCGGCAC
GGCGCGGACC TGCTCAGTGA GCGGCAGTTC GCGACGATCG ACGCGGTCGC GGCCGAGGAG
ACGAACCTCG CCGACGAACT GCGGGCCGCG CTGGTCGACG GCGGTACCGA GGCGGTCGTG
CGGCTGCTGG CGGTGCTCAA TCCGTTCTGG GAGATCCGCG GCGATCACAC GCGGATGATG
GTGCTGGCCA ACGCGGTCGC CGAGGCCCTG CACGACTGGT CCCCGCCGCC AGCGGCGGCC
TCGGCGACCC TGGCCAGCCG GATCGCCGTC CTGCGGACGA AGATGCTCAT GACCGTCAGG
TACTCGGAGG AGGCCCGCGA ACTGCTGGCG CCGCTCGATC CGGCGGACGC GGAGAACACG
TGGCTGGCCG GTTCGGTGCG GGCGCAGCTT GCGCTCGATC CGACCCGACC CGCGGACACC
GTGGCACGGC TGGAACGGCT GGCGGACGAC GCCGACCGGC ACACGCGCCT GGCCGCGCTG
ATGTGGCTGA CCCATCTGTG GGAGAACGAC GGTATCCCGC GGACGGCGGC GGCCTGCGCC
GAACGGGCGC TGGCGCTCGC CGCGCCGCAG GACGGCCCGT GGATCGCGGC CGTGCTGCGC
ACCCAGCTCG CCGCGCTGTC GATGCAGCTC GGGGACGTCA CCACCGCCCG GTCGTGTGCT
CGCGCGGCAC TGCCCGTCCT GGAACGGCTC GGCGCCCGCG ACGACGTCGC GCAGCTCAGG
GTGCTGCTGG CGTTCGACGC CATCAACGAC GGCCGGCTCG AAGCGGCCGA GAGCTACCTC
GCCCAGAGCG CCCCGGAGGA CAGGACGGGG CTCGGCGGCA GGATGATGGC CACCGCCGAG
GCCGAGATCA TGATCGCCCG CGGCGACACG ACTGGTGGTC TACGCCGGTA CGTGGACGCG
GCGGAGGAGA TGCGGGCCCT GCGGCTGCCG GGCGTGGAAC CCACCGGCCT CGAGCCCTGG
GTGCTCGTGT GCAACGCGGC GACGGTGGCC GCGTTCGCCC GGCACGCGAG CACGGCCGAC
GACCTCGCCA CCGGTCACAG CCTGTTTCGG GCCTGCCTCA CCCATTGCGT CGAAGCGTTC
CGGCGCGGGC AGGCCGGCGC CGACTACCCG GTGTACGGGA CGGTGCTGTT CGCACTGGGC
GCCTGGGGTC TTCGGGAGAA CCTCCCGGCG CCCGGAGACT GTCTGGCCGG GATGGCCCCG
GAGGACGCGG TCCGGCTGCT GGCGCTCGCG GAACGGTTCG CCTACCCCGC GACGATCCCG
TCGATGGGGT GGTCGCGGAT CGAGCCGCTC GCCGAGCGCC GCGCGCCCGG CACGCTTGCG
GCGGTCCGGG CGAGCGTCGC CGGACGGCGG CCGGCCCAGG TGCTCGACGA GGCCCGCGCC
CTCGTCGAGC ACCTGGCCGA ACGCCTGACT GGATGGCTGG ATGGCTGGCC GGGCAGTTGA
 
Protein sequence
MGIGLTLLDG VHWNGIPVVG DRPRALLAAL AAERGRPVRA ERLVELIWRG EPLANATKGL 
QVVVSRTRAA CGGAAVVRDG EGYRLDLPPT EVDSCLLSWL VAEADALLAA DPPAAAERAR
EALALGASLL PVPGGDQGPL ADLRRAAGRD LATARLLLAR AASRTNRHAE ALGLLETAHA
ERPDDEALLA DLLRSEAAAR GPGAALERFE RYRRDLRERL GISPGEALTR QQRDLLAADQ
PVRDGIRYDA TPLLGRQRDV DRLRALLDRS RVVSIVGPGG LGKTRLAHVL ARESTLPVVH
VVHLVGVTAP EDLLGEVGSA LGVRDSIGER RVLSPGQRAD LRTRIAAQLS RGSSLLLLDN
CEHLVEEVAE LVAFLVTATA DVRVLTTSRA PLAISAERVY LLGELARTDA IELFRQRATA
ARPTVRLDPE VVDRIVHRLD GLPLAIELAA ARVRAMSVED VDRRLADRFA LLRGGDRGAP
DRHRTLFAVI DWSWNLLAEP ERRSLRRLAL FPDGFTLDAA EEVLGPAGEG AGPEPFDAVD
AVQNLVDQSL LSVRESADGV RYRMLETVRE FGRLRLAEAG EQVSARRAQR AWAVGYVTRH
GADLLSERQF ATIDAVAAEE TNLADELRAA LVDGGTEAVV RLLAVLNPFW EIRGDHTRMM
VLANAVAEAL HDWSPPPAAA SATLASRIAV LRTKMLMTVR YSEEARELLA PLDPADAENT
WLAGSVRAQL ALDPTRPADT VARLERLADD ADRHTRLAAL MWLTHLWEND GIPRTAAACA
ERALALAAPQ DGPWIAAVLR TQLAALSMQL GDVTTARSCA RAALPVLERL GARDDVAQLR
VLLAFDAIND GRLEAAESYL AQSAPEDRTG LGGRMMATAE AEIMIARGDT TGGLRRYVDA
AEEMRALRLP GVEPTGLEPW VLVCNAATVA AFARHASTAD DLATGHSLFR ACLTHCVEAF
RRGQAGADYP VYGTVLFALG AWGLRENLPA PGDCLAGMAP EDAVRLLALA ERFAYPATIP
SMGWSRIEPL AERRAPGTLA AVRASVAGRR PAQVLDEARA LVEHLAERLT GWLDGWPGS