Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_6249 |
Symbol | |
ID | 5674568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7587151 |
End bp | 7590390 |
Gene Length | 3240 bp |
Protein Length | 1079 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641245101 |
Product | transcriptional regulator |
Protein accession | YP_001510497 |
Protein GI | 158317989 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCATCG GCCTCACTCT CCTGGACGGG GTGCACTGGA ACGGCATCCC GGTGGTCGGC GACCGGCCGC GGGCGCTGCT GGCGGCGCTG GCGGCGGAGC GCGGGCGTCC GGTGCGGGCC GAGCGGCTGG TGGAGCTGAT CTGGCGCGGG GAGCCGCTGG CCAATGCGAC CAAGGGTCTT CAGGTGGTCG TCTCCCGAAC GCGGGCGGCC TGCGGCGGCG CCGCGGTGGT GCGCGACGGC GAGGGCTACC GGCTCGACCT GCCCCCGACC GAGGTCGACA GCTGCCTGCT CAGCTGGCTC GTCGCCGAGG CTGACGCCCT CCTCGCGGCT GATCCGCCCG CCGCCGCCGA GCGGGCCCGG GAAGCGCTGG CGCTCGGCGC GTCGCTGTTA CCGGTGCCGG GGGGCGACCA GGGGCCGCTC GCCGACCTGC GCCGGGCGGC CGGCCGCGAT CTGGCGACGG CCCGGCTGCT GCTGGCCCGA GCGGCGAGCC GAACAAACCG GCACGCCGAG GCACTGGGGC TCCTGGAGAC GGCGCACGCC GAGCGGCCCG ACGACGAGGC CCTGCTCGCC GATCTGCTGC GCAGCGAGGC CGCCGCCCGC GGGCCGGGCG CGGCTCTGGA GCGCTTCGAG CGCTACCGGC GGGACCTGCG GGAACGGCTC GGTATCAGCC CCGGGGAGGC GCTCACCCGG CAGCAGCGCG ACCTGCTCGC CGCCGACCAG CCCGTCCGCG ACGGCATCCG CTACGACGCC ACTCCCCTGC TGGGCCGGCA GCGCGACGTC GACCGGCTGC GGGCGTTGCT GGACCGGTCA CGGGTGGTGT CGATCGTCGG GCCGGGTGGT CTCGGCAAGA CCCGGCTGGC GCATGTGCTC GCCCGGGAGA GCACCTTGCC GGTGGTGCAC GTCGTCCACC TGGTCGGGGT GACGGCGCCC GAGGACCTGC TCGGCGAGGT CGGTTCCGCG CTCGGTGTCC GCGACTCGAT CGGCGAGCGC CGGGTGCTGT CCCCGGGCCA GCGCGCCGAC CTGCGCACAC GCATCGCCGC GCAGCTCTCG CGCGGGTCGA GCCTGCTGCT GCTGGACAAC TGCGAGCATC TCGTCGAGGA GGTCGCCGAG CTGGTCGCCT TCCTCGTCAC GGCCACCGCG GACGTCCGGG TGCTCACCAC GAGCCGGGCG CCGTTGGCGA TCAGCGCCGA GCGGGTCTAC CTGCTCGGGG AGCTGGCCCG TACCGACGCG ATCGAGCTTT TCCGCCAGCG CGCCACCGCG GCCCGGCCCA CCGTCCGGCT CGACCCCGAG GTGGTCGACC GGATCGTCCA CCGGCTCGAC GGTCTTCCGC TGGCGATCGA GCTGGCGGCG GCGCGGGTCC GCGCGATGTC GGTGGAGGAC GTCGACCGGC GGCTGGCGGA CCGCTTCGCA CTGCTGCGCG GCGGCGACCG CGGCGCGCCG GACCGTCACC GCACGCTGTT CGCGGTGATC GACTGGTCGT GGAACCTGCT GGCCGAGCCG GAACGGCGGT CGCTGCGTCG GCTGGCGCTC TTTCCGGACG GGTTCACCCT CGACGCCGCG GAGGAGGTGC TCGGCCCCGC TGGTGAGGGG GCCGGCCCGG AGCCCTTCGA CGCGGTCGAC GCCGTGCAGA ACCTCGTCGA CCAGTCGCTG CTGAGCGTGC GTGAGTCGGC CGACGGGGTC CGCTACCGGA TGTTGGAGAC CGTCCGCGAG TTCGGTCGGC TGCGGCTGGC CGAGGCCGGC GAGCAGGTGT CGGCGCGGCG GGCCCAGCGG GCCTGGGCGG TCGGTTACGT CACCCGGCAC GGCGCGGACC TGCTCAGTGA GCGGCAGTTC GCGACGATCG ACGCGGTCGC GGCCGAGGAG ACGAACCTCG CCGACGAACT GCGGGCCGCG CTGGTCGACG GCGGTACCGA GGCGGTCGTG CGGCTGCTGG CGGTGCTCAA TCCGTTCTGG GAGATCCGCG GCGATCACAC GCGGATGATG GTGCTGGCCA ACGCGGTCGC CGAGGCCCTG CACGACTGGT CCCCGCCGCC AGCGGCGGCC TCGGCGACCC TGGCCAGCCG GATCGCCGTC CTGCGGACGA AGATGCTCAT GACCGTCAGG TACTCGGAGG AGGCCCGCGA ACTGCTGGCG CCGCTCGATC CGGCGGACGC GGAGAACACG TGGCTGGCCG GTTCGGTGCG GGCGCAGCTT GCGCTCGATC CGACCCGACC CGCGGACACC GTGGCACGGC TGGAACGGCT GGCGGACGAC GCCGACCGGC ACACGCGCCT GGCCGCGCTG ATGTGGCTGA CCCATCTGTG GGAGAACGAC GGTATCCCGC GGACGGCGGC GGCCTGCGCC GAACGGGCGC TGGCGCTCGC CGCGCCGCAG GACGGCCCGT GGATCGCGGC CGTGCTGCGC ACCCAGCTCG CCGCGCTGTC GATGCAGCTC GGGGACGTCA CCACCGCCCG GTCGTGTGCT CGCGCGGCAC TGCCCGTCCT GGAACGGCTC GGCGCCCGCG ACGACGTCGC GCAGCTCAGG GTGCTGCTGG CGTTCGACGC CATCAACGAC GGCCGGCTCG AAGCGGCCGA GAGCTACCTC GCCCAGAGCG CCCCGGAGGA CAGGACGGGG CTCGGCGGCA GGATGATGGC CACCGCCGAG GCCGAGATCA TGATCGCCCG CGGCGACACG ACTGGTGGTC TACGCCGGTA CGTGGACGCG GCGGAGGAGA TGCGGGCCCT GCGGCTGCCG GGCGTGGAAC CCACCGGCCT CGAGCCCTGG GTGCTCGTGT GCAACGCGGC GACGGTGGCC GCGTTCGCCC GGCACGCGAG CACGGCCGAC GACCTCGCCA CCGGTCACAG CCTGTTTCGG GCCTGCCTCA CCCATTGCGT CGAAGCGTTC CGGCGCGGGC AGGCCGGCGC CGACTACCCG GTGTACGGGA CGGTGCTGTT CGCACTGGGC GCCTGGGGTC TTCGGGAGAA CCTCCCGGCG CCCGGAGACT GTCTGGCCGG GATGGCCCCG GAGGACGCGG TCCGGCTGCT GGCGCTCGCG GAACGGTTCG CCTACCCCGC GACGATCCCG TCGATGGGGT GGTCGCGGAT CGAGCCGCTC GCCGAGCGCC GCGCGCCCGG CACGCTTGCG GCGGTCCGGG CGAGCGTCGC CGGACGGCGG CCGGCCCAGG TGCTCGACGA GGCCCGCGCC CTCGTCGAGC ACCTGGCCGA ACGCCTGACT GGATGGCTGG ATGGCTGGCC GGGCAGTTGA
|
Protein sequence | MGIGLTLLDG VHWNGIPVVG DRPRALLAAL AAERGRPVRA ERLVELIWRG EPLANATKGL QVVVSRTRAA CGGAAVVRDG EGYRLDLPPT EVDSCLLSWL VAEADALLAA DPPAAAERAR EALALGASLL PVPGGDQGPL ADLRRAAGRD LATARLLLAR AASRTNRHAE ALGLLETAHA ERPDDEALLA DLLRSEAAAR GPGAALERFE RYRRDLRERL GISPGEALTR QQRDLLAADQ PVRDGIRYDA TPLLGRQRDV DRLRALLDRS RVVSIVGPGG LGKTRLAHVL ARESTLPVVH VVHLVGVTAP EDLLGEVGSA LGVRDSIGER RVLSPGQRAD LRTRIAAQLS RGSSLLLLDN CEHLVEEVAE LVAFLVTATA DVRVLTTSRA PLAISAERVY LLGELARTDA IELFRQRATA ARPTVRLDPE VVDRIVHRLD GLPLAIELAA ARVRAMSVED VDRRLADRFA LLRGGDRGAP DRHRTLFAVI DWSWNLLAEP ERRSLRRLAL FPDGFTLDAA EEVLGPAGEG AGPEPFDAVD AVQNLVDQSL LSVRESADGV RYRMLETVRE FGRLRLAEAG EQVSARRAQR AWAVGYVTRH GADLLSERQF ATIDAVAAEE TNLADELRAA LVDGGTEAVV RLLAVLNPFW EIRGDHTRMM VLANAVAEAL HDWSPPPAAA SATLASRIAV LRTKMLMTVR YSEEARELLA PLDPADAENT WLAGSVRAQL ALDPTRPADT VARLERLADD ADRHTRLAAL MWLTHLWEND GIPRTAAACA ERALALAAPQ DGPWIAAVLR TQLAALSMQL GDVTTARSCA RAALPVLERL GARDDVAQLR VLLAFDAIND GRLEAAESYL AQSAPEDRTG LGGRMMATAE AEIMIARGDT TGGLRRYVDA AEEMRALRLP GVEPTGLEPW VLVCNAATVA AFARHASTAD DLATGHSLFR ACLTHCVEAF RRGQAGADYP VYGTVLFALG AWGLRENLPA PGDCLAGMAP EDAVRLLALA ERFAYPATIP SMGWSRIEPL AERRAPGTLA AVRASVAGRR PAQVLDEARA LVEHLAERLT GWLDGWPGS
|
| |