Gene Information |
Locus tag | Franean1_4842 |
Symbol | |
ID | 5673183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5806001 |
End bp | 5808940 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243698 |
Product | SARP family transcriptional regulator |
Protein accession | YP_001509114 |
Protein GI | 158316606 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
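The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(5806001, 5808940)
print(length, protein_length_aa(length))  # prints: 2940 979
```

Both results agree with the Gene Length (2940 bp) and Protein Length (979 aa) fields. The strand does not affect the length calculation, only the orientation in which the sequence is read.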
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0556456 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGATTCC ACGTGCTGGG GCCGCTGGAA GTGATCCGGG ACGGCCAGCC CATCGTCGTG CCCGGCGTCC ATCAGCGTGC CACGCTCGGT TTCCTGCTGT TGCATCCGAA CACGGCGGTG GCCACCAGCA GGCTGCTCCA GGCCCTGTGG GACCAGGAAC CGCCGGTGAC CGCGCGCAAG ATGGTGCAGA ACGCGGTGGC GGGCCTTCGC CGGACGCTGG GCGGCGGGGC GGACGTGCTG ACCCGCCCAC CTGGATACCA GCTGAGCATC CGCGCCGACC AGGTGGACCT GGCCGCCTTC CAGGAACTGA GCGCGCGTGG CCGGGCGGAG CTGGCCGCCG GTTCGTGGGC TGCCGCTGCC GCGACCCTGT CGGCCGCGCT CGGCCACTGG CGTGGGCCCG CGCTGGCCGA CCTGGCAGAG GAGGGCCTGG CCTGGCCGGA GGCCGTCGCC CTGGACAGCG CACGCATGGT CACCTTCGAG GACTGGGCGG AGGCGCAGCT CGCCCTCGGC CACCACCAGG AGCTCGTGCC CGAACTGGAG TCGGCCGTCG AGACGACCCC GCTGCGGGAG CGGCTGACCG GGCTGCTGAT GTTGGCGCTC TACCGCTGCG GCCGGCAGGC GGACGCGCTC GAGCGCTACC GGCGCACCCG GGGCGCTCTC GTCGACCAGC TCGGTCTCGA TCCCGGGCAC GAGCTGCAGA ACCTGGAACG GGCGATCCTC GACCACGAGC CCTGGTTGAG CTCACGGCCG ACGGGCGCGC TCACCTCCCG CCACGCCGGG GTCCTCGCTC CGGGCCGGAT GATCGCGGGC AGCGGCGCTC TCGCGGGGAC TGGCGCTCTC GCGAGGGCCG GTGCGGTCGC GGGAACCGGT GGCCTGGCGG CCGGCGGTGT GGTCGCGGCG GCCGGAGTGG CTGGCATGGG CGGCGACGGC GGCGCGGCCG GTGCGCGGCT GCCGGAGGTC TTCCCGGCCC GGGCTCCCGG CGTGCACGCC GCCCGGAAAT GGGTCAGTGT CCTCGCCGGT CGACTCGACA TCGGGTCGGG GGCGGAGCTC TCCGATCCGG AGGACGCCGA CCGGGTCCTG CGGGCCGTCA CCGCCACGGT CCGGGCCGAG ACCGCCCGTT TCGGCGGCGA GGTCCACACG GCGCTCGGCT CGGTGTGGAT CGCGGTCTTC GGCCTGCCCC GAACCCGGGA GGACGACGCG GAGCGCGCCG TCCGCGCGGG CCTGGCCATC CGGCGGGCGG TCGCCACATC GGCGAAGGCG TCCTCGTACG GTGTCGGACA GTACGACCTG CGGATGGCTG TCACTACCGG GGAGGTCCTC GCAACCCTCA CCCAGGCCAC GCCGGCGGCC GCGGCTTCGG CGGCTGCCGC CGCGATGTCC GGGGATGTAA CCGGCCGGTG CATGCGCCTG CTGGCGTCTG TGCCCAGGGG TGAGCTGCGG GTCTGCGAGG CGACCAGGGC GGCGTCCGAC GGGGCCTTCA CCCATGCTTC GACGGACGAG CACGGCCGAG GCGTCGGCGG CGTGGTGGCG GTGCGCCTGG GCGGCTTGGC GAGTGCGTTC GGGGCCGGTC CGGCGGGCCG TGAAGGAGCC CTCCACGTGC CATTCCTCGA GCGGGAGCAC GAGAAGGACC TGATGCACCG GCTGCTCGCC GAGACCGCCC GCCGCCGCCG GCCACGGCTG GTGACGATCT TCGGTGAGCC CGGCATCGGC AAGAGTCGGC TGCTGTGGGA GTTCTGCCGG GAGGCCGGCG ACGCGGCTGT CTCGCGCGAC GTCCAGTTCC TGCACGGCCG GATCGGGAGG 
TTCCAGAGCG GCCCGTTCGA GGTTCTTACG GACATCACCC GCACGTGGGC GGGGATCGCC GAGGACGACT CCCGTGAACT GGCGGTGACG AAGCTCGCCG CCGCGGTCGG GATGACGGGC GCCACCTTCG AAGTCTCGCG CCGCCTGGCG GCCGAGCTGG TTCCGCTGCT GGACCCAGCC TGGCCGGGCG GCGAGGACGG AGCACGCGTG CTCGCGGCCT GGCGTCGGAC ACTTGGGAGG ATCGCGGCCG TGGCCCCGAC GGTGCTCGTC GTCGAGGACC TGCACCGGGG AGACGACCGG CTGCTCGACT GTCTGGAGGT GCTCGACGAG CAGCTGGGCC GCGTTCCGCT GCTTGTCGTC GCCAGCACCC GGCCGGACCT GCTCGAACGG CGGCCGTGCT GGGCCGGCGG CAAGCGTGAC GCGACGACGA TGTCCCTCGA CCAGCTCTCC GCCGCCGCCA TCCAGCATCT GATGGTCGAT CTGGCGGTGT TGTACGGGCT GCGCCCCGCG TCGTCGGCGC CGCCGCTCGC GGGCGCGGGC GGCGGCGGGC CCGCGCCCGA GACCGCGGCC CACGAGTGCG GGACGGTCGC CAAGGTCGGC GGCAATCCGC TGTTCGCGGT CGCCCTCGTC GAGATGCTCC GCGCGGCACG GGCCGCCAAG GGTGACGGCG TCTCCCCGGC TCTGCTGATC GACAGTGCGG GCCCGATCGG CCCGGGCTCC GCGATGGTGT CGATCCCGCG CACGGTTCAC GGCGTGATCG CGGCGTGGCT GGACACTCTG CCGTCACGCA GCCGTCTGGT GCTGCAGGGC GCGGCGGTGT TCGGCGCGAC CGTCTGGGCC GCGCCGGTCG CGGCCGAGTG CGAGCTGACC CGCGAAGACG TCCTACGGGA GCTGGAGTAC CTCGAACGCC GGGGCGTGCT GCGGCAGATG CCGACCGTTG CCGACCCCGA CCCGCACTAC GAGTTCCGGC ATGCCTTCGT CCGGGACGTC GCCTACTCGC AGATCCCGCG TGCCGTCCGG GGAGGGTGGC ACCTGTGCTT CGCCCGCTGG TTGGGTGACG AGTACGGGCG GGTCGCACAG GACGATCTCC GCCGGCACCA CCATCGGCGC GCCGACAGCC TCACCGCCGC GGCCGGCTAG
|
Protein sequence | MRFHVLGPLE VIRDGQPIVV PGVHQRATLG FLLLHPNTAV ATSRLLQALW DQEPPVTARK MVQNAVAGLR RTLGGGADVL TRPPGYQLSI RADQVDLAAF QELSARGRAE LAAGSWAAAA ATLSAALGHW RGPALADLAE EGLAWPEAVA LDSARMVTFE DWAEAQLALG HHQELVPELE SAVETTPLRE RLTGLLMLAL YRCGRQADAL ERYRRTRGAL VDQLGLDPGH ELQNLERAIL DHEPWLSSRP TGALTSRHAG VLAPGRMIAG SGALAGTGAL ARAGAVAGTG GLAAGGVVAA AGVAGMGGDG GAAGARLPEV FPARAPGVHA ARKWVSVLAG RLDIGSGAEL SDPEDADRVL RAVTATVRAE TARFGGEVHT ALGSVWIAVF GLPRTREDDA ERAVRAGLAI RRAVATSAKA SSYGVGQYDL RMAVTTGEVL ATLTQATPAA AASAAAAAMS GDVTGRCMRL LASVPRGELR VCEATRAASD GAFTHASTDE HGRGVGGVVA VRLGGLASAF GAGPAGREGA LHVPFLEREH EKDLMHRLLA ETARRRRPRL VTIFGEPGIG KSRLLWEFCR EAGDAAVSRD VQFLHGRIGR FQSGPFEVLT DITRTWAGIA EDDSRELAVT KLAAAVGMTG ATFEVSRRLA AELVPLLDPA WPGGEDGARV LAAWRRTLGR IAAVAPTVLV VEDLHRGDDR LLDCLEVLDE QLGRVPLLVV ASTRPDLLER RPCWAGGKRD ATTMSLDQLS AAAIQHLMVD LAVLYGLRPA SSAPPLAGAG GGGPAPETAA HECGTVAKVG GNPLFAVALV EMLRAARAAK GDGVSPALLI DSAGPIGPGS AMVSIPRTVH GVIAAWLDTL PSRSRLVLQG AAVFGATVWA APVAAECELT REDVLRELEY LERRGVLRQM PTVADPDPHY EFRHAFVRDV AYSQIPRAVR GGWHLCFARW LGDEYGRVAQ DDLRRHHHRR ADSLTAAAG
|
| |
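The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted nucleotide string might be cleaned and its GC fraction computed, for comparison with the GC content field. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the GC content of the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence in this record (illustration only).
prefix = clean_sequence("GTGCGATTCC ACGTGCTGGG")
print(len(prefix), gc_fraction(prefix))  # prints: 20 0.65
```

Applying the same two functions to the full gene sequence gives a value to compare with the reported 74%. The fraction is strand-independent, because complementing a sequence swaps G with C and leaves their combined count unchanged.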