Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1585 |
Symbol | |
ID | 5669988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1893060 |
End bp | 1896002 |
Gene Length | 2943 bp |
Protein Length | 980 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641240504 |
Product | transcriptional regulator |
Protein accession | YP_001505930 |
Protein GI | 158313422 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3629] DNA-binding transcriptional activator of the SARP family [COG3903] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.79609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0274178 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGCGGA TCCGGCTGCT GGGCGAGGTC GGTGCCGTCA CGTCCGGCGG CGAGCCGGTC GACGTCGGCC CGGCCAAGTG TCAGGCGCTG CTCGCCGCGC TCGCGCTCGC GCCCGGCAAG GTCGTGCCGG TCGGGAGGCT GGTGGAGCTG GTCTGGGGCC CGCACCCGCC CAGGACGGCC GACCGCACCC TGCAGTCGTA CGCCACCCGG CTTCGCAGGG CGCTCGGTCC CGGTTCGATC CGCCGGGTCG GCGCCGCCTA CCTGCTGGAT GTCGTGCCGG AGGCGGTGGA CGTCGTCCGC TTCCAGCGAC TGCTCGACGA GCGGGACGTC GCCGCCGCCC TCGCCGAGTG GGCGGGCCCG CCGCTGGCCG GGCTGGTCGT CCCCGGCCTG GCCGGAGCCG TCGACGGGCT GGTCGAGCAG TGGCTGGCCG CCGTGGAGGT GGATCTGGCG CGCCAGGTGG AGACCGACGC GCGTTCCGCG CTGGGCCGGC TCACCGAGCT GACCGCCGCC TACCCGCTCC GCGAGGGCCT GTGGGCGCTG CTCATGACGG CGCTCTACCG GGCCGGGCGC CCGTCCGAGG CCCTGGCGGC GTTCCGCGCC GCCCGCCGGC ACCTGGTCGA CACCCTCGGG GTGGAGCCGG GGCCGGCGCT ACGCGAGCTC GAGGCACTGG TCCTGGAGCA CGACAGACGG CTCCTGCCGC CCGGCGAGGG ATTCGCCCAG GTCGGCGAAC ATCCCGTCCC GGGCGAGCGC CCGGGCAATC TTCCCCGCCG GCTGGGGCGG CTACTCGGCC GCGATCATGA TCTGGAGGCC GTCGGCGGCG CGCTGGCCGT CTCACCCGTC GTGACCCTGG TGGGGCCGGG TGGGATCGGC AAGACCCGGC TGGCCCTCGC GGCGGCGGCG CGGCACCTCG ACGCGGACGG CGTCTGGCTG GTCAAGCTCG CCGAGATCGC GGCGCCGTCC GATGTGGCGC GGGCGGTCGC CGGTGTCGTG GGCGTGACGG AGAGCTCCGG GCGCCCGCTG GGCGAGTCCG TCGTGGCGGC GCTGCGCCAT CGCCGGGTGC TGCTCGTCCT GGACAACTGC GAGCATGTCG TCGACGGCGC CGCGGCGCTC GCCCAGGCGA TCGCCGAGGG CTGCCCCGAG GTGCGGATCC TCGCCACCTC CCGGGAGCGG CTCGACCTCG GGCACGGCTT CGAGCGGCTC GTCGCCGTGG GCCCGCTGGA GCCCGCAGGG GCGGCCGCCG AGCTGTTCGC GCAGCGTGCC CGCGCGGTCT TCCCCGCCTT CGACGCGCGG GCCGGCCGCG CCGAGATCAC CGAGATCTGC CGCCGTCTCG ACGGCCTCCC GCTGGCCATC GAGCTGGCCG CCGCCCGCAC CGCGGCCCTC GGCCTCACCG ACCTGCTCGA ACGTCTCGAC GACCAGCTGC GCCTGCTCGT CGGCGGCGCG CGGACGGCCG ACGAGCGCCA CCGGACGCTG CGCGCCACCG TGCGGTGGTC CTACGACCTG CTCGCTCCCG CCGCGCAACG CCTGTTCCGG CAGCTGTCCG TCTTCGCCGG GCCGTTCGAC CTGGGCGCAG CCGCCGCCGT CGCGGCAACC ACCGTCGCGG CAACCGCCGA AAGCAACGGT GCGGACGGCT CGCGCGCCGC CGGCTGGCCG GGCGACCCGG GCACGGCCGA CGTGGGCGGC GCGGGCGGCG CGATGGGCGA GGTGGAGGAC CTGCTCGGTG ACCTCGTGGC GCGGTCGATG CTCGTGGCGG AGACCGGGCC GTTCGGGCGG CGCTTCCGGC TTCTGGAGAC GATGCGCCAG TTCGCCGCCG AACGGCTCGC TGATGCCGGT GAGACCCGGC AGGCGACCGC ACGGCACGCC CGGTGGTGCC TGAAGGCGGC CACCCGCGTC CAGGTGCTGC TCGCCGGTCC GGCCGAGGTC GAGGGCGTCG CCCGTCTCGA CGAGCTCTGG CCCAACCTGC GGGCCGCGTT CGACCGGGCG TGCGCGGCCG GTGACACGGC GCTGGCCCGG GCGCTCGTGC GCCGCGTCGT CGTGGAGATC GTCCGCCGCA GCCGCCACGA GATCGGCGAC TGGATCGAAC GGCTCCTGGC GCTCGGGCAG CCGGGGGCGG AGCCGGACCC CGACCTCGTC GTCTTCGCGC TCACCTGGGC GGCGCAGCGC TACAAGATGA GCCAGGACAG CGCCGCCTAC GAACGACTCG TCGAACGCCA CGGCGAGCCG GACCATCCGT TGGTCCGCCA TGCGCGGGCG TCCGTCTACG AGGACTACCC GGCCCATGTG ACCTGGGCCG CGCCGGCCGT GGCGGAGCTG CGCCGGCACG GCGCGGACGA CCTGGCCGAG CAGTTCGAGC TGGACGTCGG CGCGGCCCTG CTGTTCACCG GCCGGTTCGC CGAGCACGAC GCCGCGGTCG GGGCGCTGGT CGAGCGGTAC CGCCGGCAGG GGCCGCCCAC CCTGCTGAAC CTGACTCTGG TGCTGCTCGG CTACTCGGCG CTGCTGCGCG GCCGGCACGA CCACGCCGAC CGGCTGTTCG ACGAGGCCGT CGGCGTCGAG GTGCCAGCCC GGACGCACTC GCCGAACCGG GCCGTCCAGG CCCGGGCGGT CTTCCGACGC GGTGAGCGGG CACAGGCGTT CGCCATCCTG CACTCGCACG TCGAGCAGCT GCTCGACGAC GGGAACATGC AGGCCGTCTG CGTCACCACG GTCGAGTTCG TCAACATGGC GGCGGCCGTC GGCCGGCTCG AGGACGCGGC ACGCATGCTG GGCTTCCTCG AGACGACCGG CCTGCTCGAG CATCCCGCGT GGCGGACGCT CGTCGACGGC GCGACGGCCC GGGTCGCCGC CGACCCCCGC GTCGACCCGG AGCGGGAACG GGCCGGCGGC CGGCGGCTGG ATGACCGCCA GGCGCTGGAG TACATGCGCG GCGTCCTGGG CCTCCTCACC TGA
|
Protein sequence | MLRIRLLGEV GAVTSGGEPV DVGPAKCQAL LAALALAPGK VVPVGRLVEL VWGPHPPRTA DRTLQSYATR LRRALGPGSI RRVGAAYLLD VVPEAVDVVR FQRLLDERDV AAALAEWAGP PLAGLVVPGL AGAVDGLVEQ WLAAVEVDLA RQVETDARSA LGRLTELTAA YPLREGLWAL LMTALYRAGR PSEALAAFRA ARRHLVDTLG VEPGPALREL EALVLEHDRR LLPPGEGFAQ VGEHPVPGER PGNLPRRLGR LLGRDHDLEA VGGALAVSPV VTLVGPGGIG KTRLALAAAA RHLDADGVWL VKLAEIAAPS DVARAVAGVV GVTESSGRPL GESVVAALRH RRVLLVLDNC EHVVDGAAAL AQAIAEGCPE VRILATSRER LDLGHGFERL VAVGPLEPAG AAAELFAQRA RAVFPAFDAR AGRAEITEIC RRLDGLPLAI ELAAARTAAL GLTDLLERLD DQLRLLVGGA RTADERHRTL RATVRWSYDL LAPAAQRLFR QLSVFAGPFD LGAAAAVAAT TVAATAESNG ADGSRAAGWP GDPGTADVGG AGGAMGEVED LLGDLVARSM LVAETGPFGR RFRLLETMRQ FAAERLADAG ETRQATARHA RWCLKAATRV QVLLAGPAEV EGVARLDELW PNLRAAFDRA CAAGDTALAR ALVRRVVVEI VRRSRHEIGD WIERLLALGQ PGAEPDPDLV VFALTWAAQR YKMSQDSAAY ERLVERHGEP DHPLVRHARA SVYEDYPAHV TWAAPAVAEL RRHGADDLAE QFELDVGAAL LFTGRFAEHD AAVGALVERY RRQGPPTLLN LTLVLLGYSA LLRGRHDHAD RLFDEAVGVE VPARTHSPNR AVQARAVFRR GERAQAFAIL HSHVEQLLDD GNMQAVCVTT VEFVNMAAAV GRLEDAARML GFLETTGLLE HPAWRTLVDG ATARVAADPR VDPERERAGG RRLDDRQALE YMRGVLGLLT
|
| |