Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3827 |
Symbol | |
ID | 5672191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4545250 |
End bp | 4548558 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242706 |
Product | transcriptional regulator domain-containing protein |
Protein accession | YP_001508126 |
Protein GI | 158315618 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG2909] ATP-dependent transcriptional regulator [COG3629] DNA-binding transcriptional activator of the SARP family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0262799 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAGG CGGGGGATGC GTCGCGGCGT GTACAGATCT ATGCCGGCGG GAATGCCGAA CTTGACGCCG CGCCTGGAAC AACCTCACGG GGAACGGCGA TGACCGAGGC GCCGGTTCCC CCGATCCTCG CGGCCCGCGT TCGGCTACCC GCCCCCGCCG GCATGCACCG GCCGCGGCTG GCCGGACCGC TCATCGTCCC GGCCGCCTAC CGGGTCGGAA CCGTGGTCGC GCCGGCCGGC GCCGGGAAGA CAACCCTGCT TGTGGAGGTG GCGCAGGCGT GTGCGTGGCC GACCGCCTGG CTCACGCTCG ACGACCGGAT CGGCGGCCTC GACTCGTTCC TCGCCCACCT GCATGCCGCC GCGGCAGCGG CGGCCGGTCT GCCCGCCGGA TCCTGGACCA GCATCGAGAA CGCCATCGTG GACCTCGACC GGCACCTGAC CGACAACCTG CTCATCGTCC TGGACGACCT GCACGCGGTG GACGGACAGC CGGCCGAGGC GGCGGTCCAA CTCCTGCTCG ACTACCTCCC CCCCAAGGTC CGGGTTCTCG TCGCCGCCCG CTGGCGCCCG AACCTCGATC TGCACCGGCT GCGTCTCGCC GGGCAGATAC GGGAGGTGGA CGCCGACGCG CTGCGTTTCC GGACCTGGGA GGTCGAGGAG CTGTTCCGGG ACTGCCACGG CGTCCGGCTG CGACCCGAAG AGGTAACCGC GCTGACCCGC AGCACGGACG GCTGGGCCGC CGGTTTACAG CTGTTCCACC TCGCCACCCG CGGCCGGCCC GCGTCCGAAC GGGCGAAGAT TCTCTCCGGG CTCGGTAACA CGCGGCTCAC CCGCGAATAC CTGACGATGC ACGTGCTGTC CGCGGTGAGC GCGGACGAGC GGGACTTCCT GGTGCGGACG AGCGTCTTCG ACCGGCTGAC CGAGAAACGC TGCGACAGCC TGCTGGGACG CACCGGGAGC GGTCTTCAGC TCGCCGAGCT GGAACGGCGT GGGCTGTTCA CCTTCGTCGA GGACGACGGG CGGACCTACC GTTACCACGA GGTGTTACGA ATTCACCTGC TGGAACGGCT CGTGCTGGAA CACGGGGAGG AACGAGCCCA GGAGGTGCAC GGGGCGGCGG CGCGGCTGTG TGAGGAGGAC GGCGCGCTCA CCGAGGCGCT CCTGTCGTAC TGCCGCAGCG GGGACTGGGA GCAGGTGCGG CGGCTGCTCG GCCAGGGCGG CCGGCGCCTG GCCGACGACC CGGCCGCCTG GTTCGAGCTG CTGCCGGCCG CGATCCGCGA CTCCGACCCG TGGGTGCTGC TCGCGATGGC CCGCCGGCTC GCCGCGGACG GGTCGCTCGA AGCCGCGGCC GATACCTATC GCGATGCCGC GACGGCGTTC GGTTCCGAAA CGCCGGCCCG GGTTCACCGC GAGCTCGCCG AACTCGACGA CTGGCTCCAC CCCGCGCCGC GCCGCGGCAC GACCTGGCTG CGGACTCTTC GGGCGGTGCT GGCCGATCCC GGCCCGCATG TCGAAGGATC CGCCGACCCC CCGCAGACGG CGCTCGTGCG CGGGATTGCC GCCTTCGTCG CGGGAGAGAT CGCGCTGGCC CGCCAACGCT TCGACACGAT CACCGCGGGT ATCGGCGCAA GTCCCGTGAT CGAGACGGTC GCCGCGGTCG GGCGGGCCGC GTGCGCCCTG CTCACGCAGG ACGCACAGGC CGGCGAGACG GTCGAGGAGG GAGTGGCCGC GGCGGAGCTG CTCGGGCATG CGGCCGTGCT CCGCGTTGCG GCCGCCGTCC GCGCCCTGTA CTTCGGCTAC GGCAACCGGC ACGTCCTGCA CCTGCTCGAC GCCGCCGGCC GGGCCGGTGA CCGGTGGGGG GTGGGGGCTC TCGTCCTGCT CGGCAGCTTT GCCGGCCTCC CGGTGCCGGA CGACGTCATG TTCCCGGCCG ACGAGCTTCC CGCGCCCGCC CTGCCGCCGT CGACCGGCCT GGCGCCGGGG CAGGTGTGGC CGGACGACGA GCCGATGCAC CTGCGCGCCT ACCGACAGCT GGCGATGGCA CTGGACCACA CGGCAGGGAT CTTCGACGAG CTGACCGCCG GTGCGCTCGC GACCTGGGCG CGGGCGTCCG CGCTGCTGGC CTGGCGGCGG GCGGGCGCGC CGCTGGACGA GGACGAGGCA GCGCGGGTGG TCCGGGCAGC GGCCCGGCTG GGACCGGCCC CGCACGCCCT CGCCCTCGTC GGTAGCGTCC CACCGCCCTC GGCCGCCCCA CCGCCGTCGC ACGCGGCCGG TCTGCCGCCG TCGCCCTCTG CGGCGTCCGG TGCCCCGTTC GCCCCAGGGT CCGGCGCGAC ATCCGGCCAG ACCCATCCGC ACGAGGTGGC GCGCCGGATC GCCGCCGAGG CGGGCGTCGG AAGCTGGGTG GAAAGGCTGA TCGCGTCGCT GTCGGGCGGC GGCCGCCCGC CCGCTGCCGT CATGCCCGCG GGCATGACGG GCACCGTCGC TGCCGCGGTC GACGCCGACG CGGCCAGCCC GGCGCCGGCC GGTGAACCGC CCGCTGCGGA GCGCCGGCTG ACCGTCCGCT GCCTGGGTCG GTTCGAGCTG GAGGTCGACG GAGTGCCGGT CGGCCTGGCC GGGGTACAGC CGCGCAACCT CGAGCTCCTT CATGTGCTCG CGGTGCACGC GGACCGGCCG TTGCATCGGG ACCAGCTGAT CGAGCTGATC TGGCCGGACG CCGCGCCGGC ACAGGCGAAC CACAGCCTGC AGGTCGCGGT CAGCGCCCTG CGCGGGCTGC TCGAACCGGC GGTCCCGCGC GGCCGGCGCG GGCTGCTCCG CAGGGCCGGG CAGGGCTACC TCCTGGCCCT CGTCGATCCG GACGACCACG ACATCCGCCG GCTCGAACGA CTGCTCGATG CCGCGGCCGC GGCACGACGC TCCGGTGACG GCGAGGCCGA GTACCAGCGG CTGGCGAGCG CGATCGAGCT CTACGCCGGG GACGTCCTGC CCGGCGGTGG CCCGGCGGAC TGGCTGCTGG CCGAACGGGA CCGGCTGCGG ACGACCATCG CCGCGGCCTG TGAACGCGCC GCGGCCATCT CGACCGAGAC AGGCCGGCAC AAGGATGCGG TCCGGCACGC CGAACTGGGG CTCGAGGTCG ACCGGTACCG GGACGGGCTG TGGCGGACGC TCGTGGTGGC GCTCCGATCG GGCGGGCAGC CGGTCGCGGC CGCCCGCGCG GAGGCCCGGT ACGAGGCGAT GCTCGCCGAC CTGGGCATCG AACTCACACC GGCCGAACCT CCACCCGGTG CGGGACATGG GCCGGCTTGG CCGACCTGA
|
Protein sequence | MPEAGDASRR VQIYAGGNAE LDAAPGTTSR GTAMTEAPVP PILAARVRLP APAGMHRPRL AGPLIVPAAY RVGTVVAPAG AGKTTLLVEV AQACAWPTAW LTLDDRIGGL DSFLAHLHAA AAAAAGLPAG SWTSIENAIV DLDRHLTDNL LIVLDDLHAV DGQPAEAAVQ LLLDYLPPKV RVLVAARWRP NLDLHRLRLA GQIREVDADA LRFRTWEVEE LFRDCHGVRL RPEEVTALTR STDGWAAGLQ LFHLATRGRP ASERAKILSG LGNTRLTREY LTMHVLSAVS ADERDFLVRT SVFDRLTEKR CDSLLGRTGS GLQLAELERR GLFTFVEDDG RTYRYHEVLR IHLLERLVLE HGEERAQEVH GAAARLCEED GALTEALLSY CRSGDWEQVR RLLGQGGRRL ADDPAAWFEL LPAAIRDSDP WVLLAMARRL AADGSLEAAA DTYRDAATAF GSETPARVHR ELAELDDWLH PAPRRGTTWL RTLRAVLADP GPHVEGSADP PQTALVRGIA AFVAGEIALA RQRFDTITAG IGASPVIETV AAVGRAACAL LTQDAQAGET VEEGVAAAEL LGHAAVLRVA AAVRALYFGY GNRHVLHLLD AAGRAGDRWG VGALVLLGSF AGLPVPDDVM FPADELPAPA LPPSTGLAPG QVWPDDEPMH LRAYRQLAMA LDHTAGIFDE LTAGALATWA RASALLAWRR AGAPLDEDEA ARVVRAAARL GPAPHALALV GSVPPPSAAP PPSHAAGLPP SPSAASGAPF APGSGATSGQ THPHEVARRI AAEAGVGSWV ERLIASLSGG GRPPAAVMPA GMTGTVAAAV DADAASPAPA GEPPAAERRL TVRCLGRFEL EVDGVPVGLA GVQPRNLELL HVLAVHADRP LHRDQLIELI WPDAAPAQAN HSLQVAVSAL RGLLEPAVPR GRRGLLRRAG QGYLLALVDP DDHDIRRLER LLDAAAAARR SGDGEAEYQR LASAIELYAG DVLPGGGPAD WLLAERDRLR TTIAAACERA AAISTETGRH KDAVRHAELG LEVDRYRDGL WRTLVVALRS GGQPVAAARA EARYEAMLAD LGIELTPAEP PPGAGHGPAW PT
|
| |