Gene Information |
Locus tag | Franean1_5445 |
Symbol | |
ID | 5673776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6587426 |
End bp | 6590911 |
Gene Length | 3486 bp |
Protein Length | 1161 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641244300 |
Product | transcriptional regulator |
Protein accession | YP_001509706 |
Protein GI | 158317198 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
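The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration only, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
start_bp, end_bp = 6587426, 6590911
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 3486
print(protein_length_aa(length))  # 1161
```

Under these assumptions, 3486 bp corresponds to 1162 codons, one of which is the stop codon, which matches the reported protein length of 1161 aa.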
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.202606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGAATTC AGGTCCTCGG ACCGGTCGAG GTGCTCCTGG GCGGCGCGCC GCTCGATCTC GGCCCACCGA AACAGCGCGC GCTGCTCGCC CTTCTCCTGG CGGCCGGCGA CACCGTCATC CCGGTCGAGA CGATCATCGA ACGGCTGTGG GACGGCCAGC CGCCGGCCAG CGCGACGGCG AGCCTGCAGG TGTACGTCTC GAACCTGCGC CGTCTGCTCG AGCCGGAGCG GGACCCCCGG GCGCGGGCGA CCGTGCTGGT GACCCAGGCG CCGGGGTACG TGCTGCGCAC CGGCGCGGCG GAGGTCGACG CCCGGCAGTT CGTCGCGGCG CTCGGCGCGG CCCGCGGCTC GCTGGAGGCC GGGCGGCCGG CCGAGGCGCT GCCCGCCCTC GACGCCGCCC TGGCGCTGTG GCGCGGTGAC GCCTACGCCG ACGTGCGGGA CGCGTCCTGG ATCAGGCCGG AGGCCGGACG TCTCGAGGAG CTGCGGCAGA GCGCCGTCGA GGACCGCGCG CGGGCGCTCG TCGAGCTGGG GCGGCACGAC GAGGCCGCCG CCGATCTGGA GGCCCTCACG GCGCGTCACC CGCTCCGCGA GCGGGCCTGG GGGCTGTTCG CCCTCGCGAT CTACCGGTGT GGCCGGCCGG CGGAGGCGCT GGCCGCGCTG CGGGCGGCAC GCGCCCACCT GGCCGAGGAG CTGGGCGTCG ACCCCGGGCC GGAGCTCGCC GCCCTGGAAC TCGCCATCCT GCGTCACGAC CCCGCGCTGG CCTGGGCCGG TGGCGCCCAC GAGCCGCCCG CGGCGCAGCC GGCCGTGTCT CCGACGACGG GGCTGGACGG CCCACAGCCT GTGCTCCCCG ACGTGTCCCC CGCCGCGGCC GGGGCCCCTG AGCCGTCCGG CCCGTTCGTG GGCCGCGACG AGGCGCTGGC GGCGCTGCTC GCGGCGATCC CGGCGCCGGC CCGGCGCGGG CGCGTGGTGC TGGTCGACGG CGAGCCGGGT GTCGGCAAGA CGCGGCTCGT CGCGGAGCTG GGCCGCCGGT GCGGCCGGTT GACGGCCTGG GGAGCGTGCC CCGAGCACGA GGTGGCGCCG GCCCTGTGGC CCTGGGAGCA GAGCCTGCGC GCGGTGGCCG CCGCGCGGCC GGACGTCCCG GTGCCGCCCG GTGTGGCCGC GCTCATCGAC CAGCGGGCGC AGGAGCCCGG CGGGTACGAC GCGGCCGGAG CCCGGCTGCG GCTGTACGAG GGAGTCGCCG GCTACCTCGC GGCGGCCGCG CCGTTGGTCA TCGTCCTCGA CGATCTGCAC TGGGCGGACG TCTCCTCGCT GCGGCTGCTG GCCCATGTCG GCGGGGCGGC GCCGCGGGGC GTGATGGTGA TCGGCACGTT CCGCACCCAC GAGGCCGGCG CGCTGGCGGA CACCCGCGCC GCGCTGGCCA GGTCCGGCTG CGAGCGGCTC CGCCTCGACG GGCTGGACGA CGCCGCCGTC CGCGAGCTGA TGCGGGCCAC CACCGGCACG GACCCGGGAC CCGAGACGGC GAGCGCCCTG CGCTCCCGTA CCGCGGGCAA CCCGTTCTTC ATCGGTGAGC TCGTCCGCCT GCGCGGGCCC GGCCGACGCG GCCTGGCGGG GACCGGCGCC GCCGGGCCGG CCACGGACCG GGGCGCGGAG CAGGACACGG CGCAGGGCGC GCTGCCCGAT CACGTCCGCG ACGTGCTGTC CCGGCGGGTC GCGCGCCTGC CGGAGCCCGC CGCCGCGCTG CTCGCCACGG CGGCGGTGGC CGGCGGCGAG TTCGACGCGG ACGTGGTCGC CCAGGTCGCC 
GGGCACACCT TCGAGACGTC CCTCGACCTG CTCGACGCCG CGCTCGCCGC CGGGCTCATC ACCGAGGCCG GCGACCGGCT GGGCCGGTTC CGGTTCTCCC ACGCGCTGGT CCGCGAGGCG CTCGACGCCG GCCACTCCCG GCTGCGCCGC GCGCACCTGC ACCGCCGGTA CGGCGAGGTC ACCGCCGAGC GGTACGCCGG ACGTCCCGAG CGGGCCGGCG AGGTCGCGCG GCACTGGCTG GCCGCCGCCG AGCTCGGCGC CGAGACGGCC CGTGCCGCGA CGGAGCACGC CGCCCGCGCG GCGCGCGCGG CGGCCGACCG GCTCGCGCCC GAGGACGCGG CCGGCTACTG GCAGGAGGCG CTGGCCGCGG CCGAGCTGGC GGGCGCCGAC CGCGGCACCC GGCTGGAGCT CCTCCTCGGC CTGGTCAGCG CCCGGTACGC GGCGGGCCAG CTCAACGACG GGCTCGACGT GGTGGACCGG GCCCTCGACG AGGCCGGCGA CGATCCGCAC CTGATCGTCC GGGTCGCCGA GGCCGCGATG GGTGGCTCGC CGTGGTTCCC GTTCCCCTAC GGCACCGACC GCGGCCGGCT GCACCGCGCG CTCGAGCGGG CGCTCGGCGG GCTGCCGACC GCCGGCCGCG ACCGGGCGCT GGCCGTCGGC TGCCTCGCGG TGCTCGAATC CCACGTCGGG CGGATGGCCG AGGCCGAGCG GGCCGGCGCA CTGGCCGTGG CCGCGGCGCG GGACGTCTCC GACGACCCTG CGCTGCTGCC GCGGGTCCTG CACCTGCGCA GTCTCACCGT GACCGGGGTC GACTACGCCG AGCACCGGCA CACCTGCGCG CGAGAACTGG CGTCCCTGCC GTCGACACCA CCGGAGCTGC TGGTCGGCGC CCACCTCACC CTCGTCGACA ACCTGGTCCG CTTCGGCCAC GTCGCGCGGG CGCGGGCCGC GCTGGGCGAA GCGGACACGA TGATCAGCCG GCTCGGCTCG CCGACCCTCG CCTACCAGGC GGCAATCATG CGGGCGGCGC TGCTGGCCTT CTCGGGCGAG CTCGAGGAGT CCGCGGACGT GGCGACCGCC GCCGTCGGGC GCCTCGGGCT GGCCGCGCAG AACGGGGTCG AGAGCTCGTT CGTCGCGAAC GTCATCGACC GGGCCCTCCA GGCCGGCACG CTGGCCCGGT TGGCCGACAC GCTGGCCCGC TCGCTGGCCT CCACCGGCAT CGAGGCCCTG CGCGGGTCGC TGGCGCTGGC CCTCGCCGCC GCCGGGCGCC CCGAGGCGGG ACGCGCCGCG CTCGCCGAGA TGCGGCTGCC GCCGCGCGAC TACACCTGGC TCAGCGCGGT CGTCATGCGG CTGAACGCCG CGGTGGCCCT CGGCGTGCTG GACACCGTCG AGGAGGACAT CGCCCACCTG CGGCCGCACT CGGGCGAGCT GGCGAGCCTG GGCACCTGCA CAGCGGTCGT CGGGGCCGTC GACAGCCACC TGGGCGAGGC CTACCTGGCG CTCGGTGACC ACATGGCCGC CCGGGAACAC CTGACCGCGG CGCTCGCCCT GCTCGAGGCG AACGACTCCC CCTACTGGGC GGCAAGAGCA CACCAAGCGC TTGCCAAGTG CCCGTCAAGC GGCAACCGCC ACGCTGAGTC CCATCCGAAG GAAAACGGGA GGGAACCTCG ATGGCTCGCG CACTGA
|
Protein sequence | MRIQVLGPVE VLLGGAPLDL GPPKQRALLA LLLAAGDTVI PVETIIERLW DGQPPASATA SLQVYVSNLR RLLEPERDPR ARATVLVTQA PGYVLRTGAA EVDARQFVAA LGAARGSLEA GRPAEALPAL DAALALWRGD AYADVRDASW IRPEAGRLEE LRQSAVEDRA RALVELGRHD EAAADLEALT ARHPLRERAW GLFALAIYRC GRPAEALAAL RAARAHLAEE LGVDPGPELA ALELAILRHD PALAWAGGAH EPPAAQPAVS PTTGLDGPQP VLPDVSPAAA GAPEPSGPFV GRDEALAALL AAIPAPARRG RVVLVDGEPG VGKTRLVAEL GRRCGRLTAW GACPEHEVAP ALWPWEQSLR AVAAARPDVP VPPGVAALID QRAQEPGGYD AAGARLRLYE GVAGYLAAAA PLVIVLDDLH WADVSSLRLL AHVGGAAPRG VMVIGTFRTH EAGALADTRA ALARSGCERL RLDGLDDAAV RELMRATTGT DPGPETASAL RSRTAGNPFF IGELVRLRGP GRRGLAGTGA AGPATDRGAE QDTAQGALPD HVRDVLSRRV ARLPEPAAAL LATAAVAGGE FDADVVAQVA GHTFETSLDL LDAALAAGLI TEAGDRLGRF RFSHALVREA LDAGHSRLRR AHLHRRYGEV TAERYAGRPE RAGEVARHWL AAAELGAETA RAATEHAARA ARAAADRLAP EDAAGYWQEA LAAAELAGAD RGTRLELLLG LVSARYAAGQ LNDGLDVVDR ALDEAGDDPH LIVRVAEAAM GGSPWFPFPY GTDRGRLHRA LERALGGLPT AGRDRALAVG CLAVLESHVG RMAEAERAGA LAVAAARDVS DDPALLPRVL HLRSLTVTGV DYAEHRHTCA RELASLPSTP PELLVGAHLT LVDNLVRFGH VARARAALGE ADTMISRLGS PTLAYQAAIM RAALLAFSGE LEESADVATA AVGRLGLAAQ NGVESSFVAN VIDRALQAGT LARLADTLAR SLASTGIEAL RGSLALALAA AGRPEAGRAA LAEMRLPPRD YTWLSAVVMR LNAAVALGVL DTVEEDIAHL RPHSGELASL GTCTAVVGAV DSHLGEAYLA LGDHMAAREH LTAALALLEA NDSPYWAARA HQALAKCPSS GNRHAESHPK ENGREPRWLA H
|
| |
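The gene sequence begins with GTG rather than ATG, while the protein sequence begins with M. This is expected under translation table 11, where GTG is an accepted initiation codon and an initiating codon is read as methionine. The sketch below checks the first and last codons of a CDS against the start and stop codon sets of table 11. The helper names are hypothetical, and only the first and last blocks of the sequence above are used as input, not the full gene.

```python
# Start and stop codons of NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove the whitespace used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def first_codon(seq: str) -> str:
    """Return the first three bases of a (block-formatted) sequence."""
    return normalize(seq)[:3]


def last_codon(seq: str) -> str:
    """Return the last three bases of a (block-formatted) sequence."""
    return normalize(seq)[-3:]


# First and last blocks copied from the gene sequence in the record.
head = "GTGCGAATTC AGGTCCTCGG"
tail = "ATGGCTCGCG CACTGA"

print(first_codon(head), first_codon(head) in TABLE11_STARTS)  # GTG True
print(last_codon(tail), last_codon(tail) in TABLE11_STOPS)     # TGA True
```

Passing the full gene sequence to `normalize` and checking that its length is 3486 would additionally confirm the reported gene length.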