Gene Information |
Locus tag | Franean1_3490 |
Symbol | |
ID | 5671861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4149716 |
End bp | 4152568 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641242378 |
Product | LuxR family transcriptional regulator |
Protein accession | YP_001507798 |
Protein GI | 158315290 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
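The coordinate, gene-length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed 2853 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(4149716, 4152568)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the listed Gene Length and Protein Length fields.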
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.123756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.368778 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAG CCGCGATGAC CACCGCCGAG CTCACTGCGA CGGGTGCGCG CGGCACCGTC CGAACAGCGG AGCACGGCCG GATCCTGAAC GTGCTCGAGG GCGCGGACGG CGGTGGCCCG GCGATCGTGG AGCTCACCGG CGATCCCGGC TACGGCAAGA CCCGGCTGCT GACGGCCGTG GCGGACGAGG CCCGCAAGCG TGGCGTCGAC GTGCTGCGGG GCAGGTCGCT CGAGACCCGC CGGCACCGGC CCTTCCACCC GTTCATCGAC GCCTTCAGCA CCTGGCGGCC CACCGGCGCC GGGCCGCTGC CGCAGGCCAC CGCGTTCGTC CAGTCCCTGG CGACCGCCGG TGACATGCTC GCCGACACCG AGACCGGCCG CTGCCGGCTC TTCGCCGAGC TGCGGTCGCT GCTGGAGGAG ACACTGGCGG CGGCGCCCGA CCCGCTGCTG CTGCTGCTCG ACGACGTGCA CTTCGCCGAC CGGGCGTCGG TCGAGCTGCT CGAGATCCTG ACGCGTTGGC CGCTGGGGCA GCCGCTGTCG GTCGTGGTGG CGCATCGCCG GCGTCAGATG TCCTCCTGGC TGCAGGTCAC CCTGCAGCAG GGCGTCGAGG TGGGCGCGGT GGACCAGGTG GCGTTGCCGG CGCTCAGCCT GCGCCAGAGC GCCGAGCTTC TCGGGCTCGC GGTCACCGCG CCGGGGCTGG CCGACCTGCA CGAGCGCAGT GCCGGCAATC CGCTCTACCT GACGGCGTTG GCCGAGCGGG AGCGCGGCCT GAACGGAGTG GACCGGACCG GCCCGGACCC GACCGGCGCG GGTGGCCACG ACCCGGCCGG CGAGCCGCCC GACCTGTGGA CGCGCAGCGC GCTCGGCGCC CGGCTGCTGG CCGAGACCGT GCATCTCGAT GCCCGCCAGC GGCTGGTCGC GCACGCGGCG GCCGTGCTCG GCGACACCTT CTCGGTCGAC GCCGTGGCGG CCGTCGCGCA GCTCGACCGG GAGGAGGCCT GCCAGGCGCT CGGCCAGCTG CGCGGCCGCG ACCTGGTGCG CATGGTGCCG GGCGGCGAGC TGGTCTTCCG CCACCCGCTG CTGGGCAGCT GTGTCTACGG GGAGACCGAC TCCTGCTGGC GGGCCGGGGC GCACCGACGC GCCCTCGAGC ACCTCCGGTC GACGTTCGCC GCGCCGGCCG TGCTCGCCCG GCATGTCGAG CGCTCCGGAT CGCACAGCCA TGCCCTGGAC CTGACGGTGC TGCTCGACGC GACCCGGGCG GCCGTGCAGA GCGAGCAGGG CACCGAGGCC GCCCACTGGC TCAGCGCCGC CCTGCGGCTG CGCCGCGCCG CCCCGGAAAC CCCGAACGGC ACCGGCAGCA CCGGTGAGAC GGCCGGTGCG GCCGGTGCGG GGCTCCCGCC GGTCGGCCCG GAGGTCTGGC GCCCGGTGAT CGACCTGCTC GCCCGCCCCG ACGCCGTCGC GCGCCTGCGC GCCCTCGGCC AGGAGGTCCT GAACACCCCG GCCGCGTACG GGCCGGCCGG CCGGGTGGGC ACGGCGGCGC ACCTGGCGAT GGTCTCGGCC TCGCTGGGCT GCCACGACCA CGCCGTCGCC CTGCTGTCGA TCGCGCTCGC CGACGCCCGG GGGCCGGCCG AGGAGGGGCG CGTGCAGCTG TTCGCCCAGC TCGCCCGCAT CGTCTCCGGA GGCATACCGA GCCGCGCCGA GGTGGACGCG CTGACCGGGC CGGAGCCGGC CGGCGACCCG CTGACCCGGG CCGGCGGGCT CGCGGTCCGG GCGCTGTGCG CCGCGCTCAC CACCGACCGC 
GGCGCCACCG GGAACAGGTC CGCGGGCGCG GAGGCGGACG CGGCGGCCGC CGCGCTCGAC GCCCTGGCGG TGAACGGCCC GGACCCCTCG GGCCCGGAGC TGTACTTCCT GCAGTGCCTG GGCTGGACGG AGGCGATGCT CGGTCGCTAC GACAGCGCGC ACGCCAGGAT CACCCGCGCG GTGCGCGGCG CCCGCAAGCA CGGCGAGCGC CACCTGCTGC CCGCCCTGCT CAACAGCCTC GCCTACATCC ACTACCAGTC GGGGCGCCGG AGCGAGGCGA TCGAGGTCAC CCGGGAGGCC CAGCGCGTGT CGCAGCGGGC CGGTCGCGCC GACCAGGTGG CACTGGCCCA GGCCGTGGCC ACGGCCGCCT GGGCCGGGCT GGGGCGCTCC CCCACGTTCG GCCCGGAGCT GCCGGAGGGC GTGCACGCCG ACGCGCCCCG CACGCCGCTG ACCGCGCTGC TGTTCGCCGA GGCCGCGCTG GCCGCCGGCG ACGGACCGGC CGCCCTGGCC CTGCTGGGGT CGAAGCGGGA GACCTGGCGG GTCGCCGAGC CGATCCCGGT GCTGGGTGCC CGCATCTTCG AGGTGCTGGC CGCCGCCGCG CTGCTGGCCG GGGAGGATCC ACGCCCGTGG GCGGCCCGAG CGGCCGAGGA GGCCGCCCGG GTCGGGCTGC CCGAGCAGCG GGGCCACGCG CTGCTCGCCC GCGGCTACGC GCTGGGTCGC GGCCCGGAGG CCGACCGGTG CTACGCCGAG GCGGCCGGCC TGCTGGCGGG CGCGCCCGCC GGCGGCCGGG CCCGGACGTT CGCGCGGTCG GCTCAGCAAC GGCGCCACCG CCCCGGCCCC GACCCGCTGG TCGAGCTCAC CACCAGGGAG CAGGAGGTCG CCCAGCTCGC CGGCCAGGGG CTGCGGACCC GGGACATCGC CAGCCGGCTG CGGGTGAGCC CGCGCACCGT CGACACCCAC CTCTCCCACA TCTACGACAA GCTCGGCATG AGCTCGCGGG TCGAGCTGGC CCGGTTGCTG TCCCCGGTCC CGCTGGTGCC CCCGGGCCAC TGA
Protein sequence | MTTAAMTTAE LTATGARGTV RTAEHGRILN VLEGADGGGP AIVELTGDPG YGKTRLLTAV ADEARKRGVD VLRGRSLETR RHRPFHPFID AFSTWRPTGA GPLPQATAFV QSLATAGDML ADTETGRCRL FAELRSLLEE TLAAAPDPLL LLLDDVHFAD RASVELLEIL TRWPLGQPLS VVVAHRRRQM SSWLQVTLQQ GVEVGAVDQV ALPALSLRQS AELLGLAVTA PGLADLHERS AGNPLYLTAL AERERGLNGV DRTGPDPTGA GGHDPAGEPP DLWTRSALGA RLLAETVHLD ARQRLVAHAA AVLGDTFSVD AVAAVAQLDR EEACQALGQL RGRDLVRMVP GGELVFRHPL LGSCVYGETD SCWRAGAHRR ALEHLRSTFA APAVLARHVE RSGSHSHALD LTVLLDATRA AVQSEQGTEA AHWLSAALRL RRAAPETPNG TGSTGETAGA AGAGLPPVGP EVWRPVIDLL ARPDAVARLR ALGQEVLNTP AAYGPAGRVG TAAHLAMVSA SLGCHDHAVA LLSIALADAR GPAEEGRVQL FAQLARIVSG GIPSRAEVDA LTGPEPAGDP LTRAGGLAVR ALCAALTTDR GATGNRSAGA EADAAAAALD ALAVNGPDPS GPELYFLQCL GWTEAMLGRY DSAHARITRA VRGARKHGER HLLPALLNSL AYIHYQSGRR SEAIEVTREA QRVSQRAGRA DQVALAQAVA TAAWAGLGRS PTFGPELPEG VHADAPRTPL TALLFAEAAL AAGDGPAALA LLGSKRETWR VAEPIPVLGA RIFEVLAAAA LLAGEDPRPW AARAAEEAAR VGLPEQRGHA LLARGYALGR GPEADRCYAE AAGLLAGAPA GGRARTFARS AQQRRHRPGP DPLVELTTRE QEVAQLAGQG LRTRDIASRL RVSPRTVDTH LSHIYDKLGM SSRVELARLL SPVPLVPPGH
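The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how such a record could be checked, the code below strips the spacing, computes the GC percentage, and translates a coding sequence up to the first stop codon. It assumes the amino-acid assignments of translation table 11 match the standard code; alternative start codons, which table 11 permits, are not handled here (this gene starts with ATG). All function names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: table 11 shares its amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
print(translate("ATGACCACAG CCGCGATGAC CACCGCCGAG"))
```

Translating the first three blocks of the gene sequence reproduces the first block of the protein sequence, MTTAAMTTAE. Running `gc_percent` on the full gene sequence would give a value to compare against the listed GC content field.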