Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4833 |
Symbol | |
ID | 5673174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5772147 |
End bp | 5775131 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243689 |
Product | LuxR family transcriptional regulator |
Protein accession | YP_001509105 |
Protein GI | 158316597 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.010143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCACTGG TTGAACGTGA CGACATTCTC CGTCAGCTCG ACCTTCTCCT GGCGGAATGT ACGGCGGGAA GCGGCCGCGT CGTATTGATC GACGGCCCGG TCGGCACCGG TAAGACGGAG CTCGTCCGCT ACTGCGGCAC CCGGGCCGCC GGCAGCGGCG TGACAGTCCG CGCCGCGACC TGCGCCCGTG CCGAACACGT CCTCCCGCTC GGAGTGGTCG GCCAGCTCCT GCGCGGCCTG CCGGCGGGCG ACGGCGAGAC CGGCGCCGGC ACCGTCTGCA CCGGTCCCGA CGGTGCGGCC GACCCGGACA CCGCCCGTGG CACGGACACA CTGTGCGAGG AGGCGACCCA GGACGGGTGC CGAGGCCGCG AGGCGCGGGC GGACGGGGCG GGCCGGGCGG CCCGCATCGC GGAGCTCCTC GACGTCGGTG CCGCGATCGC CGGCAGGCTG GGGCCGGATC CGGATCGCGA ACTGGCCCAG ATCCACCAGG AACTCACGCT CCTCATCCTC GACCTCAGCA AGCGCGGTCC CGTCCTGCTG TGCATCGACG ACGTGCAGTA CGCGGACGTC CCCTCCCTGC ACTTCCTGCT GCACCTCGTG CGCCGGCTCG GATCGGCGCG CATCGCCGTG CTGCTCGCCG GTGATCTGGC TTCGCACCCG CTGGACCTGC CGTTCCGCGC GGAGCTCGTC CGGGCGCCCG AGTTCCGGTC ACTGCGGGTC GGCCCGCTCT CGCCCGCCGG CTCCCGCGAC CTCCTCGCCG CGGCCACGCC GGTCACTCCG AACCCTTCTG CCACAGACCC GTACCAGGTC ACGGGGGGCA ATCCACTGCT GCTGACCGCG CTGGTGCAGG ACGACCGCGG GTTCGGCCAG CCCGGTCCGG AGGTCTTCGG TCTGGCCCTG CTGAGCTGTC TGCACCGCGG TGAGCCCGTC GCGGTCCAGG TGGCCCGGGC CCTGGCGGTG CTGGAGACGC CGGTGCCCGA CGGGGTCGTC GCGCCGGACA CCGCGACGCT GGCGGGGATG ATCGGCGCGG ACGTCCCCGC CGCGGCACGC GCCCTCGACG CCATGAATGC CGCCGGGCTC CTCGATGACG GCCGCTTCCG CCACAAGGTC GCCCGGGACG CGGTGCTGGG CGATCTGACG GCGTCCGAGC GGACTGATCT GCACCGGCGG GCGGCGCGGC TTCGCCACCA GCAGGGCGCG CCGGCGGCGA CCGTGGCGGC ACACCTCGTC GAGGGCAACG ACGTGCAGCC CCCGTGGGGA ACCGGCGTAC TCGTCGAGGC GGCCGAGCAG GCGCTGCTCG ACGGGCGGCC CGAGCGGGCC GCCGCCTGCC TGAGGCTCGC GGCCCGGTCC GCCGCCGGTG AGCGGGAACG CGCCGCGATC CGGGCCCGGC TCGCGCATGC CGAGTGGCAG ACGAGCCCGG CGGCGGCCGC GCGCCATCTC TCCCCGCTGG TGAGCGCTGC CCACGCCGGC CTGCTGGAAC AGCGGACCAA CGCGGCCCTG GTACGTCAGC TGCTGTGGCA CGGACGCTCG GCCGAGGCCG AGGATCTGCT CGCCCGGATG CGTGCCGTGG CCCAGGCGGA GCCCGAGGAC GACGCGGCCG AGGTGCACGA CCTGGAGGTC TGGCTCGCCA CCGTCCATCC GCCGCTGGCT CGCCGCCGGC GTGGGCCGGC CGCGGGCAGC GCGGTGCCGG CGGCACCGGC GGCGGACTCG TGGCTGCGGT CAGCGGCGCT GCTCGCCGAC GCGGTGGCCG CGGGCGGTCA CCGGACGGGG CGCGAGCCGG GGCCCTCCGG CATCACCGGC GCTGGTGTGG CCGGCACCGG CCCGGTGGGG TCCGGTCTCG TCGGGGCGGA CCGGGCCGAG AGTGCCCTGC GCGATCTGCA CCTGGCGCGA GCCGATCCGT GGGCCGGTGA GGTGGCGCTG CTCGCCCTGC TGTTGCTGAC CAGGGCTCCG CGCATGGACG CCGCGGTCGC CTGGTGCGAG CGCGTCCTGG CCGATCCCGA CCTGGAGGAC CAGACCGCGC GGGCGATGGC GACGGCGGTG CGTGGCACCC TCGCGCTCGA GCAGGGCGAT CTGGCGGTCG CCGCCGACCA CGCCCGTGCC GCGCTGCGTC GCCTGCCGGC CAAGGCGTGG GGAGTCGCGA TCGGCCTGCC GCTGGGCACC CTGGTACTGG CCGCCACCCG GACCGGGGAC CTGGAGGAGG CGGCGAGGCA GCTCGTCCAG ACCGTGCCCG AGGAGATGGT CGCCAGTCGC TACGGACTGG ACTACCTGCA CGCCCGTGGC CACTACCATC TCGCCGCGAA CCACGCCCAC GCCGCGCTCG CCGACTTCCT GGCCTGCGGT GACCTCATCC GCGGCTGGGG CCTCGACGCG GCGGCTCTCG TCCCGTGGCG CATCGGCGCC GCCGAGGCGT GGCTGCGGCT GGGAAACGTG GACCAGGCCC GCCAACTGGC CCAGGAACAA CTGAGCCGGC CCGGTGCCAC TCGGGTCCGC GGTCTGTCCC TGCGCCTGCT CGCCGCCGCC AGCCCGCCCG GGCGCCGGCT TCAGCCGCTC ACCGAGGCGC TCGAGCTCCT CGAGGCCACC GGGGACCGGC TCGGGCAGGC CTACGTCCTG GCGGACCTGA GCCGCGTCCA CGATCATCTC GACCAGCGGC GGCGGGCGCG GCTGCTGCTG CGGCGGGCGC TGCACATCGC CACGATGTGC GGTGCGCGAC CGCTCGCCCA GGAGCTCCTC GCGATCTCCG GCGACGGCAG AGCTGTCTTC GGGCTCGGTG CCGACCAGGA GATGATCACC GGCCTGACGG ACTCGGAGCG GCGGGTGGCG TCACTGGCGG TGATGGGCTA CACGAACCGG GAGATCGCGC TGCGGCTCTA CGTCACGCCG AGCACGGTCG AGCAGCACCT GACCCGGGTC TACCGAAAGC TCAACGTCAA ACGCCGTCAG GACCTCCCCG CCGACCTGTG GACGGACGCC ACCCACACCG GTTGA
|
Protein sequence | MALVERDDIL RQLDLLLAEC TAGSGRVVLI DGPVGTGKTE LVRYCGTRAA GSGVTVRAAT CARAEHVLPL GVVGQLLRGL PAGDGETGAG TVCTGPDGAA DPDTARGTDT LCEEATQDGC RGREARADGA GRAARIAELL DVGAAIAGRL GPDPDRELAQ IHQELTLLIL DLSKRGPVLL CIDDVQYADV PSLHFLLHLV RRLGSARIAV LLAGDLASHP LDLPFRAELV RAPEFRSLRV GPLSPAGSRD LLAAATPVTP NPSATDPYQV TGGNPLLLTA LVQDDRGFGQ PGPEVFGLAL LSCLHRGEPV AVQVARALAV LETPVPDGVV APDTATLAGM IGADVPAAAR ALDAMNAAGL LDDGRFRHKV ARDAVLGDLT ASERTDLHRR AARLRHQQGA PAATVAAHLV EGNDVQPPWG TGVLVEAAEQ ALLDGRPERA AACLRLAARS AAGERERAAI RARLAHAEWQ TSPAAAARHL SPLVSAAHAG LLEQRTNAAL VRQLLWHGRS AEAEDLLARM RAVAQAEPED DAAEVHDLEV WLATVHPPLA RRRRGPAAGS AVPAAPAADS WLRSAALLAD AVAAGGHRTG REPGPSGITG AGVAGTGPVG SGLVGADRAE SALRDLHLAR ADPWAGEVAL LALLLLTRAP RMDAAVAWCE RVLADPDLED QTARAMATAV RGTLALEQGD LAVAADHARA ALRRLPAKAW GVAIGLPLGT LVLAATRTGD LEEAARQLVQ TVPEEMVASR YGLDYLHARG HYHLAANHAH AALADFLACG DLIRGWGLDA AALVPWRIGA AEAWLRLGNV DQARQLAQEQ LSRPGATRVR GLSLRLLAAA SPPGRRLQPL TEALELLEAT GDRLGQAYVL ADLSRVHDHL DQRRRARLLL RRALHIATMC GARPLAQELL AISGDGRAVF GLGADQEMIT GLTDSERRVA SLAVMGYTNR EIALRLYVTP STVEQHLTRV YRKLNVKRRQ DLPADLWTDA THTG
|
| |