Gene Information |
Locus tag | Francci3_0923 |
Symbol | |
ID | 3906087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1065971 |
End bp | 1069678 |
Gene Length | 3708 bp |
Protein Length | 1235 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637878257 |
Product | ATP-dependent transcription regulator LuxR |
Protein accession | YP_480036 |
Protein GI | 86739636 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.280541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTCGACG AACAGTCACC GATGTCCGGT TCCGTCCGGT CCCGTCCCGC CGCATTTGAG GGTCGACGGC CCCTCCGCCC GGAGGATGCC GCTCCCCGTC CCGGGGAGGA GGATCCCGCG GACGGTCCCG AAAGCCTGCC GCTGCTCCTG AACCGGTTCG TCCCCCCGCC GGCCCCGCCG GGTCATGTCG ATCGACCACG GCTGCTCGAC GCACTCGACG CCGCCACCGT CCTGCCCCTC ACGGTGGTGC GGGCTCCGGC CGGCTCGGGC AAGACGGTGC TGGTCGCGGC CTGGGCCGCG GCCCGGCGGG GGGAGACGCC GATCGTGTGG GTGCGGGTCG ATCCGGCGGT CGTCCCCGGC CCGCGTGGCG GGGGGCCGGG CCTCCCACTG TGGCCGCAGC TGCTGCACGG GCTGGGACGA CACGGTCTGC TCCCGGCCGG TGAGGGCTTT GGGAGCCGGT ACGGTCCCGC GGCCGACGAT CTTTATGATC GGCGTTACCG ACGGCTCGCC AACATCCTCG CCGACCTCCC GACGTCCTTC ATACTGATCC TCGACGACGT GCATCTGATC ACCCGGCGTT CGGACCTCGA CGGCCTCGAA CTGCTGATCG ACGGGGCCGC GGATCGGGCC CGGATCATTC TGATGGGCCG GATGGTGCCC GTGGCGCTGC ACCGGCTGCG GGCCGCCGGG TGGATCACCG AGATCGGTCC GGCAGATCTC GCCTTCACCC GGGCGGAGGC CGCCGCGCTC CTGGGCCGGC AGGAGATCAC GGGCCGGCAG GAGATCACGG CGTCGCCCGC GACGGTGAGC GGTCTCGTGC ACGTCACCGA GGGCTGGGCG GCCGGGCTGC GGCTGGCGGC CGGGGCGCTC GCCGCACGGG AACGACGACC ACTGCCGGGG GCGGATGGGC GGTGGGAAGC GGCCGTCGCA TCCCCGTCCG GTGGCCTCGG TGACACGGCT GGTTTCTCCG GCGCCGTGGC CGGCCATCCC GCGATCGCCG ATTTCCTGCG GGCCGAGGTG CTAGCCGGTA AGCCGTCGCC GGTACGCCGC TTCCTCCTGC GGACCAGTGT CCTGGAGCGC ATGACCGGGC CGCTGGCGGA CGCCGTCGCG GCCGAGCGGA TGGACGCGGG CGTCCTGGCG GCGGCTACCG GCTCGCGTGC CGCGCCGGCA CCCACGGTGG GCGCACGGGT GAGCGGGAAG GAAGTTCTGG CGGCACTCGC GCATGCCGAC GGCTTCGTGG TCGAGCTCGA CCACGACGGC GGATGGTTCC GCTTTCACCG GCTGCTTGCC GCGGTGCTGC GTTCCGCGTT GGCCGCGGAC GGCGACGAGG ACACCGCTGA GCTCCACTTG CGAGCCGCGA TCTGGTTCGC CGCGCACCGT CGGACGACGG ACGCGGTACA TCACGCCGCG CATGCCGGGG ATTGGTGGTA CGCGGCCTGC CTGCTCGTCG ATGGGACCGA GATGGTGGAC GCCCTGCATG GCGGCGTCGC GGGGTTCAGC CCGATCATGA CGGGGATGCC GGACTCCCCG GCCTCGTGGT CGCCGGAGTG CGCACTGGTC GCCGCCGTGG GCCGGCTGGG CCGGGGACAG GTGGGGGCGG CCACCAGATA CCTGCAGGTG GCCCGGGGGA GCATCGCCGC GGCGGTCGTG TCCCGTCGTC GCGCGCTCGC GGTCCGGGCT GATCTCGTCG ACCTGCGCCG GGCGGAGCTC GCCGGGGCGT GCGAGGAGAT GCTGAGCATC ACCCGCCGGT TGCTACGGTC CCCGGGCAGG CCGGGCGCCG TGCCGCAGCC GGTTGGTCCG 
GTGGTTGCCG GTGTGACGGG GCGGGCGGAG GCTGCGGGGA AGGGCCGGCC ACGACCGCCC GCCGACATGC GGCCGCCGGG AGATCCCCGG TTCCTGCCCG CCGGCCCCGT CGGGGGCGCC GAAGGTCCCG GACGGGAGCC GGGCGGTGTT CGACGTGATC ACGTCGGGGG AGACGTGGCC CGCCCCGGCT CCGTCGTCGG TGGGGCCCCG CCGGACCGCC CGGCCGGGAT CGTCGCGCGA TCGACCGGTG GTGGCGTGGA TGGGGGCGTG GATGGTTCCG ACAGGATCGC GGCAGCCGTG GCCTGGTGTG CGCGCGGGCG GGCGGAGCTC TGGCTCGGCC GTCCCGACCT GGCGCTGGAG GCGTTGCGGG AGGCGATGAC GGCGGCCCGG GATACCGGTC TGCGGACGGT CGAGGCATCG GCGTCCGGGG CGATGGCTCT CGCCTACGTG CTACGGGGTC GGTTGCGGCA GGCCGAGTCC AGCGCGGCAG CGGCGGTCGC CACCGCCGCT GTGATCACTG CCAGCCCGGT CGATCCCACG GCCGTCCCCG CATCCTCGGA TGGTCGGACC GACATGCCGG CAGGCCTGGT CGAGGCCCAT CTCGCCCATG CGATGGTGGC GGCAGCCCGG GTCGACGACG TGCGTGTCAC GCACCATCTC GACCTCGCGC GGATGGCCTT CACCCCGGCG CATCCTCCCT ACCTGCTCGA TCTGATCGTC GTGCTGGAGG CGAGGGCGCG ATGCCGGCGT GGGGAGGCGG ACGATGTGCG GGCTGCCCGC CGTCTCCTGG CGTCGCGAGG ACGGCCGGCC GGACCGCCGC TGTGTCTGTC GCTGTGGCGC GCCGCCGAGA CCGACCTGCT GATCGCCGTC GGGAACGCCC AGGCTGCCCG GCAGCTGCTG GGCCAGCTGG CCGACACCCG CCGGCCGGAC CATGCTCTCG CGCTCGCGGC GGCCCGGGCG AACCTCGCGT GCGCTGATCT AGAGGGAGCG GAGCACGCGG TGGCGGCTCT GCTGCGGGCG GACGGTGGCG GAGGCGGTTC GGTGGTGTCG GCATGTGTGG TGGCAGCGGT GGCGGCCGCG CGGCGTGATG ACCATGCGCG GGCCACCGAC CTGCTGGCGC GTGCCCTCGC GCTCGCCGAG GACGAGGGAC TCGCTGGCCC GTTCCTCGAA TTCGGCGGCG AACCGTTGGC GATCTTGGAT GCCCATCCCG GCCTCTCCGC CGCACATCCG TTCTTCGTGT CGACTTTGCG GGCCATGGCC ACGACGGAGT CCGCCGCCGC GGTGAACCCG CGGACGCCGA CCGACGGCGG ACCGGTCATG GCCGTATCCG GGGCGGGGGT GCCGCCTCTG GGGGTGCCGC CTCTGGGGGC GGCCGGCCTG GAGGTGCCGA ACGGGGGGAC GCCGAGCGGG GGGATGCCGA GCGGAGAGGT GGCGGTGCCC GCGCCCGCGT GGCCGGCCAC CCCACGTCGA TCCGTCATGA GCGGGCGAAT GGCACGTCCG GCCCTCATGA CCCCCGCGGC GCGCTCGGCG CCGCCTGACC GTGACAGGCA GGGGGCCGAC GTCCCGCGCG GCCCCTCCGG CCCCTCCGGC CCCTCCGGCC CCTCCGGTCC GGCGGGCTAC GGCCTTGGGA CGGGAACAAG TCCGATCTCC CTAGGGCAGC CGGGGGCGCG TGAGCGCTCA CCCGGTTCCA GGGACGGGCG GGGACCCGGT GCGGGTGGCC GGCTCAGCGA CCGGGAACTC GCGGTCCTCA GCTATCTGCC GACGATGTTG ACGACCACCG AGATCGCCGC CGAGCTCTTC GTCTCCGTGA 
ACACGGTGAA GACGCACCTG AAGAGCATCT ACCGCAAGCT CGACGTGCCA CGGCGTCGCG ACGCTGTACA TCGCGCGCGC GAATTGCGTC TATTGTAA
Protein sequence | MLDEQSPMSG SVRSRPAAFE GRRPLRPEDA APRPGEEDPA DGPESLPLLL NRFVPPPAPP GHVDRPRLLD ALDAATVLPL TVVRAPAGSG KTVLVAAWAA ARRGETPIVW VRVDPAVVPG PRGGGPGLPL WPQLLHGLGR HGLLPAGEGF GSRYGPAADD LYDRRYRRLA NILADLPTSF ILILDDVHLI TRRSDLDGLE LLIDGAADRA RIILMGRMVP VALHRLRAAG WITEIGPADL AFTRAEAAAL LGRQEITGRQ EITASPATVS GLVHVTEGWA AGLRLAAGAL AARERRPLPG ADGRWEAAVA SPSGGLGDTA GFSGAVAGHP AIADFLRAEV LAGKPSPVRR FLLRTSVLER MTGPLADAVA AERMDAGVLA AATGSRAAPA PTVGARVSGK EVLAALAHAD GFVVELDHDG GWFRFHRLLA AVLRSALAAD GDEDTAELHL RAAIWFAAHR RTTDAVHHAA HAGDWWYAAC LLVDGTEMVD ALHGGVAGFS PIMTGMPDSP ASWSPECALV AAVGRLGRGQ VGAATRYLQV ARGSIAAAVV SRRRALAVRA DLVDLRRAEL AGACEEMLSI TRRLLRSPGR PGAVPQPVGP VVAGVTGRAE AAGKGRPRPP ADMRPPGDPR FLPAGPVGGA EGPGREPGGV RRDHVGGDVA RPGSVVGGAP PDRPAGIVAR STGGGVDGGV DGSDRIAAAV AWCARGRAEL WLGRPDLALE ALREAMTAAR DTGLRTVEAS ASGAMALAYV LRGRLRQAES SAAAAVATAA VITASPVDPT AVPASSDGRT DMPAGLVEAH LAHAMVAAAR VDDVRVTHHL DLARMAFTPA HPPYLLDLIV VLEARARCRR GEADDVRAAR RLLASRGRPA GPPLCLSLWR AAETDLLIAV GNAQAARQLL GQLADTRRPD HALALAAARA NLACADLEGA EHAVAALLRA DGGGGGSVVS ACVVAAVAAA RRDDHARATD LLARALALAE DEGLAGPFLE FGGEPLAILD AHPGLSAAHP FFVSTLRAMA TTESAAAVNP RTPTDGGPVM AVSGAGVPPL GVPPLGAAGL EVPNGGTPSG GMPSGEVAVP APAWPATPRR SVMSGRMARP ALMTPAARSA PPDRDRQGAD VPRGPSGPSG PSGPSGPAGY GLGTGTSPIS LGQPGARERS PGSRDGRGPG AGGRLSDREL AVLSYLPTML TTTEIAAELF VSVNTVKTHL KSIYRKLDVP RRRDAVHRAR ELRLL