Gene Information |
Locus tag | Francci3_3359 |
Symbol | |
ID | 3905941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3984685 |
End bp | 3988023 |
Gene Length | 3339 bp |
Protein Length | 1112 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637880682 |
Product | EmrB/QacA family drug resistance transporter |
Protein accession | YP_482443 |
Protein GI | 86742043 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
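The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3984685
END_BP = 3988023
PROTEIN_LENGTH_AA = 1112


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count minus one terminal stop codon (assumed to be present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
protein_aa = expected_protein_length(length_bp)

print(length_bp)                        # 3339
print(protein_aa)                       # 1112
print(protein_aa == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the computed values agree with the Gene Length (3339 bp) and Protein Length (1112 aa) fields.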
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0181255 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGCATCT TCCAGCCGAA GGGCGACTCG TTCGGGAGCC TGAACGAGAC CGTCCAGTCG CGGCGCCTGA TCGTGAGCCG CTACCGCAGC CTGCGTGAAC GGCAGTGGGC CCTGCTGGTG CTCTGCACCG GCGCCCTCGC CATCGTGATC GACACCTCCA TCGTGAACGT GGCCCTGCCC TCCATCGGCC AGGACCTGGG GTTCTCCGCC GCCGGGCTCT CGTGGGTGGT GAACGCCTAT CTGATCCCGT TCGGCGGACT GCTCCTGTTC GCCGGACGGC TGGGCGACCT GGTCGGCGCC AAGCGGATGT TCCTCGCCGG AATCGCGCTG TTCACGCCGG CCTCCCTCGT CTGCGGCCTG GCCCAGAACC CGGAGACGCT GGTCGCCGCC CGTTTCGTTC AGGGGGTGGG GGGTGCGCTG GCGGCGGCTG TCGTGCTCGG TCTGGTCGTC TCCACCTTCC CCAACGCGAT CGAGCAGGCC AAGGCGATCG GCCTGTACGC GTTCGTCTCC TCCGCCGGTG CCGCCCTGGG GCTGCTGCTC GGGGCCTGGA TCACCGAGGC GCTGAGCTGG CACTGGACCT TCTTCATCAA CGTTCCGTTG GGCCTGCTGG TCCTGCCGCT CGGCGCGCGG CTGGTGGACG ACGAACGAGT GGTGCTCGAC GACCGCAGCA TCGACGTCCC GGGCGTCCTG CTGATCACGA GCGGACTGAT GCTCACGGTC TACACGATCG TGCGCTTCGC CGACGGCGCG ACACCGCTGA CCTGGGCGCT GGTGGGGATC AGCGCAGCCC TGCTGGCGGC CTTCGTCGTG CGGCAGCGGA CGGCCCGCAA TCCGCTCGTC CCGTTGTCGA TCTTCCGGGC GCGGAACGTC GCCTGGGCCA ACATGGTGCA GGCGCTGATG ATCGCCGGCA TGGGCGGCAT GTTCTTCCTC GGGACCCTAT ACCTGCGCCA GGTGCTGGGG CTCAGCGCCA TGGAGGTCGG GCTGGCCTTC CTGCCCACCG CCCTCATGGT GGCCGTGATC TCCCTCAAGG CGACGCCGCG ACTGATGGAG AAGATCGACC CCAAGACCGT GCTGGTGCCC GGCCTCCTGC TCATCACCGC GGGCCTCGCC CTGCTGGCCG TTGCTCCGGC GGACGGTGAC TACTGGCGCG ACGTCCTGCC CGCCATGGTG CTTCTGGGCG CCGGCGCCGG CCTCGGGTAC CCGTCCTCCC TGACCATGGT GATGGCCGAT GCCACCGCCG CCGACCGGGG ACTGCGCTCC GGGCTTGTCA ACACCACCCA GCAGGTCGGA CCGGCGATCG GCCTGGCCGT CCTGGCGACC GTGGCCGCCG ACCGCACCAC GACGCTCGTC TCGCGGGGGA GCACGCACGC CGAGGCCCTG ACCGGCGGAT ATCACCTGGC CTTCTGGATC GGGAGCGGGG CCAGCCTGGT CGCGCTGGTG CTGACGGTGC TGTTCCTCAG GCCGGCGGTG CCGACGAGCG TACCGAAGAC CCTCGACCGG GCCCGGGCCT CGCAGCGTCT GACCCAGGAC CACACCGGCC TGTCCGACCC GGATTTCCTG GCTCTGGGCC TGGGCGGAAC CAACATGATG GCCATGCTCT GGTCGGTGGC CATGGGCAGG CGTGCAGTGG GAGTGGAACT GCGCGGCTCG CCCTACGTGT CGGTGATGCG CTGGACGATG CGCGAGGAGA TCTACCACCA CCTCGCGATG ATCGATTCGA TGATGCTGGA GCGGTACGGC GAGGATCTGC TGCCGCGCCT CGGCGACGGC CGGCCGTTCA GACTGGCCGA CTGCTTCTTC 
TCCCGGCACA CCAGGGCAGG ATCCGTGCGG GGCGACGAGG TGGTGACCGG CTTCGAGTCG GACGCGCACA TCGGCGGTCT CGCCCGGCAC TACGAGACCA TCGACGACCG CTACCGGGAG GGCCGGGCGC ATCGTGAGAT CAACGTCCTC GACCCCCTGG AGGGCCCGGA GGGGCTCGAC CCCGCGGCGT TGGAGCACGA CCTCTCGGAC GTGCTGGGCA GCCAGTCGAT GTTCCAGGTC GGGGCCGAGG AACTGCTCGT CATGCTCCGC CGGTACCTGG AGGGGCTGGA ACAGATCGAC CGGGACCGGG GCGAAGCCCT GCCGCGGGTG CGCGTCCTGC CCTACCACCG GGTCGCCACG TCGGAGTCCT CCGACCGCGA GCGCCGGTGG CTGCCCTCGG CGCTGCGCCG GACCCCGGTC GCGGAACGGG GCTTCGAGCG CACGGCGGAC GGCCGGGTGC GGGTCCGGGT CGAGGCCGTG CGCGAGCTGG ACTACCGGGG TGCCTTCCGC CGTGTGCGGG TGCCGTACAG CGAGACGGTC GACCTCGGCA CCCCCGGGCT GATCATGATT GCCGAGGGGC TGGACAGCCC GGACGCGGCC CGGCTGGGAT TCCGTCAGGA GACGGTGTGC GTGGACCGGG GCGACGGGCG GGGCCCTGTG CCGGCCGAGG CCGACTTCCT GGTCGGCAAC GTCGACATCT ACCTCGCCGA CCGGATCCGT GTCCGTCGCG CGTCGGAGTT CGACCAGGCG GGCAACGAGT ACTGGGTCCA CCAGCGGTGC GTGGGGCACG AGGAGGACGC GCAGATCGGC TGGACGATGC TGGAGGTCCC GCCGTACAAG ACGTTCGACC CGATCCAGGC CGGCCTGGTC CCGGCCGGGA CGCCGAAGGA CTCCAAGCCC TACTTCGCCG GGCATCAGTT CCTGCTCCGG CGCTACTTCC TGGACCAGAC GGCACAGCTG ACGGAGATCC CACGCCGGGC CATCGAGTAC AACCAGCTCT CCTACGGCCC CAAGCTCTTC ACCGTGACGG AGCGGATCGG GCTCGACGCC CTGGTGGCCG CGAACGCGGT CGTGGCCGGT GACTCCTTCG GCAACTCCCA CCACCTGACC AGCGGCGGCA TCAACACCGG CATGCTCGGT CACGGCCTGC GGGTGCTGCG TTACTGGCAG GCCAGGGAGT CCGGTAGCAG CGCCAGCGAC GCGGCGCGTG ACCTGGCCGA CGGGATCAGG GCGGACACCC GGGCGCTGAT CGACCTGTGC GAGCAGGACT TCCGAGCGGC GCCGCCCGCG GCGCCCCCGA GGAAGCAGCA GCGACTGCTG GCGGGAGCCA CCCGTCAGCG GCGTTCGATC TTCCCGCAGC GCTACCCGGA CGAGTGGAGC CGCATCCAGC TCCGCACCGG CAAGATCTTC GCCTACAACC TGCCCATGCT CGATCCCGAG CACCCCGACA CCCGCGAGGC CCAGCCGGTC GGCGCCATGT CGGGTGCCCC GCCGTCCGCC AGCGAGCGGA AGGACCCGCT CGGCGCGGCT CGGGCATGA
Protein sequence | MSIFQPKGDS FGSLNETVQS RRLIVSRYRS LRERQWALLV LCTGALAIVI DTSIVNVALP SIGQDLGFSA AGLSWVVNAY LIPFGGLLLF AGRLGDLVGA KRMFLAGIAL FTPASLVCGL AQNPETLVAA RFVQGVGGAL AAAVVLGLVV STFPNAIEQA KAIGLYAFVS SAGAALGLLL GAWITEALSW HWTFFINVPL GLLVLPLGAR LVDDERVVLD DRSIDVPGVL LITSGLMLTV YTIVRFADGA TPLTWALVGI SAALLAAFVV RQRTARNPLV PLSIFRARNV AWANMVQALM IAGMGGMFFL GTLYLRQVLG LSAMEVGLAF LPTALMVAVI SLKATPRLME KIDPKTVLVP GLLLITAGLA LLAVAPADGD YWRDVLPAMV LLGAGAGLGY PSSLTMVMAD ATAADRGLRS GLVNTTQQVG PAIGLAVLAT VAADRTTTLV SRGSTHAEAL TGGYHLAFWI GSGASLVALV LTVLFLRPAV PTSVPKTLDR ARASQRLTQD HTGLSDPDFL ALGLGGTNMM AMLWSVAMGR RAVGVELRGS PYVSVMRWTM REEIYHHLAM IDSMMLERYG EDLLPRLGDG RPFRLADCFF SRHTRAGSVR GDEVVTGFES DAHIGGLARH YETIDDRYRE GRAHREINVL DPLEGPEGLD PAALEHDLSD VLGSQSMFQV GAEELLVMLR RYLEGLEQID RDRGEALPRV RVLPYHRVAT SESSDRERRW LPSALRRTPV AERGFERTAD GRVRVRVEAV RELDYRGAFR RVRVPYSETV DLGTPGLIMI AEGLDSPDAA RLGFRQETVC VDRGDGRGPV PAEADFLVGN VDIYLADRIR VRRASEFDQA GNEYWVHQRC VGHEEDAQIG WTMLEVPPYK TFDPIQAGLV PAGTPKDSKP YFAGHQFLLR RYFLDQTAQL TEIPRRAIEY NQLSYGPKLF TVTERIGLDA LVAANAVVAG DSFGNSHHLT SGGINTGMLG HGLRVLRYWQ ARESGSSASD AARDLADGIR ADTRALIDLC EQDFRAAPPA APPRKQQRLL AGATRQRRSI FPQRYPDEWS RIQLRTGKIF AYNLPMLDPE HPDTREAQPV GAMSGAPPSA SERKDPLGAA RA
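The gene sequence begins with GTG, while the protein sequence begins with M. This is expected under translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation step, not the pipeline used to produce this record. The helper `translate_cds` and the `START_CODONS` set are hypothetical names, and the set lists only a subset of the initiators that table 11 permits. The amino acid assignments for internal codons are those of the standard genetic code, which table 11 shares.

```python
# Standard codon assignments in TCAG order (table 11 uses the same ones).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an accepted start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "GTGAGCATCT TCCAGCCGAA GGGCGACTCG"
print(translate_cds(prefix))  # MSIFQPKGDS
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.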