Gene Francci3_3359 details

Gene Information

Locus tag: Francci3_3359
Symbol:
ID: 3905941
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3984685
End bp: 3988023
Gene Length: 3339 bp
Protein Length: 1112 aa
Translation table: 11
GC content: 71%
IMG OID: 637880682
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_482443
Protein GI: 86742043
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
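
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3984685, 3988023)   # 3339, matches "Gene Length"
length_aa = protein_length(length_bp)       # 1112, matches "Protein Length"
```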


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0181255
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCATCT TCCAGCCGAA GGGCGACTCG TTCGGGAGCC TGAACGAGAC CGTCCAGTCG 
CGGCGCCTGA TCGTGAGCCG CTACCGCAGC CTGCGTGAAC GGCAGTGGGC CCTGCTGGTG
CTCTGCACCG GCGCCCTCGC CATCGTGATC GACACCTCCA TCGTGAACGT GGCCCTGCCC
TCCATCGGCC AGGACCTGGG GTTCTCCGCC GCCGGGCTCT CGTGGGTGGT GAACGCCTAT
CTGATCCCGT TCGGCGGACT GCTCCTGTTC GCCGGACGGC TGGGCGACCT GGTCGGCGCC
AAGCGGATGT TCCTCGCCGG AATCGCGCTG TTCACGCCGG CCTCCCTCGT CTGCGGCCTG
GCCCAGAACC CGGAGACGCT GGTCGCCGCC CGTTTCGTTC AGGGGGTGGG GGGTGCGCTG
GCGGCGGCTG TCGTGCTCGG TCTGGTCGTC TCCACCTTCC CCAACGCGAT CGAGCAGGCC
AAGGCGATCG GCCTGTACGC GTTCGTCTCC TCCGCCGGTG CCGCCCTGGG GCTGCTGCTC
GGGGCCTGGA TCACCGAGGC GCTGAGCTGG CACTGGACCT TCTTCATCAA CGTTCCGTTG
GGCCTGCTGG TCCTGCCGCT CGGCGCGCGG CTGGTGGACG ACGAACGAGT GGTGCTCGAC
GACCGCAGCA TCGACGTCCC GGGCGTCCTG CTGATCACGA GCGGACTGAT GCTCACGGTC
TACACGATCG TGCGCTTCGC CGACGGCGCG ACACCGCTGA CCTGGGCGCT GGTGGGGATC
AGCGCAGCCC TGCTGGCGGC CTTCGTCGTG CGGCAGCGGA CGGCCCGCAA TCCGCTCGTC
CCGTTGTCGA TCTTCCGGGC GCGGAACGTC GCCTGGGCCA ACATGGTGCA GGCGCTGATG
ATCGCCGGCA TGGGCGGCAT GTTCTTCCTC GGGACCCTAT ACCTGCGCCA GGTGCTGGGG
CTCAGCGCCA TGGAGGTCGG GCTGGCCTTC CTGCCCACCG CCCTCATGGT GGCCGTGATC
TCCCTCAAGG CGACGCCGCG ACTGATGGAG AAGATCGACC CCAAGACCGT GCTGGTGCCC
GGCCTCCTGC TCATCACCGC GGGCCTCGCC CTGCTGGCCG TTGCTCCGGC GGACGGTGAC
TACTGGCGCG ACGTCCTGCC CGCCATGGTG CTTCTGGGCG CCGGCGCCGG CCTCGGGTAC
CCGTCCTCCC TGACCATGGT GATGGCCGAT GCCACCGCCG CCGACCGGGG ACTGCGCTCC
GGGCTTGTCA ACACCACCCA GCAGGTCGGA CCGGCGATCG GCCTGGCCGT CCTGGCGACC
GTGGCCGCCG ACCGCACCAC GACGCTCGTC TCGCGGGGGA GCACGCACGC CGAGGCCCTG
ACCGGCGGAT ATCACCTGGC CTTCTGGATC GGGAGCGGGG CCAGCCTGGT CGCGCTGGTG
CTGACGGTGC TGTTCCTCAG GCCGGCGGTG CCGACGAGCG TACCGAAGAC CCTCGACCGG
GCCCGGGCCT CGCAGCGTCT GACCCAGGAC CACACCGGCC TGTCCGACCC GGATTTCCTG
GCTCTGGGCC TGGGCGGAAC CAACATGATG GCCATGCTCT GGTCGGTGGC CATGGGCAGG
CGTGCAGTGG GAGTGGAACT GCGCGGCTCG CCCTACGTGT CGGTGATGCG CTGGACGATG
CGCGAGGAGA TCTACCACCA CCTCGCGATG ATCGATTCGA TGATGCTGGA GCGGTACGGC
GAGGATCTGC TGCCGCGCCT CGGCGACGGC CGGCCGTTCA GACTGGCCGA CTGCTTCTTC
TCCCGGCACA CCAGGGCAGG ATCCGTGCGG GGCGACGAGG TGGTGACCGG CTTCGAGTCG
GACGCGCACA TCGGCGGTCT CGCCCGGCAC TACGAGACCA TCGACGACCG CTACCGGGAG
GGCCGGGCGC ATCGTGAGAT CAACGTCCTC GACCCCCTGG AGGGCCCGGA GGGGCTCGAC
CCCGCGGCGT TGGAGCACGA CCTCTCGGAC GTGCTGGGCA GCCAGTCGAT GTTCCAGGTC
GGGGCCGAGG AACTGCTCGT CATGCTCCGC CGGTACCTGG AGGGGCTGGA ACAGATCGAC
CGGGACCGGG GCGAAGCCCT GCCGCGGGTG CGCGTCCTGC CCTACCACCG GGTCGCCACG
TCGGAGTCCT CCGACCGCGA GCGCCGGTGG CTGCCCTCGG CGCTGCGCCG GACCCCGGTC
GCGGAACGGG GCTTCGAGCG CACGGCGGAC GGCCGGGTGC GGGTCCGGGT CGAGGCCGTG
CGCGAGCTGG ACTACCGGGG TGCCTTCCGC CGTGTGCGGG TGCCGTACAG CGAGACGGTC
GACCTCGGCA CCCCCGGGCT GATCATGATT GCCGAGGGGC TGGACAGCCC GGACGCGGCC
CGGCTGGGAT TCCGTCAGGA GACGGTGTGC GTGGACCGGG GCGACGGGCG GGGCCCTGTG
CCGGCCGAGG CCGACTTCCT GGTCGGCAAC GTCGACATCT ACCTCGCCGA CCGGATCCGT
GTCCGTCGCG CGTCGGAGTT CGACCAGGCG GGCAACGAGT ACTGGGTCCA CCAGCGGTGC
GTGGGGCACG AGGAGGACGC GCAGATCGGC TGGACGATGC TGGAGGTCCC GCCGTACAAG
ACGTTCGACC CGATCCAGGC CGGCCTGGTC CCGGCCGGGA CGCCGAAGGA CTCCAAGCCC
TACTTCGCCG GGCATCAGTT CCTGCTCCGG CGCTACTTCC TGGACCAGAC GGCACAGCTG
ACGGAGATCC CACGCCGGGC CATCGAGTAC AACCAGCTCT CCTACGGCCC CAAGCTCTTC
ACCGTGACGG AGCGGATCGG GCTCGACGCC CTGGTGGCCG CGAACGCGGT CGTGGCCGGT
GACTCCTTCG GCAACTCCCA CCACCTGACC AGCGGCGGCA TCAACACCGG CATGCTCGGT
CACGGCCTGC GGGTGCTGCG TTACTGGCAG GCCAGGGAGT CCGGTAGCAG CGCCAGCGAC
GCGGCGCGTG ACCTGGCCGA CGGGATCAGG GCGGACACCC GGGCGCTGAT CGACCTGTGC
GAGCAGGACT TCCGAGCGGC GCCGCCCGCG GCGCCCCCGA GGAAGCAGCA GCGACTGCTG
GCGGGAGCCA CCCGTCAGCG GCGTTCGATC TTCCCGCAGC GCTACCCGGA CGAGTGGAGC
CGCATCCAGC TCCGCACCGG CAAGATCTTC GCCTACAACC TGCCCATGCT CGATCCCGAG
CACCCCGACA CCCGCGAGGC CCAGCCGGTC GGCGCCATGT CGGGTGCCCC GCCGTCCGCC
AGCGAGCGGA AGGACCCGCT CGGCGCGGCT CGGGCATGA
 
Protein sequence
MSIFQPKGDS FGSLNETVQS RRLIVSRYRS LRERQWALLV LCTGALAIVI DTSIVNVALP 
SIGQDLGFSA AGLSWVVNAY LIPFGGLLLF AGRLGDLVGA KRMFLAGIAL FTPASLVCGL
AQNPETLVAA RFVQGVGGAL AAAVVLGLVV STFPNAIEQA KAIGLYAFVS SAGAALGLLL
GAWITEALSW HWTFFINVPL GLLVLPLGAR LVDDERVVLD DRSIDVPGVL LITSGLMLTV
YTIVRFADGA TPLTWALVGI SAALLAAFVV RQRTARNPLV PLSIFRARNV AWANMVQALM
IAGMGGMFFL GTLYLRQVLG LSAMEVGLAF LPTALMVAVI SLKATPRLME KIDPKTVLVP
GLLLITAGLA LLAVAPADGD YWRDVLPAMV LLGAGAGLGY PSSLTMVMAD ATAADRGLRS
GLVNTTQQVG PAIGLAVLAT VAADRTTTLV SRGSTHAEAL TGGYHLAFWI GSGASLVALV
LTVLFLRPAV PTSVPKTLDR ARASQRLTQD HTGLSDPDFL ALGLGGTNMM AMLWSVAMGR
RAVGVELRGS PYVSVMRWTM REEIYHHLAM IDSMMLERYG EDLLPRLGDG RPFRLADCFF
SRHTRAGSVR GDEVVTGFES DAHIGGLARH YETIDDRYRE GRAHREINVL DPLEGPEGLD
PAALEHDLSD VLGSQSMFQV GAEELLVMLR RYLEGLEQID RDRGEALPRV RVLPYHRVAT
SESSDRERRW LPSALRRTPV AERGFERTAD GRVRVRVEAV RELDYRGAFR RVRVPYSETV
DLGTPGLIMI AEGLDSPDAA RLGFRQETVC VDRGDGRGPV PAEADFLVGN VDIYLADRIR
VRRASEFDQA GNEYWVHQRC VGHEEDAQIG WTMLEVPPYK TFDPIQAGLV PAGTPKDSKP
YFAGHQFLLR RYFLDQTAQL TEIPRRAIEY NQLSYGPKLF TVTERIGLDA LVAANAVVAG
DSFGNSHHLT SGGINTGMLG HGLRVLRYWQ ARESGSSASD AARDLADGIR ADTRALIDLC
EQDFRAAPPA APPRKQQRLL AGATRQRRSI FPQRYPDEWS RIQLRTGKIF AYNLPMLDPE
HPDTREAQPV GAMSGAPPSA SERKDPLGAA RA
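
The gene and protein sequences are related through translation table 11 (listed under Gene Information), in which alternative start codons such as GTG are read as methionine at the first position. The sketch below is an illustration only, not the database's own tooling: it strips the 10-base spacing used in the sequence blocks, computes a GC fraction, and translates a CDS. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, and the start-codon set is an assumption based on the usual definition of table 11.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(seq_block.split()).upper()


def gc_fraction(seq_block: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq_block)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq_block: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at '*'."""
    seq = clean(seq_block)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

For example, `translate_cds("GTGAGCATCT TCCAGCCGAA GGGCGACTCG")` on the first 30 bases of the gene sequence yields `MSIFQPKGDS`, which matches the start of the protein sequence above.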