Gene Franean1_3343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3343 
Symbol 
ID5671715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3956184 
End bp3957788 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content70% 
IMG OID641242232 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001507652 
Protein GI158315144 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.953855 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGATG ACCAACCACG ATCCGACTCC CGGAGGAGAT CCCCTCGATG CCCGACGCAG 
TCGATGTCTG ACCCCACGCT GGCCGCAGTT ACCGGCGGAG CCGACCGGCG CCGGTGGATC
GCCCTCGTGG TCGTGTGCCT GGCCATGCTG ATGAACGCGC TCGACTCGTC GATCGTCAAC
GTGGCGTTGC CGTCGATCCA GGGCGACCTG CACTTCAGCC AGTCCAACCT CACCTGGGTC
GTCGACGCCT ACCTGATCGC CTTCGGCAGT TTCCTGCTGC TCGCCGGGCG CCTCGGCGAC
CTGGTCGGCC GCAAGCGGGT GTTCCTCACC GGGGTCGCGC TGTTCACCGC CGCCTCGGCG
CTGTGCGCCG TCTCCCCCAA CCAGGGCACG CTGATCGTGG CCCGCTTCGC CCAGGGGCTC
GGCGGCGCCG TCTCCTCGTC GGTGATCATC GCGATCATCG TCACCGAGTT CCCGAACGCG
ACGGAGCGGG CCCGGGCGAT GAGCGCCTAC ATGCTCGTCG CCGTCGGGGG CGGGTCACTG
GGGCTCGTGG CCGGCGGAGT GCTCACCCAG GCCGTGAGCT GGCACTGGAT CTTCCTCATC
AACGTCCCGA TCGGGGTCGC GACCTTCGTC CTCGGGGTCG TCCTGATCGA GGAGAACGTC
GGTCTCGGGG TGGGCCGAGG AGTGGACGTC ACCGGTTCCC TGCTGGTCAC CGCGGCGCTG
CTGCTCGGCA TCTACGCGAT CGTCACGGCG GCCACCCACG GCTGGGCCTC GGCCCAGACA
CTCGGGTACG GGGCCGTCGG GGTTGTGCTC CTGGCCGCGT TCTTCCTGGT CGAGGCCCGG
CGGAGCAATC CGATCATGCC GTTGCGGATC CTGCGGGTGC GCAGCCTGGT CGGCTCCAGC
GTGGTGCGCG GATGCCTGTT CATCGCGATG TACGCCGTCT TCTTCTTCGG TGCGCTCTAC
CTCGAGCAGG TCCGCGACTA CAGTCCGCTG CGCACCGGGC TCGCGTTCCT GCCGATGTCC
CTCGTCGTCG CGGCCCTGTC GATGGGCATC GTCAGCCGGC TGCTCATCCG CTTCGGAGCG
ATGAACGTGC TCGTCCCCGG ACTGGTCGCG GTGATCGTCG GCCTGTTCCT GCTGACCCGG
GTGGACGAGC ACACCAGCTA CGCCGCCGGC CTGCTCCCGG GGCTGCTGAT CCTCGGTCTC
GGCATGGGCG CCGCGATGGT CCCGCTGCTG TCGATCGCCA TGGCGGACGT CCCCCCGGCC
GACGCCGGGC TGGCCTCGGG AATCGTCAAC GTCTCGATCT GGCTCAGCTC CTCGGTCGGC
CTCGCCGTCC TCTCCAGCCT CGCCGCCAGC CGCACGAAAA CCCTCACCGC CCACGGGCAC
TCCGCCGTCG CCGCCCTGGT CAGCGGCTAT CACCTGGCCT TCCTGATCGG TACCGGCTGC
GCCCTGCTCG GGCTCGTCAC GACGCTCGTC GTCCTGCGCG CTCCGGCCCG GGCGGCGTTC
GCGGCGACGC GGGCGGCAGC GGCGGCGACA GCCGAATCCG CGAAGCTGCC CACCCCCCGC
GAGCATCCTG AGCCCCGGGT GGAGGCGGTC CCCCACGAGT CCTGA
 
Protein sequence
MPDDQPRSDS RRRSPRCPTQ SMSDPTLAAV TGGADRRRWI ALVVVCLAML MNALDSSIVN 
VALPSIQGDL HFSQSNLTWV VDAYLIAFGS FLLLAGRLGD LVGRKRVFLT GVALFTAASA
LCAVSPNQGT LIVARFAQGL GGAVSSSVII AIIVTEFPNA TERARAMSAY MLVAVGGGSL
GLVAGGVLTQ AVSWHWIFLI NVPIGVATFV LGVVLIEENV GLGVGRGVDV TGSLLVTAAL
LLGIYAIVTA ATHGWASAQT LGYGAVGVVL LAAFFLVEAR RSNPIMPLRI LRVRSLVGSS
VVRGCLFIAM YAVFFFGALY LEQVRDYSPL RTGLAFLPMS LVVAALSMGI VSRLLIRFGA
MNVLVPGLVA VIVGLFLLTR VDEHTSYAAG LLPGLLILGL GMGAAMVPLL SIAMADVPPA
DAGLASGIVN VSIWLSSSVG LAVLSSLAAS RTKTLTAHGH SAVAALVSGY HLAFLIGTGC
ALLGLVTTLV VLRAPARAAF AATRAAAAAT AESAKLPTPR EHPEPRVEAV PHES