Gene Franean1_6409 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6409 
Symbol 
ID5674724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7779067 
End bp7780803 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content71% 
IMG OID641245257 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001510652 
Protein GI158318144 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.438631 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGTG ATCTGGGCAG CTTGGCGGGT CCGGAGGAGG CAGGGGAACC ACAGGAGGCA 
GGGGAGCTAC GAGCGGAAGG GGAAAGGGAA GGGATCCGAG GGGAGCCGGG TGGGCGATCC
GTGCCATCCG GCTCGGCCGG GCCGCCGCAC ACCGGGCTGG TGTTCACGGT GCTCGTGCTC
GGTGGCCTGC TCGTCGTTCT GGATATCACG ATCATCAATG TCGCGATCCG CACGCTGGCG
GCGGATCTCG ATGCTTCACT GCCGGTGATC CAGTGGGTTT CGACGGGATA CACCCTGGCG
CTGGCCGTGA CCGTGCCGAC GACAGCATGG CTGGTGGCAC GGTTCGGTTC CGGACGTGTC
TACATGGTGG CCCTGAGTCT GTTCGTGCTG GGGTCGGTGC TCTGCGGTCT GGCCTGGAAC
ATCGACGCGC TCATTGCCTT CCGGGTGGTG CAGGGGATCG GCGGTGGCCT GGTCAACCCG
GTCGCGATGA CGATTGTGCT ACGGGCCACG CCGCCGGAGC GGCGCGGGCG CGCCATGGGC
CTGTTGGGTC TGCCGGTGCT CGTCGGGCCG GTGATCGGGC CGACGCTGGG CGGCTGGCTG
GTCGACATCT CCTGGCGGTG GATCTTCCTG GTCAACCTGC CGCTGGGCCT GGCCGCGCTC
CTGCTCGCGA GCCGAGTCCT GCGCCCGGTC ACCGCGGCCG CGGGTGCCGT CCAGCGGGGC
GGCACGCTTG AGAGTGCTGA CGGGCGGCTC GACGTCCCAG GGTTGGCGCT TGTCGCGCCG
GGGCTGGCGC TGTTCGTCTA CGGGCTGGCG GAGAGCGGGC GGCGCGGGAC TGTGACCTCG
GCAGGTGTGC TCGTGCCGGC GCTGGCCGGG CTCGCGTTGG CGGTGGTGTT CGTGGTTCGG
GCCGCTCGGA TGCGCGCCCC ACTGGTCCAG GTTGCGCTGT TGCGGCTGCG GGCTGTCGCG
TCAGGGACGG CGACGCTGGC GCTGTTCGCG GCGGCCTACT TCGGTTCGAT GTTCGTCCTG
CCGCTTTACT GGCAGCTCGT GCGGGGGCTC AGCCCCGCGG AGACGGGGAT GCTGGCGATC
CCGCAGGCGC TCGCTACCGG GGCTTCGCTG CAGGTGGCGA GCCGGATGGT CGACCGGGTT
CCGCCTGCCC GCGTGGTGGG CTTCGGGATC GTGACGGCGT CCTGCGGGCT GATCACCGCG
ACGCTGTTGC TCGGCGTTGA CACCCCGTAC TGGCAGATGG TGGTGGCGAT GTCCGTCATG
GGGGTCGGCG CGGGCTCGAC GATCATGCCG ACGATCACGA CAGCGCTGCG GCATCTGAGT
GACCGGGACG CGCCGTCCGG CAGCACGCTG CTGACCATCA CCAACCAGGT GAGTGTCTCG
ATCGGAACCG CCCTGACCTC CGTCGTACTC GCGGCCGGCC TCACCACCCA GGGCGTGGCC
GGTGCGGCCG GTGGCGGCGG CGAGGGCGTG CTGCCGACCG TGGTGGACGG CCCGGCCGCC
GCCCGGCTTG CCGAGGCCTG CCAGGACACG CTGTTCGTGT CCGCCGCGCT GCTGGTCGCC
GCGCTCGTCG TGGCGCTGAC AGCGATCCCG GGCGGTCCGT TCAGTGCTTC TCGACGTCGC
CGGGTCCAGC CGCGTATCCG TGCTCCTCGA CGTCGCCGGG TTCGGCCTCG CGGGGAGGCG
CAGGATCAGT CGCCGGTTCG AGCAGCGCGG ACAGGCGATC TGACTCCGCC GCGGTGA
 
Protein sequence
MSSDLGSLAG PEEAGEPQEA GELRAEGERE GIRGEPGGRS VPSGSAGPPH TGLVFTVLVL 
GGLLVVLDIT IINVAIRTLA ADLDASLPVI QWVSTGYTLA LAVTVPTTAW LVARFGSGRV
YMVALSLFVL GSVLCGLAWN IDALIAFRVV QGIGGGLVNP VAMTIVLRAT PPERRGRAMG
LLGLPVLVGP VIGPTLGGWL VDISWRWIFL VNLPLGLAAL LLASRVLRPV TAAAGAVQRG
GTLESADGRL DVPGLALVAP GLALFVYGLA ESGRRGTVTS AGVLVPALAG LALAVVFVVR
AARMRAPLVQ VALLRLRAVA SGTATLALFA AAYFGSMFVL PLYWQLVRGL SPAETGMLAI
PQALATGASL QVASRMVDRV PPARVVGFGI VTASCGLITA TLLLGVDTPY WQMVVAMSVM
GVGAGSTIMP TITTALRHLS DRDAPSGSTL LTITNQVSVS IGTALTSVVL AAGLTTQGVA
GAAGGGGEGV LPTVVDGPAA ARLAEACQDT LFVSAALLVA ALVVALTAIP GGPFSASRRR
RVQPRIRAPR RRRVRPRGEA QDQSPVRAAR TGDLTPPR