Gene Franean1_3555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3555 
Symbol 
ID5671924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4218201 
End bp4219784 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content73% 
IMG OID641242441 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001507861 
Protein GI158315353 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.21248 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAAT GGCTGACCCT GATCACGGTC TCCCTGAGCA CCTTCATGCT GCTGCTCGAC 
GTCACGATCG TCAGCGTCGC GGTGCCGGCG ATGGCCCGCG CGCTCGACTC CTCATTCACT
GATCTGCAGT GGACCGTCGA CATCTACGTC CTGGTGCTCG CCGCGCTTCT GATGGCGATC
GGGTCGGCGT CCGACCTCCT CGGCCGCCGC AAGGTCTTCC TGCTCGGGCT GGTCGTCTTC
GCGGCGGCCT CCCTCGCCTG CGGGCTGGCC CCGAACACCG GCTTCCTCAT CGCCGCCCGG
GGGGTGCAGG GGCTCGGCGC CGCGGCCATG TTCGCCACCA ACGCCGCGCT GCTCAGCGCC
ACCTACCGCG GCCGCGACGT CGGTGTCGCC TTCGGGGTGT GGGGCGCGGT CAACGGCGCC
GCCGCGGCGC TCGGCCCGAT CGTCGGCGGC CTGCTCACCG AGCACGTCAG CTGGCGGGCC
ATCTTCCTGG TCAACCTGCC GGTCGCGCTG ATCGCCATCG TGATCGCCCT GCGCTCCGTC
GCCGAATCGC GGGACCGGAT GAGCGGCCGG ATCGACATCC CCGGCACCGT CACGTTCACG
CTCACCGTCT CGCTGCTCAT CTACGGCCTC ATCGAGGCCG GTGACAAGGG CTGGTCGGAT
TCGGTCACCC TCGGGTGCCT CGCCGGCGCC GCCGTGGCCC TGGTGGTCTT CGTCCTGGTG
GAACGGGGCC GGCGCGCTCC CATGCTCGAG CCGCGGCTGT TCCGCGGCCC ATCGTTCTCC
GCGCTCATGG TCGGCGGCTT CGTGCTGACC GGGGCAGCCT TCGCGAACCT GGTGTTCGTG
TCGGTGTGGG CGCAGACCGT CCTCGACTTC GACCCGGTGA AGGCCGGGCT CGTGCTCACC
CCGTTGGCGG GGGTCTCGTT CGTGGTCGCC GGCGCCGGCG GCCGGCTGCT GCACGGCGTG
CCGCCGAGGT ACTCGATCGG GGCGGGCCTG CTGCTGGTCG GGGTCGGTAC GTTCCTCGAC
ATGATCATCG CTCCGTCGTC GGGATGGACC GCGCTGCTGG CCGGACTGAT CGTCACCGGC
GTCGGGGTGG GGCTGGCCTC GCCGGCGCTC GCCTCCGCAG CGCTCACCAC GGTGCCCCCC
GAGCGCGCGG GGATGGCCAA CGGCGCCATG AACACGTTCC GCCAGCTCGG GTTCGCCGTC
GGCATCCCCG TCTTCGGCAC GGCCCTGGCC GGGCAGGCCC GGGCCAGCCT CAGCGACAGC
GGCCAGTTCG ACGACCCGCA GGCCACCGCC AGCGCGCTGT CCGGCGGCGG CGCGCCGGAG
ATCATCGCCC ACGTCCCCGC GGCCGCTCGC GCCGCGGTCG ACCAGGCCCT GCACGCCGCG
TTCGCCGCCG GCCTCGACCG TGTCTTCCTG ATCAGCGGCA TCGCCGGGGT GGTAGCCGGT
GCCGTGGTGC TGCTGCTCGT CCGACCGGAA CAGGCGGCCG CCCGCGCCGC GGCGGACGAC
GGCCCGGCGG ACGCGGTGCC CGGAGGCCCC GCGATCCCGT CGCCCGGGGA CGGCAGCCAG
GTGCCGACGG GGGCGAACGG CTGA
 
Protein sequence
MRKWLTLITV SLSTFMLLLD VTIVSVAVPA MARALDSSFT DLQWTVDIYV LVLAALLMAI 
GSASDLLGRR KVFLLGLVVF AAASLACGLA PNTGFLIAAR GVQGLGAAAM FATNAALLSA
TYRGRDVGVA FGVWGAVNGA AAALGPIVGG LLTEHVSWRA IFLVNLPVAL IAIVIALRSV
AESRDRMSGR IDIPGTVTFT LTVSLLIYGL IEAGDKGWSD SVTLGCLAGA AVALVVFVLV
ERGRRAPMLE PRLFRGPSFS ALMVGGFVLT GAAFANLVFV SVWAQTVLDF DPVKAGLVLT
PLAGVSFVVA GAGGRLLHGV PPRYSIGAGL LLVGVGTFLD MIIAPSSGWT ALLAGLIVTG
VGVGLASPAL ASAALTTVPP ERAGMANGAM NTFRQLGFAV GIPVFGTALA GQARASLSDS
GQFDDPQATA SALSGGGAPE IIAHVPAAAR AAVDQALHAA FAAGLDRVFL ISGIAGVVAG
AVVLLLVRPE QAAARAAADD GPADAVPGGP AIPSPGDGSQ VPTGANG