Gene Franean1_3660 details


Gene Information

Locus tag: Franean1_3660
Symbol:
ID: 5672026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4336580
End bp: 4338256
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 71%
IMG OID: 641242543
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001507963
Protein GI: 158315455
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
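
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4336580
end_bp = 4338256

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1677
print(protein_length)  # 558
```

Both values agree with the Gene Length and Protein Length fields of the record.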


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGATC AAGCGCACGA AGCGGGGGCG GAGCCCGTCG AAGCCCTCGC GCCGGTCGAC 
GCCGCCGAGC CCGACGCCGC CGAGCCCGCT GCCACCGAGC CCGCTGCCAC CGAGATCGCT
GCCACCGAGA TCAATGCCAC TGAGGTCGAT CCGGCGTTGC GCCGCCTCGC GCTCACGGTC
ACCCTCGGCG CGATCATGGC CATCCTCGAC ACGACGATCG TGGCCGTAGC CATCAACACC
CTCGGGCGGG ACTTCGACGC CTCGCTGTCG ACGATCTCCT GGGTGTCCAC CGGCTACCTG
CTGGCACTCG CCGTCGTCAT CCCGCTGACC GGCTGGTCGG TCGAGCGGTT CGGCGCGACC
CGGATGTGGA ACATCTCGCT GGTGCTGTTC CTGGCCGGCA GCGCGCTGTG CGGCGCCGCC
TGGTCGGCCG GCAGTCTGAT CGTCTTCCGG GTGCTCCAGG GCCTCGGCGG CGGAATGATC
ATGCCGATCT GCATGACCCT GCTGGCCAGC GCCGCCGGCC CGCAGCGAAT CGGCCGCGTC
ATGAACATCG TCGGTGTGCC TGCGCTGGTC GCCCCGATCC TGGGCCCGGT CATCGGCGGT
CTGCTCGTCG ACAACCTCGA CTGGCGCTGG ATCTTCTTCG TCAACCTGCC GATCGGCGCG
GTAGCACTGG TCGCGTCATG GCGGGTGCTG CCGCGCGACG ACCGCGGGCA GTCGCACCAC
CGGCTCGACG TCCCCGGTCT GCTGCTGATC TCGCCCGGCC TCGCGGCGCT CGTCTACGGC
CTGTCCGAAG CCGGTTCGGG GGACGGCTTC GGAGCACTGA GGGTCCAGAT CAGCACCGCC
GCCGGAGTGG TCGCGCTGGT GGCCTTCGTC GTGCACGCCC TGCGTCGCGA GGGCGCGCTG
CTCGATCTCC GTCTGTTCGG CGACCGCACC TTCACCGTCG CCGGGGTCAC CACGTTCATG
GTCGGCGCCG GCCTGTTCGG CGGGATGTTC CTGCTGCCGC TGTACTTCCA GGTCGCCCGC
GGGCAGAGCG CGCTTGCCGC CGGCCTGTCG CTCGTCCCGC AGGGGGTGGG CGCGATGATA
GGCATGCCGA TCGCCGGGCG GATCGCCGAC CGCCGCGGAG CCGGCTACGT CGTACCGGTC
GGGATGGCCG TCTGCCTGCT CGGCACGGTC GCCTTCACGC AGGTCGACGC GCACACCAAC
ACGGTCGCGC TCGGGGCGCT GCTGTTCGTG CGCGGCCTGG GCTTCGGCGC CTCGATGATG
CCGGCGATGA GCGCCGCCTA CGTGACGCTC CGGCCCGCCG CTGTGCCGCG GGCCACCACG
ACGCTCAACA TCCTGCAGCG GGTCGGTGGC TCCATAGCCA CCGCCCTGCT CGCGGTGGAG
CTACAGCACG GGATCACCAG CCGGCTCCCC GGCTCGGGCG GCGGCATGCT GAACGCATCC
GAAGGAACGG ACCTGCCCGC GACGGTCGCG GACAAGATCG CTCATGCCTT CGGCGCCACG
TTCTGGTGGG TCGTCGGCCT GACCGTGCTC GGTCTCGTGT CCAGCATTTT CCTGCCCCGC
CATGCCCCGA AGCCCACCGC CGCCCCCGAC GGGCATGCCG GGGATGGGGA GGACGAGCCG
GCCGCGGAGG GAGAGCACGT CCCGGTCGGC ACTCCGGCAC CTGATCCAGC CGTGTGA
 
Protein sequence
MSDQAHEAGA EPVEALAPVD AAEPDAAEPA ATEPAATEIA ATEINATEVD PALRRLALTV 
TLGAIMAILD TTIVAVAINT LGRDFDASLS TISWVSTGYL LALAVVIPLT GWSVERFGAT
RMWNISLVLF LAGSALCGAA WSAGSLIVFR VLQGLGGGMI MPICMTLLAS AAGPQRIGRV
MNIVGVPALV APILGPVIGG LLVDNLDWRW IFFVNLPIGA VALVASWRVL PRDDRGQSHH
RLDVPGLLLI SPGLAALVYG LSEAGSGDGF GALRVQISTA AGVVALVAFV VHALRREGAL
LDLRLFGDRT FTVAGVTTFM VGAGLFGGMF LLPLYFQVAR GQSALAAGLS LVPQGVGAMI
GMPIAGRIAD RRGAGYVVPV GMAVCLLGTV AFTQVDAHTN TVALGALLFV RGLGFGASMM
PAMSAAYVTL RPAAVPRATT TLNILQRVGG SIATALLAVE LQHGITSRLP GSGGGMLNAS
EGTDLPATVA DKIAHAFGAT FWWVVGLTVL GLVSSIFLPR HAPKPTAAPD GHAGDGEDEP
AAEGEHVPVG TPAPDPAV
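
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes translation table 11, as listed in the record, whose amino acid assignments match the standard genetic code; the only table 11 feature handled here is that an alternative start codon such as GTG is read as methionine in the first position. The helper names (`clean`, `translate`, `gc_percent`) and the restriction to three start codons are simplifications introduced for this example, not part of the record.

```python
# Amino acids for all 64 codons, listed with bases in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Simplifying assumption: only these common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating GTG or TTG still encodes Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "GTGTCCGATC AAGCGCACGA AGCGGGGGCG"
print(translate(first_blocks))  # MSDQAHEAGA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the protein sequence and the GC content field to be checked in the same way.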