Gene Franean1_1542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1542 
Symbol 
ID5675682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1842253 
End bp1843851 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content70% 
IMG OID641240461 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001505887 
Protein GI158313379 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.95287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.532549 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAGC CCGTCATCGC CGACGATCAG CGGACGCGGT CCGCGCCAGC GGAGGGCACC 
GGCGGCCGGC GGGGCAATCC CTGGTGGACG CTGACGAGTG TCGCCCTCGG CGTGATCATG
GTCGGCCTGG ACGGGACGGT CGTCTCGATC GCGAACCCGC GCATCGCCGA GGATCTCGGC
GCCTCACTGA CCGACCTGCA GTGGATCACG AACTCCTACC TGCTCGCCCT GGCCGCCCTG
CTGGTGTTCG GCGGCAAGCT CGGCGACCGG TACGGCCGCA AGCTGATCTT TCTGATCGGC
GTGGTCGGGT TCGGGCTCAC CTCACTGGCC ATCGGCCTGG TCGGCAACAT CGTCGGGATC
ATCGCGCTCC GGGCGCTCCA GGGCGTGTTC GGAGCCATGC TGATGCCGAA CACGCTGGCC
ATCCTGCGCG GCGCGTTCCC GCCCAAGGAG CTGAACCGGG CGATCGGCAT CTGGAGCGGC
GCGTCGTCCA TCTCGATCGC CGGCGGACCG ATCATCGGCG GCCTGCTGGT CGAGCACGTG
AGCTGGGAGT CGGTCTTCTA CATCAACGTG CCGATGGGCG CGATCGCGCT GGCCGTGGGC
CTGGCGGTGC TGCGTGAGTC CCGCAGCGAG AGCACCGGGC AGCGCCACGA CATCCCCGGC
ATCATCACCC TGTCCGGCGG CCTGGTCGGC CTTGTGTTCG GCCTGATCAA GGCGTCGACC
TGGGGCTGGA CAGACCCGAA GACGCTCGGC TGCGTCTTCG CCGGGCTGGC CGTGCTGGTG
CTGTTCACGG TGATCGAGAC CCGGGTGGCC GCTCCCCTGC TGCCGATGCG GCTCTTCGGC
AACCGGTCGA TCTCGGTGGG CAGCGCTGTT CTCGTCATCA ACTTCTTCGC GCTGTTCGGT
GTGCTGTTCT TCGTCACGCT GTTCCTGCAG AACGTCCAGG ACACCTCGCC GATCGAGACC
GGTGTCCGCA TTCTGCCGCT GACCCTGGCG ATGATGATCA TGTCACCGAT CGCAGGCAGC
GCCACCGAGC GGTTCGGGCC CCGCCCGCCG ATGGTGATCG GCCTGGTGCT GTCCGGGACG
GCACTGCTCC TGTTGACCGG GCTCGAGCCC GGCTCCAGCT TCAACGCGCT GTGGCCGTCG
CTGCTGATGC TCGGCATCGG GATGGGCCTG GTCATCACGG CCAGCGCGGA GGCGATTGTC
GGCAACGCAC CGGTCGACGA CGCCGGGGTC GCCGGTGGCC TGCAGACCAC GGCGTTGCAG
CTCGGCGGGG TCGTCGGGAC GGCCGTCCTG GGCTCGGTGC TCAGCAGCCG CGTGGCATCG
GTCATGGTCG ACAAGCTGAC CGGCGCCGGC ACCCCCACCG AGGTCGCGAA CCGGCTGACC
GGCTCGGAGC AGCTCATCAG CCAGGGAGTG GCCCCCCAGG TGACCGGCGC GTCGGAATCC
GTCCAGGCCG CCGTGACCGC GGGCAGCCAC GCGGCCTTCA TCACCGGCCT GCACGTCTCC
CTCGTCGTCG CCGGCATCGC GACGCTCGTC GCGGCCGGGC TGGCCCTCCT GGTCCGGCGC
GGCGACAGCA GCGGCGGAAC CCCCGTCGTC GTGCACTGA
 
Protein sequence
MSQPVIADDQ RTRSAPAEGT GGRRGNPWWT LTSVALGVIM VGLDGTVVSI ANPRIAEDLG 
ASLTDLQWIT NSYLLALAAL LVFGGKLGDR YGRKLIFLIG VVGFGLTSLA IGLVGNIVGI
IALRALQGVF GAMLMPNTLA ILRGAFPPKE LNRAIGIWSG ASSISIAGGP IIGGLLVEHV
SWESVFYINV PMGAIALAVG LAVLRESRSE STGQRHDIPG IITLSGGLVG LVFGLIKAST
WGWTDPKTLG CVFAGLAVLV LFTVIETRVA APLLPMRLFG NRSISVGSAV LVINFFALFG
VLFFVTLFLQ NVQDTSPIET GVRILPLTLA MMIMSPIAGS ATERFGPRPP MVIGLVLSGT
ALLLLTGLEP GSSFNALWPS LLMLGIGMGL VITASAEAIV GNAPVDDAGV AGGLQTTALQ
LGGVVGTAVL GSVLSSRVAS VMVDKLTGAG TPTEVANRLT GSEQLISQGV APQVTGASES
VQAAVTAGSH AAFITGLHVS LVVAGIATLV AAGLALLVRR GDSSGGTPVV VH