Gene EcolC_4240 details

Gene Information

Locus tag: EcolC_4240
Symbol:
ID: 6067919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4689376
End bp: 4690803
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 49%
IMG OID: 641603677
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001727163
Protein GI: 170022209
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
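
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4689376
end_bp = 4690803

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and yields no amino acid.
stop_codon_len = 3
protein_length = (gene_length - stop_codon_len) // 3

print(gene_length, protein_length)  # prints "1428 475"
```

Under these assumptions the computed values agree with the listed Gene Length (1428 bp) and Protein Length (475 aa).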


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000183505
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00293146
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCGATA AAAAGAAGCG CAGTATGGCG GGTTTGCCGT GGATCGCGGC GATGGCCTTC 
TTCATGCAGG CACTTGATGC CACTATTCTG AATACCGCCT TACCCGCAAT CGCTCATAGC
CTTAATCGTT CTCCTCTCGC GATGCAATCA GCCATCATCA GTTATACGCT GACGGTGGCG
ATGCTTATTC CGGTAAGCGG ATGGCTAGCC GATCGCTTCG GTACGCGTCG CATTTTTACC
CTTGCCGTGA GTCTGTTCAC ATTGGTTTCT CTGGCCTGCG CACTTTCTAA TTCGCTACCA
CAGCTGGTTG TCTTCCGGGT TATTCAAGGG ATAGGCGGCG CAATGATGAT GCCTGTTGCT
CGGCTGGCCT TACTGCGTGC TTATCCTCGT AATGAACTTC TTCCGGTATT GAATTTTGTC
GCCATGCCGG GTCTGGTGGG GCCAATTTTA GGCCCCGTTC TTGGCGGCGT GCTGGTCACC
TGGGCAACCT GGCACTGGAT ATTTTTAATC AATATCCCCA TAGGTATTGC GGGCCTTCTT
TACGCGCGCA AACATATGCC CAATTTCACC ACCGCACGAC GCAGATTCGA TATCACTGGC
TTTTTGCTGT TTGGCCTCAG TCTTGTTCTC TTCTCAAGCG GAATAGAGCT ATTCGGGGAA
AAGATTGTCG CCAGCTGGAT TGCCTTGACG GTAATTGTCA CCAGCATCGG GTTACTGCTT
CTCTATATTC TCCATGCACG ACGCACGCCA AACCCATTAA TTTCATTAGA TTTATTTAAA
ACCCGCACTT TCTCGATCGG TATCGTAGGC AATATTGCAA CCCGTCTGGG GACCGGTTGT
GTACCGTTCC TTATGCCATT GATGTTACAG GTAGGATTTG GTTATCAGGC GTTTATTGCC
GGCTGTATGA TGGCACCGAC AGCGTTAGGT TCCATTATTG CAAAATCGAT GGTTACCCAA
GTCTTACGTC GTCTGGGCTA TCGCCATACA TTAGTGGGGA TCACGGTGAT TATTGGGCTA
ATGATCGCTC AGTTCTCTTT GCAATCACCG GCAATGGCGA TATGGATGCT GATCTTGCCG
TTGTTTATAT TAGGGATGGC TATGTCGACG CAATTTACCG CGATGAATAC CATCACACTT
GCCGATCTGA CCGATGACAA CGCCAGCAGC GGTAACAGTG TTCTGGCGGT CACGCAGCAA
CTGTCGATTA GTTTAGGCGT TGCTGTAAGT GCGGCCGTCC TTCGCGTTTA TGAAGGGATG
GAAGGCACAA CGACTGTCGA ACAATTCCAC TATACGTTTA TCACGATGGG CATTATTACT
GTTGCTTCAG CAGCAATGTT CATGCTTCTG AAAACAACCG ATGGTAATAA TTTGATCAAA
AGACAGCGTA AATCTAAGCC GAACCGCGTT CCATCAGAAT CGGAGTAA
 
Protein sequence
MSDKKKRSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA 
MLIPVSGWLA DRFGTRRIFT LAVSLFTLVS LACALSNSLP QLVVFRVIQG IGGAMMMPVA
RLALLRAYPR NELLPVLNFV AMPGLVGPIL GPVLGGVLVT WATWHWIFLI NIPIGIAGLL
YARKHMPNFT TARRRFDITG FLLFGLSLVL FSSGIELFGE KIVASWIALT VIVTSIGLLL
LYILHARRTP NPLISLDLFK TRTFSIGIVG NIATRLGTGC VPFLMPLMLQ VGFGYQAFIA
GCMMAPTALG SIIAKSMVTQ VLRRLGYRHT LVGITVIIGL MIAQFSLQSP AMAIWMLILP
LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAVS AAVLRVYEGM
EGTTTVEQFH YTFITMGIIT VASAAMFMLL KTTDGNNLIK RQRKSKPNRV PSESE
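
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. A minimal sketch of that clean-up, together with a GC-fraction calculation, follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here to keep the example short. Paste the full block to reproduce the whole-gene figure.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record (60 bases).
first_line = (
    "ATGAGCGATA AAAAGAAGCG CAGTATGGCG "
    "GGTTTGCCGT GGATCGCGGC GATGGCCTTC"
)

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

The same two helpers apply unchanged to the full gene sequence. `clean_sequence` alone is enough for the protein sequence, where GC content is not meaningful.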