Gene EcolC_1302 details

Gene Information

Locus tag: EcolC_1302
Symbol:
ID: 6068565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1428907
End bp: 1430445
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 42%
IMG OID: 641600723
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001724295
Protein GI: 170019341
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
            [TIGR01168] Gram-positive signal peptide, YSIRK family
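
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, although both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1428907
end_bp = 1430445

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes one terminal stop codon,
# which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```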


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATCA CTAAATCAAC TCCGGCACCA TTAACCGGTG GGACGTTATG GTGCGTCACT 
ATTGCATTGT CATTAGCGAC ATTTATGCAA ATGTTGGATT CCACTATTTC TAACGTCGCA
ATACCGACAA TATCTGGCTT TCTGGGAGCA TCAACAGACG AAGGCACCTG GGTTATCACC
TCGTTTGGTG TAGCAAATGC CATTGCGATC CCTGTTACTG GCAGGTTGGC ACAAAGAATA
GGCGAATTAA GATTATTTTT ACTTTCAGTC ACTTTTTTTT CGCTGTCTTC ATTAATGTGT
AGCCTATCGA CCAATCTTGA TGTGCTGATA TTTTTTAGAG TCGTTCAGGG GTTAATGGCG
GGGCCGTTAA TTCCACTGTC ACAGAGTTTA TTATTAAGGA ATTATCCGCC AGAAAAAAGA
ACATTTGCTC TGGCATTATG GTCAATGACC GTGATTATCG CTCCGATATG TGGGCCGATA
TTGGGCGGTT ATATTTGTGA TAACTTTAGC TGGGGTTGGA TATTTTTAAT CAATGTCCCT
ATGGGGATTA TCGTCCTGAC ATTATGCTTA ACCTTACTTA AAGGAAGAGA AACTGAGACT
TCACCGGTCA AAATGAATCT ACCAGGACTG ACCCTGTTAG TGCTCGGTGT TGGTGGCTTG
CAAATTATGC TTGATAAAGG GCGCGATCTG GATTGGTTCA ACTCGAGTAC AATAATAATA
TTAACAGTAG TATCAGTTAT TTCTCTGATC TCTTTAGTCA TTTGGGAGTC GACCTCAGAG
AACCCGATTC TTGATCTCAG TTTGTTTAAG TCCCGTAACT TCACCATTGG TATTGTGAGT
ATCACATGCG CGTATTTATT TTACTCTGGA GCGATCGTCC TTATGCCGCA GTTACTCCAG
GAAACGATGG GGTATAATGC GATATGGGCC GGACTTGCTT ATGCGCCCAT CGGCATCATG
CCACTATTAA TTTCACCTTT GATAGGACGT TATGGCAACA AAATAGACAT GCGGTTGTTA
GTGACATTTA GTTTTTTGAT GTATGCGGTT TGCTATTACT GGCGTTCTGT GACATTTATG
CCAACGATTG ATTTTACAGG CATCATTTTG CCGCAGTTTT TTCAGGGATT CGCCGTTGCC
TGTTTCTTTT TACCCTTAAC AACGATTTCG TTTTCAGGCT TGCCAGATAA TAAATTTGCC
AATGCCTCGA GTATGAGTAA TTTTTTTCGT ACCTTGTCAG GATCAGTTGG TACGTCGTTG
ACAATGACGC TGTGGGGACG ACGCGAATCG TTACACCATA GTCAGTTGAC AGCAACCATC
GATCAATTTA ACCCCGTGTT TAATTCATCG TCACAAATTA TGGATAAATA TTATGGTTCG
CTTTCAGGAG TTCTTAATGA AATTAATAAT GAAATAACCC AGCAGTCACT TTCTATTTCT
GCAAATGAGA TTTTCCGTAT GGCGGCTATT GCTTTTATCT TACTTACGGT TTTGGTTTGG
TTTGCGAAAC CGCCGTTTAC AGCGAAAGGC GTTGGGTGA
 
Protein sequence
MAITKSTPAP LTGGTLWCVT IALSLATFMQ MLDSTISNVA IPTISGFLGA STDEGTWVIT 
SFGVANAIAI PVTGRLAQRI GELRLFLLSV TFFSLSSLMC SLSTNLDVLI FFRVVQGLMA
GPLIPLSQSL LLRNYPPEKR TFALALWSMT VIIAPICGPI LGGYICDNFS WGWIFLINVP
MGIIVLTLCL TLLKGRETET SPVKMNLPGL TLLVLGVGGL QIMLDKGRDL DWFNSSTIII
LTVVSVISLI SLVIWESTSE NPILDLSLFK SRNFTIGIVS ITCAYLFYSG AIVLMPQLLQ
ETMGYNAIWA GLAYAPIGIM PLLISPLIGR YGNKIDMRLL VTFSFLMYAV CYYWRSVTFM
PTIDFTGIIL PQFFQGFAVA CFFLPLTTIS FSGLPDNKFA NASSMSNFFR TLSGSVGTSL
TMTLWGRRES LHHSQLTATI DQFNPVFNSS SQIMDKYYGS LSGVLNEINN EITQQSLSIS
ANEIFRMAAI AFILLTVLVW FAKPPFTAKG VG
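
Both sequences are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-fraction helper and a basic reading-frame check. The function names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the standard one used by translation table 11. Only the first and last lines of the gene sequence are included here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by removing spaces and line breaks."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def ends_in_frame_stop(dna: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """Check that the length is a multiple of 3 and the last codon is a stop."""
    return len(dna) % 3 == 0 and dna[-3:] in stop_codons


# First and last lines of the gene sequence as printed in the record.
first_line = "ATGGCAATCA CTAAATCAAC TCCGGCACCA TTAACCGGTG GGACGTTATG GTGCGTCACT"
last_line = "TTTGCGAAAC CGCCGTTTAC AGCGAAAGGC GTTGGGTGA"

print(len(clean_sequence(first_line)))  # bases on the first printed line
print(clean_sequence(last_line)[-3:])   # final codon of the gene
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field, and `ends_in_frame_stop` offers a quick sanity check that the coding sequence was copied completely.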