Gene EcolC_1301 details


Gene Information

Locus tag: EcolC_1301
Symbol:
ID: 6068571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1427744
End bp: 1428907
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 42%
IMG OID: 641600722
Product: efflux pump membrane protein
Protein accession: YP_001724294
Protein GI: 170019340
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID: [TIGR00998] efflux pump membrane protein (multidrug resistance protein A)
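
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1427744, 1428907)
print(length, protein_length_aa(length))
```

With the record's values this gives 1164 bp and 387 aa, matching the Gene Length and Protein Length fields.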


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.632829
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAACAGA TTAATTCAAA TAAAAAACAT TCTAACAGAA GAAAATACTT TTCTTTATTG 
GCGGTAGTTT TATTTATTGC GTTTTCAGGT GCCTATGCCT ATTGGTCAAT GGAATTAGAA
GACATGATTA GTACAGATGA CGCCTATGTC ACGGGGAATG CAGATCCAAT TTCTGCACAA
GTCTCAGGTA GTGTCACTGT CGTTAATCAT AAAGATACGA ACTACGTTCG ACAAGGTGAC
ATTTTAGTTT CACTGGATAA AACTGATGCC ACTATCGCAC TCAATAAAGC TAAAAATAAT
CTGGCAAATA TTGTTCGGCA AACGAATAAA CTATACTTAC AGGATAAACA ATACAGTGCC
GAAGTCGCTT CAGCACGTAT TCAGTATCAA CAATCTTTAG AAGATTATAA CCGTCGAGTG
CCGTTAGCGA AGCAGGGGGT TATTTCAAAA GAAACGCTGG AGCATACCAA AGATACGTTA
ATAAGTAGCA AAGCGGCATT GAATGCCGCT ATCCAGGCTT ATAAAGCGAA TAAAGCTTTA
GTAATGAACA CACCATTAAA CCGTCAGCCA CAAGTCGTTG AAGCGGCGGA TGCAACTAAA
GAAGCCTGGT TGGCGCTTAA ACGTACGGAT ATTAAGAGTC CGGTTACCGG CTATATTGCC
CAGAGAAGTG TTCAGGTCGG CGAAACAGTG AGCCCCGGAC AATCGTTAAT GGCTGTCGTA
CCGGCACGTC AAATGTGGGT TAATGCCAAC TTTAAAGAAA CACAACTCAC GGATGTACGG
ATTGGTCAAT CGGTCAATAT TATCAGCGAT CTTTATGGTG AAAATGTTGT GTTTCATGGT
CGGGTGACAG GGATCAATAT GGGAACCGGC AATGCGTTCT CCTTATTACC TGCACAAAAT
GCGACAGGGA ACTGGATCAA AATCGTTCAG CGTGTACCGG TTGAAGTTTC TCTTGATCCA
AAAGAACTCA TGGAACACCC CTTGCGTATT GGTTTATCGA TGACAGCAAC TATTGATACG
AAGAACGAAG ACATTGCCGA GATGCCTGAG CTGGCTTCAA CCGTGACCTC CATGCCGGCT
TATACCAGTA AGGCTTTAGT TATCGATACC AGTCCGATAG AAAAAGAAAT TAGCAACATT
ATTTCGCATA ATGGACAACT TTAA
 
Protein sequence
MEQINSNKKH SNRRKYFSLL AVVLFIAFSG AYAYWSMELE DMISTDDAYV TGNADPISAQ 
VSGSVTVVNH KDTNYVRQGD ILVSLDKTDA TIALNKAKNN LANIVRQTNK LYLQDKQYSA
EVASARIQYQ QSLEDYNRRV PLAKQGVISK ETLEHTKDTL ISSKAALNAA IQAYKANKAL
VMNTPLNRQP QVVEAADATK EAWLALKRTD IKSPVTGYIA QRSVQVGETV SPGQSLMAVV
PARQMWVNAN FKETQLTDVR IGQSVNIISD LYGENVVFHG RVTGINMGTG NAFSLLPAQN
ATGNWIKIVQ RVPVEVSLDP KELMEHPLRI GLSMTATIDT KNEDIAEMPE LASTVTSMPA
YTSKALVIDT SPIEKEISNI ISHNGQL
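
The gene sequence starts with GTG while the protein starts with M, which is expected under translation table 11, where GTG can act as an initiation codon. The following is a minimal sketch of that translation, not the database's own pipeline. The function name and the `ALT_STARTS` set are assumptions made for illustration, and only GTG and TTG are treated as alternative starts here.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order; table 11 shares
# these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption for this sketch: only these alternative start codons are handled.
ALT_STARTS = {"GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGGAACAGA TTAATTCAAA TAAAAAACAT"))
```

Applied to the first 30 bases of the gene sequence, this yields MEQINSNKKH, the first block of the protein sequence.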