Gene EcolC_1022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1022 
Symbol 
ID6066872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1111464 
End bp1112636 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content53% 
IMG OID641600435 
Productefflux pump membrane protein 
Protein accessionYP_001724018 
Protein GI170019064 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR00998] efflux pump membrane protein (multidrug resistance protein A) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.727271 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0227655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAA ATGCGGAGAC TCAAACCCCG CAGCAACCGG TAAAGAAGAG CGGCAAACGT 
AAGCGTCTGC TCCTCCTTCT CACCTTGCTC TTTATAATTA TTGCCGTAGC GATAGGGATT
TATTGGTTTT TGGTACTGCG TCACTTCGAA GAAACCGATG ACGCATACGT GGCAGGGAAT
CAAATTCAAA TTATGTCTCA GGTGTCTGGC AGCGTGACGA AAGTCTGGGC CGATAACACC
GATTTTGTAA AAGAAGGCGA CGTGCTGGTC ACTCTCGACC CGACAGATGC TCGCCAGGCG
TTTGAAAAAG CCAAAACTGC ACTGGCTTCC AGCGTTCGCC AAACCCACCA GCTGATGATT
AACAGCAAGC AGTTGCAGGC GAATATTGAG GTGCAGAAAA TCGCCCTCGC GAAAGCACAA
AGCGACTACA ACCGCCGTGT GCCGCTGGGC AATGCCAACC TGATTGGTCG CGAAGAGCTG
CAACACGCCC GCGACGCCGT CACCAGTGCC CAGGCGCAAC TGGACGTCGC GATTCAACAA
TACAATGCCA ATCAGGCGAT GATTCTGGGG ACTAAACTGG AAGATCAGCC AGCCGTGCAA
CAGGCTGCCA CCGAAGTACG TAACGCCTGG CTGGCGCTGG AGCGTACTCG TATTGTCAGT
CCGATGACCG GTTATGTCTC CCGCCGCGCG GTACAGCCTG GGGCGCAAAT TAGCCCAACG
ACGCCGCTGA TGGCGGTCGT TCCAGCCACC AATATGTGGG TGGATGCCAA CTTTAAAGAG
ACGCAGATTG CCAATATGCG TATCGGTCAG CCGGTCACTA TCACCACGGA TATTTACGGC
GATGATGTGA AATACACCGG TAAAGTGGTT GGTCTGGATA TGGGCACAGG TAGCGCGTTC
TCACTGCTTC CAGCGCAAAA TGCGACCGGT AACTGGATCA AAGTCGTTCA GCGTCTGCCT
GTGCGTATCG AACTGGACCA GAAACAGCTG GAGCAATATC CGCTGCGTAT CGGTTTGTCC
ACGCTGGTGA GCGTCAATAC CACTAACCGT GACGGTCAGG TACTGGCAAA TAAAGTACGT
TCCACTCCGG TAGCGGTAAG CACCGCGCGT GAAATCAGCC TGGCACCTGT CAATAAACTG
ATCGACGATA TCGTAAAAGC TAACGCTGGC TAA
 
Protein sequence
MSANAETQTP QQPVKKSGKR KRLLLLLTLL FIIIAVAIGI YWFLVLRHFE ETDDAYVAGN 
QIQIMSQVSG SVTKVWADNT DFVKEGDVLV TLDPTDARQA FEKAKTALAS SVRQTHQLMI
NSKQLQANIE VQKIALAKAQ SDYNRRVPLG NANLIGREEL QHARDAVTSA QAQLDVAIQQ
YNANQAMILG TKLEDQPAVQ QAATEVRNAW LALERTRIVS PMTGYVSRRA VQPGAQISPT
TPLMAVVPAT NMWVDANFKE TQIANMRIGQ PVTITTDIYG DDVKYTGKVV GLDMGTGSAF
SLLPAQNATG NWIKVVQRLP VRIELDQKQL EQYPLRIGLS TLVSVNTTNR DGQVLANKVR
STPVAVSTAR EISLAPVNKL IDDIVKANAG