Gene EcHS_A3803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3803 
Symbol 
ID5591769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3796773 
End bp3797909 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID640922915 
Productmembrane fusion protein family protein 
Protein accessionYP_001460393 
Protein GI157163075 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.456542 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTAT TGATTGTTTT AACTTACGTG GCGCTGGCGT GGGCGGTCTT TAAAATCTTT 
CGTATTCCGG TGAATCAGTG GACGCTGGCG ACGGCGGCGC TGGGTGGCGT GTTTCTGGTG
AGTGGTTTGA TTTTGTTGAT GAACTACAAC CACCCTTACA CTTTTACCGC GCAAAAGGCA
GTGATAGCGA TCCCCATCAC GCCACAGGTG ACGGGAATTG TTACTGAAGT CACTGACAAG
AATAATCAGC TTATTCAAAA GGGCGAGGTG CTTTTTAAGC TCGACCCGGT TCGTTACCAG
GCGCGAGTTG ACAGGCTTCA GGCTGACCTG ATGACGGCGA CGCATAATAT AAAGACTCTG
CGCGCGCAGC TCACAGAAGC GCAGGCCAAC ACCACCCAGG TTTCAGCGGA GCGCGACCGT
CTGTTTAAAA ATTATCAACG TTATCTGAAA GGCAGCCAGG CGGCGGTGAA TCCGTTCTCG
GAACGTGACA TCGACGATGC GCGGCAAAAT TTCCTCGCGC AGGATGCGCT GGTGAAAGGC
TCGGTGGCGG AGCAGGCGCA GATCCAGAGC CAGCTCGACA GTATGGTTAA CGGCGAGCAA
TCGCAGATTG TGAGCTTAAG AGCGCAACTT ACTGAAGCAA AATATAATCT TGAGCAGACT
GTCATTCGCG CACCAAGCAA TGGCTACGTC ACTCAGGTAC TGATCCGCCC AGGCACATAC
GCAGCTGCCT TGCCGTTGCG TCCGGTGATG GTTTTCATCC CCGAGCAAAA ACGGCAAATT
GTCGCCCAAT TTCGGCAAAA CTCGCTGTTA CGTCTGAAAC CTGGTGATGA TGCAGAAGTG
GTGTTTAACG CGCTACCTGG GCAGGTGTTC CACGGCAAAC TGACCAGTAT TTTACCTGTC
GTGCCAGGCG GTTCTTATCA GGCGCAGGGG GTATTGCAAT CATTAACGGT CGTGCCCGGC
ACGGACGGTG TGCTGGGAAC CATTGAACTG GACCCTAACG ATGATATCGA TGCCTTACCC
GACGGCATCT ACGCCCAGGT GGCGGTTTAC TCCGACCATT TCAGCCATGT TTCGGTGATG
CGGAAAGTGC TGCTAAGAAT GACCAGCTGG ATGCATTATC TTTATTTGGA TCATTGA
 
Protein sequence
MDLLIVLTYV ALAWAVFKIF RIPVNQWTLA TAALGGVFLV SGLILLMNYN HPYTFTAQKA 
VIAIPITPQV TGIVTEVTDK NNQLIQKGEV LFKLDPVRYQ ARVDRLQADL MTATHNIKTL
RAQLTEAQAN TTQVSAERDR LFKNYQRYLK GSQAAVNPFS ERDIDDARQN FLAQDALVKG
SVAEQAQIQS QLDSMVNGEQ SQIVSLRAQL TEAKYNLEQT VIRAPSNGYV TQVLIRPGTY
AAALPLRPVM VFIPEQKRQI VAQFRQNSLL RLKPGDDAEV VFNALPGQVF HGKLTSILPV
VPGGSYQAQG VLQSLTVVPG TDGVLGTIEL DPNDDIDALP DGIYAQVAVY SDHFSHVSVM
RKVLLRMTSW MHYLYLDH