Gene EcolC_0115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0115 
Symbol 
ID6065097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp123447 
End bp124583 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content53% 
IMG OID641599517 
Productsecretion protein HlyD family protein 
Protein accessionYP_001723126 
Protein GI170018172 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0435233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTAT TGATTGTTTT AACTTACGTG GCGCTGGCGT GGGCGGTCTT TAAAATCTTC 
CGCATTCCGG TAAATCAGTG GACGCTGGCG ACGGCGGCGC TGGGAGGCGT GTTTCTGGTG
AGTGGTTTGA TTTTGTTGAT GAACTACAAC CACCCTTACA CTTTTACCGC GCAAAAGGCA
GTGATAGCGA TCCCTATCAC GCCACAGGTG ACGGGAATTG TTACTGAAGT CACTGACAAG
AATAATCAGC TTATTCAAAA GGGCGAGGTG CTTTTTAAGC TCGACCCGGT TCGTTACCAG
GCGCGAGTTG ACAGACTTCA GGCTGACCTG ATGACGGCGA CGCATAATAT AAAGACGCTG
CGTGCGCAGC TCACTGAAGC GCAGGCCAAC ACCACCCAGG TTTCAGCGGA GCGCGACCGT
CTGTTTAAAA ATTATCAACG TTACTTGAAT GGCAGCCAGG CGGCGGTGAA TCCGTTCTCG
GAACGTGACA TCGACGATGC GCGGCAAAAT TTCCTCGCGC AGGATGCGCT GGTGAAAGGC
TCGGTGGCGG AGCAGGCGCA GATCCAGAGC CAGCTCGACA GTATGGTTAA CGGCGAGCAA
TCGCAGATTG TGAGCTTAAG AGCGCAACTT ACTGAAGCAA AATATAACCT TGAGCAGACT
GTCATTCGCG CGCCGAGCAA TGGCTACGTT ACTCAGGTAC TGATCCGCCC AGGTACATAC
GCAGCTGCCT TGCCGCTGCG TCCGGTGATG GTCTTCATCC CCGAGCAAAA ACGGCAAATT
GTCGCCCAAT TTCGGCAAAA CTCGCTGTTA CGTCTGAAAC CCGGCGATGA TGCGGAAGTG
GTGTTTAACG CGCTACCTGG GCAGGTGTTT CACGGCAAAC TGACTAGTAT TTTACCTGTC
GTGCCAGGCG GTTCTTATCA GGCGCAGGGG GTATTGCAAT CATTAACGGT CGTGCCCGGC
ACGGACGGTG TGCTGGGAAC CATTGAACTG GACCCTAACG ATGATATCGA TGCCTTACCC
GACGGCATCT ACGCCCAGGT GGCGGTTTAC TCCGACCATT TCAGCCATGT TTCGGTGATG
CGGAAAGTGC TGCTAAGAAT GACCAGCTGG ATGCATTATC TTTATTTGGA TCATTGA
 
Protein sequence
MDLLIVLTYV ALAWAVFKIF RIPVNQWTLA TAALGGVFLV SGLILLMNYN HPYTFTAQKA 
VIAIPITPQV TGIVTEVTDK NNQLIQKGEV LFKLDPVRYQ ARVDRLQADL MTATHNIKTL
RAQLTEAQAN TTQVSAERDR LFKNYQRYLN GSQAAVNPFS ERDIDDARQN FLAQDALVKG
SVAEQAQIQS QLDSMVNGEQ SQIVSLRAQL TEAKYNLEQT VIRAPSNGYV TQVLIRPGTY
AAALPLRPVM VFIPEQKRQI VAQFRQNSLL RLKPGDDAEV VFNALPGQVF HGKLTSILPV
VPGGSYQAQG VLQSLTVVPG TDGVLGTIEL DPNDDIDALP DGIYAQVAVY SDHFSHVSVM
RKVLLRMTSW MHYLYLDH