Gene EcolC_0466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0466 
Symbol 
ID6068357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp507509 
End bp509476 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content52% 
IMG OID641599871 
Productp-hydroxybenzoic acid efflux subunit AaeB 
Protein accessionYP_001723470 
Protein GI170018516 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.165029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATTT TCTCCATTGC TAACCAACAT ATTCGCTTTG CGGTAAAACT GGCGACCGCC 
ATTGTACTGG CGCTGTTTGT TGGCTTTCAC TTCCAGCTGG AAACGCCACG CTGGGCGGTA
CTGACAGCGG CGATTGTTGC TGCCGGTCCG GCCTTTGCTG CGGGAGGTGA ACCGTATTCT
GGCGCTATTC GCTATCGTGG CTTTTTGCGC ATCATCGGCA CATTTATTGG CTGTATTGCC
GGACTGGTGA TCATCATTGC GATGATCCGC GCACCATTAT TGATGATTCT GGTGTGCTGT
ATCTGGGCCG GTTTTTGTAC CTGGATATCC TCGCTGGTAC GAATAGAAAA CTCGTATGCG
TGGGGGCTGG CCGGTTATAC CGCGCTGATC ATTGTGATCA CCATTCAGCC TGAACCATTG
CTTACGCCGC AGTTTGCCGT CGAACGTTGT AGCGAGATCG TTATCGGTAT TGTGTGTGCA
ATTATGGCGG ATTTGCTCTT TTCTCCGCGA TCGATCAAAC AAGAAGTGGA TCGAGAGCTG
GAAAGTTTGC TGGTCGCGCA ATATCAATTA ATGCAACTCT GTATCAAGCA TGGCGATGGT
GAAGTTGTCG ATAAAGCCTG GGGCGACCTG GTTCGACGCA CCACGGCGCT ACAAGGTATG
CGCAGCAACC TGAATATGGA ATCTTCCCGC TGGGCGCGGG CCAATCGACG TTTAAAAGCG
ATCAATACGC TATCGCTGAC GCTGATTACC CAATCCTGCG AAACTTATCT TATTCAGAAT
ACGCGCCCGG AATTGATCAC TGATACTTTC CGCGAATTTT TTGACACACC GGTAGAAACC
GCGCAGGACG TCCACAAGCA GCTCAAACGC CTGCGAAGAG TTATCGCCTG GACCGGGGAA
CGGGAAACGC CTGTCACCAT TTATAGCTGG GTCGCGGCGG CAACGCGTTA TCAGCTTCTC
AAGCGCGGCG TTATCAGTAA CACAAAAATC AACGCCACCG AAGAAGAGAT CCTGCAAGGC
GAACCGGAAG TAAAAGTAGA GTCAGCCGAA CGTCATCATG CAATGGTTAA CTTCTGGCGA
ACCACACTTT CCTGCATTCT GGGAACGCTT TTCTGGCTGT GGACGGGCTG GACTTCCGGT
AGTGGTGCAA TGGTGATGAT TGCGGTCGTG ACGTCACTGG CAATGCGTTT GCCGAATCCA
CGCATGGTGG CGATCGACTT TATCTACGGG ACGCTGGCCG CGCTGCCGTT AGGGCTGCTC
TACTTTTTGG TGATTATCCC TAATACCCAA CAGAGCATGT TGCTGCTGTG TATTAGCCTG
GCAGTGCTGG GATTCTTCCT CGGTATAGAA GTACAGAAAC GGCGACTGGG CTCGATGGGG
GCACTGGCCA GCACCATAAA TATTATCGTG CTGGATAACC CGATGACTTT CCATTTCAGT
CAGTTTCTCG ACAGCGCATT AGGGCAAATC GTCGGCTGTG TGCTCGCGTT CACCGTTATT
TTGCTGGTGC GGGATAAATC GCGCGACAGG ACTGGACGTG TACTGCTTAA TCAGTTTGTT
TCTGCCGCTG TTTCCGCGAT GACTACCAAT GTGGCACGTC GTAAAGAGAA CCACCTCCCG
GCACTTTATC AGCAGCTGTT TTTGCTGATG AATAAGTTCC CAGGGGATTT GCCGAAATTT
CGCCTGGCGC TGACGATGAT TATCGCCCAC CAGCGCCTGC GTGATGCGCC GATCCCGGTT
AACGAGGATT TATCGGCGTT TCACCGACAA ATGCGCCGCA CAGCAGACCA TGTGATATCT
GCCCGTAGCG ATGATAAACG TCGTCGGTAC TTTGGTCAGT TGCTGGAAGA ACTGGAAATC
TACCAGGAAA AGCTACGCAT CTGGCAAGCT CCACCGCAGG TGACGGAACC GGTTCATCGG
CTGGCGGGGA TGCTCCATAA GTATCAACAT GCGTTGACCG ATAGTTAA
 
Protein sequence
MGIFSIANQH IRFAVKLATA IVLALFVGFH FQLETPRWAV LTAAIVAAGP AFAAGGEPYS 
GAIRYRGFLR IIGTFIGCIA GLVIIIAMIR APLLMILVCC IWAGFCTWIS SLVRIENSYA
WGLAGYTALI IVITIQPEPL LTPQFAVERC SEIVIGIVCA IMADLLFSPR SIKQEVDREL
ESLLVAQYQL MQLCIKHGDG EVVDKAWGDL VRRTTALQGM RSNLNMESSR WARANRRLKA
INTLSLTLIT QSCETYLIQN TRPELITDTF REFFDTPVET AQDVHKQLKR LRRVIAWTGE
RETPVTIYSW VAAATRYQLL KRGVISNTKI NATEEEILQG EPEVKVESAE RHHAMVNFWR
TTLSCILGTL FWLWTGWTSG SGAMVMIAVV TSLAMRLPNP RMVAIDFIYG TLAALPLGLL
YFLVIIPNTQ QSMLLLCISL AVLGFFLGIE VQKRRLGSMG ALASTINIIV LDNPMTFHFS
QFLDSALGQI VGCVLAFTVI LLVRDKSRDR TGRVLLNQFV SAAVSAMTTN VARRKENHLP
ALYQQLFLLM NKFPGDLPKF RLALTMIIAH QRLRDAPIPV NEDLSAFHRQ MRRTADHVIS
ARSDDKRRRY FGQLLEELEI YQEKLRIWQA PPQVTEPVHR LAGMLHKYQH ALTDS