Gene EcSMS35_3536 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3536 
SymbolaaeB 
ID6144594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3613146 
End bp3615113 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content52% 
IMG OID641618365 
Productp-hydroxybenzoic acid efflux subunit AaeB 
Protein accessionYP_001745512 
Protein GI170682336 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTATTT TCTCCATTGC TAACCAACAT ATTCGCTTTG CGGTAAAACT GGCGACCGCC 
ATTGTACTGG CGCTGTTTGT TGGCTTTCAC TTCCAGCTGG AAACGCCACG CTGGGCGGTA
CTGACAGCGG CGATTGTTGC CGCCGGTCCG GCCTTTGCTG CGGGAGGTGA ACCGTATTCT
GGCGCTATTC GCTATCGTGG CTTTTTGCGC ATCATCGGCA CATTTATTGG CTGTATTGCC
GGACTGGTAA TCATCATTGC GATGATCCGC GCACCATTAT TGATGATTCT GGTGTGCTGT
ATCTGGGCCG GTTTTTGTAC CTGGATATCC TCGCTGGTAC GAATAGAAAA CTCGTATGCG
TGGGGGCTGG CCGGTTATAC CGCGCTGATC ATTGTGATCA CTATTCAGCC GGAACCATTG
CTTACGCCGC AGTTTGCCGT CGAACGTTGT AGCGAGATCG TTATCGGTAT TGTGTGTGCA
ATTATGGCGG ATTTGCTCTT TTCTCCGCGA TCGATCAAAC AAGAAGTGGA TCGAGAGCTG
GAAAGTTTGC TGGTCGCGCA ATATCAATTA ATGCAACTCT GTATCAAACA TGGCGATGGT
GAAGTTGTCG ATAAAGCCTG GGGCGATCTG GTGCGACGTA CTACGGCGTT ACAAGGTATG
CGCAGCAACC TGAATATGGA ATCTTCCCGC TGGGCGCGGG CCAATCGACG TTTAAAAGCG
ATCAATACGC TATCGCTGAC GCTGATTACA CAATCCTGCG AAACTTATCT TATTCAGAAT
ACGCGCCCGG AATTGATCAC TGATACTTTC CGCGAATTTT TTGACACGCC GGTAGAAACC
GCGCAGGACG TCCACAAGCA GCTCAAACGC CTGCGGAGAG TGATCGCCTG GACCGGGGAA
CGGGAAACGC CTGTTACCAT TTATAGCTGG GTCGCGGCGG CAACGCGTTA TCAGCTTCTC
AAGCGCGGCG TTATCAGTAA CACAAAAATC AACGCCACCG AAGAAGAGAT CCTGCAAGGC
GAACCGGAAG TCAAAGTAGA GTCAGCCGAA CGTCATCATG CGATGGTTAA CTTCTGGCGA
ACCACACTTT CCTGCATTCT GGGGACGCTT TTCTGGCTGT GGACGGGCTG GACTTCCGGC
AGTGGTGCAA TGGTGATGAT TGCGGTAGTG ACGTCACTGG CAATGCGTTT GCCAAATCCA
CGCATGGTGG CGATCGACTT TATCTACGGG ACGCTGGCCG CGCTGCCGTT AGGGCTGCTC
TACTTTTTGG TGATTATCCC TAATACTCAA CAGAGCATGT TGCTGCTGTG TATTAGCCTG
GCAGTGCTGG GATTTTTCCT CGGTATAGAA GTACAGAAAC GGCGACTGGG CTCGATGGGG
GCGCTGGCCA GCACCATAAA TATTATCGTG CTGGATAACC CGATGACTTT CCATTTCAGT
CAGTTTCTCG ACAGCGCATT AGGGCAAATC GTCGGCTGTG TGCTCGCGTT CACCGTTATT
TTGCTGGTGC GGGATAAATC GCGCGACAGG ACTGGACGTG TACTGCTTAA TCAGTTTGTT
TCTGCCGCTG TTTCCGCGAT GACTACCAAT GTGGCACGTC GTAAAGAGAA TCACCTCCCG
GCACTTTATC AGCAGCTGTT TTTGCTGATG AATAAGTTCC CAGGGGATTT GCCGAAATTT
CGCCTGGCGC TGACGATGAT TATCGCGCAC CAGCGCCTGC GTGATGCGCC GATCCCGGTT
AACGAGGATT TATCGGCGTT TCACCGACAA ATGCGCCGCA CAGCAGACCA TGTGATATCT
GCCCGTAGCG ATGATAAACG TCGTCGGTAC TTTGGCCAGT TGCTGGAAGA ACTTGAAATC
TACCAGGAAA AGCTACGCAT CTGGCAAGCG CCACCGCAGG TGACGGAACC AGTACATCGG
CTTACGGGTA TGCTCCATAA GTATCAACAT GCGTTGACCG ATAGTTAA
 
Protein sequence
MGIFSIANQH IRFAVKLATA IVLALFVGFH FQLETPRWAV LTAAIVAAGP AFAAGGEPYS 
GAIRYRGFLR IIGTFIGCIA GLVIIIAMIR APLLMILVCC IWAGFCTWIS SLVRIENSYA
WGLAGYTALI IVITIQPEPL LTPQFAVERC SEIVIGIVCA IMADLLFSPR SIKQEVDREL
ESLLVAQYQL MQLCIKHGDG EVVDKAWGDL VRRTTALQGM RSNLNMESSR WARANRRLKA
INTLSLTLIT QSCETYLIQN TRPELITDTF REFFDTPVET AQDVHKQLKR LRRVIAWTGE
RETPVTIYSW VAAATRYQLL KRGVISNTKI NATEEEILQG EPEVKVESAE RHHAMVNFWR
TTLSCILGTL FWLWTGWTSG SGAMVMIAVV TSLAMRLPNP RMVAIDFIYG TLAALPLGLL
YFLVIIPNTQ QSMLLLCISL AVLGFFLGIE VQKRRLGSMG ALASTINIIV LDNPMTFHFS
QFLDSALGQI VGCVLAFTVI LLVRDKSRDR TGRVLLNQFV SAAVSAMTTN VARRKENHLP
ALYQQLFLLM NKFPGDLPKF RLALTMIIAH QRLRDAPIPV NEDLSAFHRQ MRRTADHVIS
ARSDDKRRRY FGQLLEELEI YQEKLRIWQA PPQVTEPVHR LTGMLHKYQH ALTDS