Gene EcolC_2547 details


Gene Information

Locus tag: EcolC_2547
Symbol:
ID: 6066277
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2794422
End bp: 2795648
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 52%
IMG OID: 641601953
Product: drug efflux system protein MdtG
Protein accession: YP_001725505
Protein GI: 170020551
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
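
The start, end, gene length, and protein length fields above are mutually consistent if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. Both of those readings are assumptions, not something the record states. A minimal Python sketch of the arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that contributes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2794422, 2795648)
print(length)                     # 1227
print(protein_length_aa(length))  # 408
```

Under these assumptions the computed values match the Gene Length (1227 bp) and Protein Length (408 aa) fields.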


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0112458
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.379099
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACCCT GTGAAAATGA CACCCCTATA AACTGGAAAC GAAACCTGAT CGTCGCCTGG 
CTAGGCTGTT TTCTTACCGG GGCCGCCTTC AGTCTGGTAA TGCCCTTCTT ACCCCTCTAC
GTTGAGCAGC TTGGCGTTAC CGGTCACTCC GCCCTGAATA TGTGGTCCGG TATTGTCTTC
AGCATTACAT TTTTATTTTC GGCCATCGCC TCACCGTTTT GGGGTGGACT CGCCGACCGT
AAAGGCCGAA AACTCATGCT ATTACGCTCT GCTCTCGGCA TGGGCATCGT AATGGTGTTG
ATGGGACTGG CACAAAATAT CTGGCAGTTT TTGATCCTAC GGGCGCTTCT TGGGTTACTT
GGCGGATTTG TCCCCAACGC TAATGCTCTT ATCGCCACAC AAGTACCGCG TAATAAAAGC
GGCTGGGCGC TGGGTACGCT CTCCACAGGC GGCGTTAGTG GTGCGTTGCT CGGCCCAATG
GCTGGCGGCC TGCTCGCCGA TAGCTACGGC TTACGTCCGG TATTCTTTAT TACCGCCAGT
GTGCTCATAC TCTGCTTTTT CGTCACCCTG TTTTGCATCA GAGAAAAATT CCAGCCGGTC
AGCAAAAAAG AGATGCTGCA CATGCGGGAA GTGGTGACAT CACTTAAAAA CCCGAAACTG
GTACTCAGCC TGTTTGTCAC TACGTTAATC ATCCAGGTGG CGACGGGCTC AATTGCCCCC
ATTCTGACGC TGTATGTCCG CGAACTGGCG GGTAACGTCA GTAACGTCGC CTTTATCAGT
GGCATGATCG CCTCGGTGCC AGGCGTGGCG GCTCTGCTAA GTGCACCACG ACTCGGCAAA
CTTGGCGATC GAATCGGACC CGAAAAGATC CTGATTACAG CGCTGATCTT TTCTGTACTG
CTGTTGATCC CAATGTCTTA CGTTCAGACG CCATTGCAAC TTGGGATTTT ACGTTTTTTG
CTCGGTGCCG CCGATGGTGC ACTACTCCCC GCCGTACAGA CACTGTTGGT TTACAACTCG
AGCAACCAGA TCGCCGGGCG TATCTTCAGC TATAACCAAT CGTTTCGTGA TATTGGCAAC
GTTACCGGAC CATTGATGGG AGCAGCGATT TCAGCGAACT ACGGTTTCAG AGCGGTATTT
CTCGTCACCG CTGGCGTAGT GTTATTCAAC GCAGTCTATT CATGGAACAG TCTACGTCGT
CGTCGAATAC CCCAGGTATC GAACTGA
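
A GC content figure such as the one reported for this gene can be recomputed from a sequence laid out in blocks of ten bases like the one above. The sketch below is one way to do that in Python. The function name `gc_content` is hypothetical, and the definition used (G plus C as a percentage of all bases, ignoring whitespace) is an assumption about how the page derives its value.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used to group bases) is ignored,
    and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Small illustrative inputs (not taken from the record):
print(gc_content("GGCC"))   # 100.0
print(gc_content("AT GC"))  # 50.0
```

To check the record, pass the full gene sequence as a single string and compare the rounded result with the GC content field.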
 
Protein sequence
MSPCENDTPI NWKRNLIVAW LGCFLTGAAF SLVMPFLPLY VEQLGVTGHS ALNMWSGIVF 
SITFLFSAIA SPFWGGLADR KGRKLMLLRS ALGMGIVMVL MGLAQNIWQF LILRALLGLL
GGFVPNANAL IATQVPRNKS GWALGTLSTG GVSGALLGPM AGGLLADSYG LRPVFFITAS
VLILCFFVTL FCIREKFQPV SKKEMLHMRE VVTSLKNPKL VLSLFVTTLI IQVATGSIAP
ILTLYVRELA GNVSNVAFIS GMIASVPGVA ALLSAPRLGK LGDRIGPEKI LITALIFSVL
LLIPMSYVQT PLQLGILRFL LGAADGALLP AVQTLLVYNS SNQIAGRIFS YNQSFRDIGN
VTGPLMGAAI SANYGFRAVF LVTAGVVLFN AVYSWNSLRR RRIPQVSN
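
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The sketch below is a minimal Python illustration, not the tool the database uses. It builds the standard codon assignments, which translation table 11 shares with the standard code for elongation codons (table 11 differs in which codons may act as start codons, and this sketch does not model that). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, stop_at_stop: bool = True) -> str:
    """Translate a DNA coding sequence, reading from the first base.

    Whitespace is ignored. Codons with unexpected characters become 'X'.
    A trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record:
print(translate("ATGTCACCCT GTGAAAATGA CACCCCTATA"))  # MSPCENDTPI
# Last 12 bases of the gene sequence, ending in the TGA stop codon:
print(translate("GTATCGAACTGA"))                      # VSN
```

The two fragments translate to the first ten residues (MSPCENDTPI) and the last three residues (VSN) of the protein sequence listed in this record.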