Gene EcolC_3587 details

Gene Information

Locus tag: EcolC_3587
Symbol:
ID: 6066270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3920782
End bp: 3921960
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 48%
IMG OID: 641603004
Product: sugar efflux transporter
Protein accession: YP_001726528
Protein GI: 170021574
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00899] sugar efflux transporter
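
The coordinate and length fields above are consistent with each other, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated on this page, and the function names are hypothetical.

```python
# Sanity check of the coordinate and length fields listed above.
# Assumptions (not stated on the page): coordinates are 1-based and
# inclusive, and the gene length includes the terminal stop codon.

START_BP = 3920782
END_BP = 3921960


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1179
print(protein_length(length_bp))  # 392
```

Both results match the Gene Length and Protein Length values listed in the record.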


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000416053
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATCTGGA TAATGACGAT GGCTCGCCGT ATGAACGGTG TTTACGCGGC ATTTATGCTG 
GTCGCTTTTA TGATGGGGGT GGCCGGGGCG CTACAGGCTC CTACATTGAG CTTATTTCTG
AGTCGTGAGG TTGGCGCGCA ACCTTTCTGG ATCGGACTCT TTTATACGGT GAATGCTATT
GCTGGGATCG GCGTAAGCCT CTGGTTGGCA AAACGTTCTG ACAGTCAGGG CGATCGGCGA
AAACTGATTA TATTTTGCTG TTTGATGGCT ATCGGCAATG CGCTATTGTT TGCATTTAAT
CGTCATTATC TGACGCTTAT CACCTGTGGT GTGCTTCTGG CATCTCTGGC CAATACGGCA
ATGCCACAGT TATTTGCTCT GGCGCGGGAA TATGCGGATA ACTCGGCGCG AGAAGTGGTG
ATGTTTAGCT CGGTGATGCG TGCGCAGCTT TCTCTGGCAT GGGTTATCGG TCCACCGTTG
GCCTTTATGC TGGCGTTGAA TTACGGCTTT ACGGTGATGT TTTCGATTGC CGCCGGGATA
TTCACACTCA GTCTGGTATT GATTGCATTT ATGCTTCCGT CTGTGGCGCG GGTAGAACTG
CCGTCGGAAA ATGCTTTATC AATGCAAGGT GGCTGGCAGG ATAGTAACGT ACGGATGTTA
TTTGTCGCCT CGACGTTAAT GTGGACCTGC AACACCATGT ACATTATTGA TATGCCGTTG
TGGATCAGTA GCGAGTTAGG ATTGCCAGAC AAACTGGCGG GTTTCCTGAT GGGGACGGCA
GCTGGACTGG AAATACCAGC AATGATTCTG GCTGGCTACT ATGTCAAACG TTATGGTAAG
CGGCGAATGA TGGTCATAGC AGTGGCGGCA GGAGTACTGT TTTACACCGG ATTGATTTTA
TTTCATAGCC GTCTGGCGTT GATGACGCTG CAACTTTTTA ACGCTGTATT TATCGGCATT
GTTGCGGGTA TTGGGATGCT ATGGTTTCAG GATTTAATGC CTGGAAGAGC GGGGGCAGCT
ACCACCTTAT TTACTAACAG TATTTCTACC GGGGTAATTC TGGCTGGCGT TATTCAGGGA
GCAATTGCAC AAAGTTGGGG GCACTTTGCT GTCTACTGGA TAATTGCGGT TATTTCTGTT
GTCGCATTAT TTTTAACCGC AAAGGTTAAA GACATTTGA
 
Protein sequence
MIWIMTMARR MNGVYAAFML VAFMMGVAGA LQAPTLSLFL SREVGAQPFW IGLFYTVNAI 
AGIGVSLWLA KRSDSQGDRR KLIIFCCLMA IGNALLFAFN RHYLTLITCG VLLASLANTA
MPQLFALARE YADNSAREVV MFSSVMRAQL SLAWVIGPPL AFMLALNYGF TVMFSIAAGI
FTLSLVLIAF MLPSVARVEL PSENALSMQG GWQDSNVRML FVASTLMWTC NTMYIIDMPL
WISSELGLPD KLAGFLMGTA AGLEIPAMIL AGYYVKRYGK RRMMVIAVAA GVLFYTGLIL
FHSRLALMTL QLFNAVFIGI VAGIGMLWFQ DLMPGRAGAA TTLFTNSIST GVILAGVIQG
AIAQSWGHFA VYWIIAVISV VALFLTAKVK DI
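
The relationship between the gene sequence and the protein sequence can be sketched as follows. This is an illustrative example, not the method the database used: it assumes the space-separated blocks of ten are display formatting only, and it uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in which start codons it permits). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same sense and stop codons as the
# standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display lines of the gene sequence above. The last line
# starts at position 1141, so it is still in frame (1140 is divisible by 3).
first_line = "ATGATCTGGA TAATGACGAT GGCTCGCCGT ATGAACGGTG TTTACGCGGC ATTTATGCTG"
last_line = "GTCGCATTAT TTTTAACCGC AAAGGTTAAA GACATTTGA"

print(translate(first_line))  # MIWIMTMARRMNGVYAAFML
print(translate(last_line))   # VALFLTAKVKDI
```

The two outputs match the first twenty and last twelve residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.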