Gene EcolC_3206 details

Gene Information

Locus tag: EcolC_3206
Symbol:
ID: 6066451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3512981
End bp: 3514345
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 52%
IMG OID: 641602621
Product: major facilitator transporter
Protein accession: YP_001726155
Protein GI: 170021201
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
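
The coordinates and lengths in this record agree with one another, which a little arithmetic confirms. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single stop codon. Both are assumptions, although the figures in this record are consistent with them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3512981, 3514345)
print(length, protein_length(length))  # 1365 454
```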


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000090157
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACGATT ATAAAATGAC GCCAGGTGAG CGGCGCGCGA CCTGGGGTTT AGGAACCGTT 
TTCTCGTTGC GCATGCTGGG CATGTTTATG GTTCTGCCGG TTCTGACCAC GTATGGCATG
GCTCTGCAAG GTGCCAGCGA AGCATTAATC GGTATTGCCA TTGGTATTTA TGGTCTGACT
CAGGCCGTTT TTCAGATTCC GTTTGGCCTG CTTTCCGACC GCATTGGTCG CAAACCATTA
ATTGTCGGTG GGCTGGCGGT GTTTGCCGCC GGTAGCGTTA TCGCTGCGCT TTCTGACTCC
ATCTGGGGAA TTATTCTGGG CCGGGCGCTA CAAGGCTCCG GTGCGATTGC CGCTGCCGTT
ATGGCGCTGC TTTCCGATCT CACGCGCGAA CAAAACCGTA CCAAAGCAAT GGCGTTTATC
GGCGTGAGCT TTGGCATTAC CTTTGCCATT GCGATGGTGC TTGGCCCGAT CATCACTCAC
AAACTTGGGC TGCACGCGCT GTTCTGGATG ATCGCTATTC TGGCAACGAC CGGCATTGCG
TTGACCATTT GGGTTGTGCC CAACAGTAGC ACTCACGTAC TTAATCGTGA GTCCGGAATG
GTGAAAGGCA GTTTCAGTAA AGTGCTGGCG GAACCGCGGC TGCTGAAACT CAACTTTGGC
ATTATGTGTC TGCATATTTT GCTGATGTCG ACGTTTGTTG CCCTGCCCGG ACAACTGGCT
GATGCAGGGT TCCCGGCGGC TGAACACTGG AAGGTCTATC TGGCGACAAT GCTAATCGCC
TTTGGCTCGG TCGTGCCTTT CATTATCTAC GCTGAAGTTA AGCGCAAAAT GAAGCAAGTC
TTTGTCTTCT GCGTCGGGTT GATCGTGGTT GCGGAAATTG TGTTGTGGAA CGCGCAAACG
CAGTTCTGGC AACTGGTGGT CGGCGTGCAG CTTTTCTTTG TGGCGTTTAA TTTGATGGAA
GCCCTCCTGC CCTCACTTAT CAGTAAAGAG TCGCCAGCAG GTTACAAAGG TACGGCGATG
GGTGTTTACT CCACCAGCCA GTTTCTTGGC GTGGCGATTG GCGGTTCGCT GGGCGGCTGG
ATTAACGGCA TGTTTGACGG TCAGGGGGTA TTTCTCGCTG GCGCAATGCT GGCCGCAGTG
TGGCTGACAG TCGCCAGTAC CATGAAAGAA CCGCCGTATG TCAGCAGTTT GCGCATTGAA
ATCCCGGCGA ACATTGCCGC AAACGAGGCG TTAAAAGTGC GTTTGCTAGA AACTGAAGGC
ATCAAAGAAG TGTTGATTGC AGAAGAAGAA CATTCAGCTT ATGTGAAAAT CGACAGCAAA
GTGACGAATC GCTTTGAGAT AGAACAGGCA ATTCGTCAGG CATAA
 
Protein sequence
MNDYKMTPGE RRATWGLGTV FSLRMLGMFM VLPVLTTYGM ALQGASEALI GIAIGIYGLT 
QAVFQIPFGL LSDRIGRKPL IVGGLAVFAA GSVIAALSDS IWGIILGRAL QGSGAIAAAV
MALLSDLTRE QNRTKAMAFI GVSFGITFAI AMVLGPIITH KLGLHALFWM IAILATTGIA
LTIWVVPNSS THVLNRESGM VKGSFSKVLA EPRLLKLNFG IMCLHILLMS TFVALPGQLA
DAGFPAAEHW KVYLATMLIA FGSVVPFIIY AEVKRKMKQV FVFCVGLIVV AEIVLWNAQT
QFWQLVVGVQ LFFVAFNLME ALLPSLISKE SPAGYKGTAM GVYSTSQFLG VAIGGSLGGW
INGMFDGQGV FLAGAMLAAV WLTVASTMKE PPYVSSLRIE IPANIAANEA LKVRLLETEG
IKEVLIAEEE HSAYVKIDSK VTNRFEIEQA IRQA
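
The sequence blocks above are printed in groups of ten characters, so the whitespace has to be removed before the sequences are used elsewhere. The sketch below is an illustration, not the database's own method. It strips the spacing, computes GC content, and translates a CDS. It assumes the standard codon assignments, which translation table 11 shares with table 1 for non-start codons. It does not handle alternative start codons, which is acceptable here because this gene begins with ATG. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAACGATT ATAAAATGAC GCCAGGTGAG"))  # MNDYKMTPGE
```

Passing the full gene sequence to `translate` should reproduce the 454-residue protein sequence listed above. The first ten and last four residues can be checked by hand against the first and last codons.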