Gene EcolC_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2800 
Symbol 
ID6064960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3061172 
End bp3062404 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content52% 
IMG OID641602206 
Productmajor facilitator transporter 
Protein accessionYP_001725755 
Protein GI170020801 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAATA AATTAGCTTC CGGTGCCAGG CTTGGACGTC AGGCGTTACT TTTCCCTCTC 
TGTCTGGTGC TTTACGAATT TTCAACCTAT ATCGGCAACG ATATGATTCA ACCCGGTATG
TTGGCCGTGG TGGAACAATA TCAGGCGGGC ATTGATTGGG TTCCTACTTC GATGACCGCG
TATCTGGCGG GCGGGATGTT TTTACAATGG CTGCTGGGGC CGCTGTCGGA TCGTATTGGT
CGCCGTCCGG TGATGCTGGC GGGAGTGGTG TGGTTTATCG TCACCTGTCT GGCAATATTG
CTGGCGCAAA ACATTGAACA ATTCACCCTG TTGCGCTTCT TGCAGGGCAT AAGCCTCTGT
TTCATTGGCG CTGTGGGATA CGCCGCAATT CAGGAATCCT TCGAAGAGGC GGTTTGTATC
AAGATCACCG CGCTGATGGC GAACGTGGCG CTGATTGCTC CGCTACTTGG TCCGCTGGTG
GGCGCGGCGT GGATCCATGT GCTGCCCTGG GAGGGGATGT TTGTTTTGTT TGCCGCATTG
GCAGCGATCT CCTTTTTCGG TCTGCAACGA GCCATGCCTG AAACCGCCAC GCGTATAGGC
GAGAAACTGT CACTGAAAGA ACTCGGTCGT GACTATAAGC TGGTGCTGAA GAACGGCCGC
TTTGTGGCGG GGGCGCTGGC GCTGGGATTC GTTAGCCTGC CATTGCTGGC GTGGATCGCC
CAGTCGCCGA TTATCATCAT TACCGGCGAG CAGTTGAGCA GCTATGAATA TGGCTTGCTG
CAAGTGCCTA TTTTCGGGGC GTTAATTGCG GGTAACTTGC TGTTAGCGCG TCTGACCTCG
CGCCGCACCG TACGTTCGCT GATAATTATG GGCGGCTGGC CGATTATGAT TGGTCTGTTG
GTCGCTGCTG CGGCAACGGT TATCTCATCG CACGCGTATT TATGGATGAC TGCCGGGTTA
AGTATTTATG CTTTCGGTAT TGGTCTGGCG AATGCGGGAC TGGTGCGATT AACCCTGTTT
GCCAGCGATA TGAGTAAAGG TACGGTTTCT GCGGCGATGG GAATGCTGCA AATGCTGATC
TTTACCGTTG GTATTGAAAT CAGCAAACAT GCCTGGCTGA ACGGGGGCAA CGGACAGTTT
AATCTCTTCA ACCTTGTCAA CGGAATTTTG TGGCTGTCGC TGATGGTTAT CTTTTTAAAA
GATAAACAGA TGGGAAATTC TCACGAAGGG TAA
 
Protein sequence
MQNKLASGAR LGRQALLFPL CLVLYEFSTY IGNDMIQPGM LAVVEQYQAG IDWVPTSMTA 
YLAGGMFLQW LLGPLSDRIG RRPVMLAGVV WFIVTCLAIL LAQNIEQFTL LRFLQGISLC
FIGAVGYAAI QESFEEAVCI KITALMANVA LIAPLLGPLV GAAWIHVLPW EGMFVLFAAL
AAISFFGLQR AMPETATRIG EKLSLKELGR DYKLVLKNGR FVAGALALGF VSLPLLAWIA
QSPIIIITGE QLSSYEYGLL QVPIFGALIA GNLLLARLTS RRTVRSLIIM GGWPIMIGLL
VAAAATVISS HAYLWMTAGL SIYAFGIGLA NAGLVRLTLF ASDMSKGTVS AAMGMLQMLI
FTVGIEISKH AWLNGGNGQF NLFNLVNGIL WLSLMVIFLK DKQMGNSHEG