Gene EcHS_A4271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4271 
SymbolxylE 
ID5594932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4272523 
End bp4273998 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content49% 
IMG OID640923373 
ProductD-xylose transporter XylE 
Protein accessionYP_001460818 
Protein GI157163500 
COG category 
COG ID 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones59 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACCC AGTATAATTC CAGTTATATA TTTTCGATTA CCTTAGTCGC TACATTAGGT 
GGTTTATTAT TTGGCTACGA CACCGCCGTT ATTTCCGGTA CTGTTGAGTC ACTCAATACC
GTCTTTGTTG CTCCACAAAA CTTAAGTGAA TCCGCTGCCA ACTCCCTGTT AGGATTTTGC
GTAGCCAGCG CTCTGATTGG TTGCATCATC GGCGGTGCCC TCGGTGGTTA TTGCAGTAAC
CGCTTCGGCC GTCGTGATTC ACTTAAGATT GCTGCTGTCC TGTTTTTTAT TTCTGGTGTA
GGTTCTGCCT GGCCAGAACT TGGTTTTACC TCTATAAACC CGGACAACAC AGTACCTATT
TATCTGGCAG GTTATGTCCC GGAATTTGTT ATTTATCGTA TTATTGGCGG TATTGGCGTT
GGTTTAGCCT CAATGCTTTC ACCAATGTAT ATTGCGGAAC TGGCTCCAGC TCATATTCGC
GGAAAACTGG TTTCATTTAA CCAGTTTGCG ATTATTTTCG GGCAACTTTT AGTTTACTGC
GTAAACTATT TTATTGCCCG TTCCGGTGAT GCCAGCTGGC TGAATACTGA CGGCTGGCGT
TATATGTTTG CCTCGGAATG TATCCCTGCA CTGCTGTTCT TAATGCTGCT GTATACCGTG
CCAGAAAGTC CTCGCTGGCT GATGTCACGC GGCAAGCAAG AACAGGCGGA AAGTATCCTG
CGCAAAATTA TGGGCAACAC GCTTGCAACT CAGGCAGTAC AGGAAATTAA ACACTCCCTG
GATCATGGCC GCAAAACCGG TGGTCGTCTG CTGATGTTTG GCGTGGGCGT GATTGTAATC
GGCGTTATGC TCTCCATCTT CCAGCAATTT GTCGGCATCA ATGTGGTGTT GTACTACGCG
CCGGAAGTGT TCAAAACGCT GGGGGCCAGC ACGGATATCG CGCTGTTGCA GACCATTATT
GTCGGAGTTA TCAACCTCAC CTTTACCGTA CTGGCAATTA TGACGGTGGA TAAATTTGGT
CGTAAGCCAC TGCAAATTAT CGGCGCACTC GGAATGGCAA TCGGTATGTT TAGCCTCGGT
ACCGCGTTTT ACACTCAGGC ATCGGGTATT GTGGCGCTAC TGTCGATGCT GTTCTATGTT
GCCGCCTTTG CCATGTCCTG GGGTCCGGTA TGCTGGGTAC TGCTGTCGGA AATCTTCCCG
AATGCTATTC GTGGTAAAGC GCTGGCAATC GCGGTGGCGG CCCAGTGGCT GGCGAACTAC
TTCGTCTCCT GGACCTTCCC GATGATGGAC AAAAACTCCT GGCTGGTGGC CCATTTCCAC
AACGGTTTCT CCTACTGGAT TTACGGGTGT ATGGGCGTTC TGGCAGCACT GTTTATGTGG
AAATTTGTCC CGGAAACCAA AGGTAAAACC CTTGAGGAGC TGGAAGCCCT CTGGGAACCG
GAAACGAAGA AAACACAACA AACTGCTACG CTGTAA
 
Protein sequence
MNTQYNSSYI FSITLVATLG GLLFGYDTAV ISGTVESLNT VFVAPQNLSE SAANSLLGFC 
VASALIGCII GGALGGYCSN RFGRRDSLKI AAVLFFISGV GSAWPELGFT SINPDNTVPI
YLAGYVPEFV IYRIIGGIGV GLASMLSPMY IAELAPAHIR GKLVSFNQFA IIFGQLLVYC
VNYFIARSGD ASWLNTDGWR YMFASECIPA LLFLMLLYTV PESPRWLMSR GKQEQAESIL
RKIMGNTLAT QAVQEIKHSL DHGRKTGGRL LMFGVGVIVI GVMLSIFQQF VGINVVLYYA
PEVFKTLGAS TDIALLQTII VGVINLTFTV LAIMTVDKFG RKPLQIIGAL GMAIGMFSLG
TAFYTQASGI VALLSMLFYV AAFAMSWGPV CWVLLSEIFP NAIRGKALAI AVAAQWLANY
FVSWTFPMMD KNSWLVAHFH NGFSYWIYGC MGVLAALFMW KFVPETKGKT LEELEALWEP
ETKKTQQTAT L