Gene EcHS_A0388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0388 
Symbol 
ID5593107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp406862 
End bp407833 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content51% 
IMG OID640919573 
Productsugar ABC transporter, permease protein 
Protein accessionYP_001457159 
Protein GI157159841 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAAC TGAAAAAACG CCATGAATTC TGGCTGGCAT TACTGATTGT GGTGCTGTTT 
GTTGGCCTTG CCTGGCGCAG CGACGAGTTT CTGACATTCG GTAATTTGTA CGATCTCGCC
AATAACTATG CCATGTTGAC TATTCTCGCC TGTGGTTTGT TTGTGGTGTT GATTTCCGGT
GGAATTGATA TTTCGTTTCC AGCAATGACC ATCATTGCGC AATACGGCAT GGTGCTGTTG
CTGCAAAAAA TTGGTGGCAA CTTCGCTGTC GCGTTTGCAC TGGCGGGCTG CATCGGCATT
TTACTTGGCT TAATTAACGC CTTACTGGTT AATCGCCTAC GGGTGCCTTC TATCATCATC
ACTATCTCGA CGCTGAATAT TTTCTATGGC CTGCTGTTAT GGTTGAGTAA AGGTGTGTGG
CTGTACGACT TTCCGCCGTG GTTTGAGCAG GGGGTTATGT TGTTCAAGTA CACCGATGCT
GATGGCTATG ACTATGGCCT TGGTCTGCCG CTGATCGCCA TGATTACGGT GGTGCTGCTA
ACAGCGTTTA TCATGAATTT CACCAGTGTA GGGCGCAAAA TTTATGCCCT TGGCGGGAAC
CGCGAATCAG CCAGTCGCAT CGGCTTTAGC GTGCTGAAAC TGCAACTTTT CGTCTATGGC
TATATGGGAT TGATGTCTGG CGCTGCGGGT GTAGTGCAGT CGTGGACGGT GATGACTGTC
GCCCCCGATT CTCTTCTGGG TTATGAGCTG ACAGTACTGG CTGCGGTGGT GCTTGGCGGC
ACTAGTTTGC TCGGCGGGCG CGGCACGTTA ACCGGTACTT TGCTCGGCGT GGTGTTGTTG
GCAGTGATGC AAAACGGGCT AAATTTATTG GGAGTCTCGT CTTACTGGCA AACATTGATC
ACCGGCATCA TCATCGTTGC CAGCATTAGT GCCACGGCGT GGAGTCAGCA TCAGAACCGG
AGTCTGCTAT GA
 
Protein sequence
MAELKKRHEF WLALLIVVLF VGLAWRSDEF LTFGNLYDLA NNYAMLTILA CGLFVVLISG 
GIDISFPAMT IIAQYGMVLL LQKIGGNFAV AFALAGCIGI LLGLINALLV NRLRVPSIII
TISTLNIFYG LLLWLSKGVW LYDFPPWFEQ GVMLFKYTDA DGYDYGLGLP LIAMITVVLL
TAFIMNFTSV GRKIYALGGN RESASRIGFS VLKLQLFVYG YMGLMSGAAG VVQSWTVMTV
APDSLLGYEL TVLAAVVLGG TSLLGGRGTL TGTLLGVVLL AVMQNGLNLL GVSSYWQTLI
TGIIIVASIS ATAWSQHQNR SLL