Gene ECH74115_0393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0393 
Symbol 
ID6969940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp399972 
End bp401456 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content51% 
IMG OID643384445 
Productsugar ABC transporter, ATP-binding protein 
Protein accessionYP_002268960 
Protein GI209399620 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACCT TCCTTTCCCT TCGTCATATC AACAAAACGT TTCATGCCAC GCGAGCGCTG 
CGCGATGTTT CTCTCGACTT TATGTCGGGT GAAGTGCACT GCCTTGCCGG GCAAAACGGC
TGTGGTAAAT CGACGTTGAT TAAAATTATG TCCGGTGTTT ATCGCCCGGA TGAGGGAGCG
GAAATTACGC TTGGTGGGAA AAACTGGTCA AAGCTGACAC CCGCCGCTTC GGTGGCGCAG
GGGATTCAGG TGATTTATCA GGACCTCTCT TTATTTCCTA ACCTGAGTGT CTGGGAGAAT
ATCGCCGTGA ATCACTATCA CCACGGTCTG TTTGTTAACC GCCGTCGTCT GCGTGAGGTG
GCGCAGGCGG CAATGACTAG TATTAACGTC ACATTGCCAC TGGATACGCT CGTTTCTGAG
CTTTCAATTG CCCGCTGTCA GCTGGTGGCA ATTTGCCGTG CACTGGCGCA GGACGCACGG
TTGATCGTGA TGGATGAACC AACCGCTTCG CTGACCCATC AGGAAGTGCA AGGGTTATTA
CAGGTGGTGC ATCAATTGCG TGAACGCGGG ATCTGCGTGG TCTTTGTCAG TCATCGTCTG
GAAGAAGTGA TGGAAGTTTC CGACCGTATT TCAGTGCTGA AAGATGGTGA GCTGGTCGGG
ACTTTTCCGG CTGCAGAGAT GACCACAAAA CAACTCGGTT TCCTGATGAC CGGCCAGGAG
TTTGAATATC AGGTGCGAGA GTTGTGGCAG GGAAAATCCA GCACCCCGGT GCTGGAGGTG
CGTAATTTAA GTCGTCATGG GGAATATCTA AATATTAACT TGCGGGTGGA AGCGGGCGAA
GTGGTATCGA TTGTTGGCCT GCTTGGTGCG GGGCGCACGG AATTATGTCT GAGTCTGTTT
GGTATGACCC GACCAGATGC TGGCGAGATC CTCATCAACG GCCAGCTGGT TACACTGCAT
AGTAACCAGG ACGCCATCCG CCACGGTATT GGTTATGTTT CGGAAGATCG CATGTCGCGC
GGTTTGGTGA TGGCGCAATC TATTGAAGAC AACATCATTA GCACTGTTTT TCACAAGGTA
AAAGATCGCT TCGGCTTTCT GAGTGAAACT AAAGTGCGTG ACCTGGTGGA CAGGCTAATC
AAGGCATTGA CCATCAAAGT CTCTGATCCT CATCTTCCGG TGAATACGCT TTCTGGTGGT
AACGCGCAGC GAGTATCGAT CGCCAAATGG CTGGCGATTG GGCCACGACT GTTGATTCTC
GATAGCCCGA CAGTCGGTGT TGATATCGCC AACAAAGCAG GAATATACGG CATTATCAGC
GATCTGGCGG CTCATGGCAT TGCGGTATTG ATGATCTGCG ATGAAATTGA AGAAGCGTGG
TATCAAAGCC ACCGGATTTT GGTAATGCAA AAAGGTCAGA TCACCCATAG CTTCCTGCCT
GACAGCAGTT CTCAGGCCAG GATTGCGGAG GTGGTAAATG GCTGA
 
Protein sequence
METFLSLRHI NKTFHATRAL RDVSLDFMSG EVHCLAGQNG CGKSTLIKIM SGVYRPDEGA 
EITLGGKNWS KLTPAASVAQ GIQVIYQDLS LFPNLSVWEN IAVNHYHHGL FVNRRRLREV
AQAAMTSINV TLPLDTLVSE LSIARCQLVA ICRALAQDAR LIVMDEPTAS LTHQEVQGLL
QVVHQLRERG ICVVFVSHRL EEVMEVSDRI SVLKDGELVG TFPAAEMTTK QLGFLMTGQE
FEYQVRELWQ GKSSTPVLEV RNLSRHGEYL NINLRVEAGE VVSIVGLLGA GRTELCLSLF
GMTRPDAGEI LINGQLVTLH SNQDAIRHGI GYVSEDRMSR GLVMAQSIED NIISTVFHKV
KDRFGFLSET KVRDLVDRLI KALTIKVSDP HLPVNTLSGG NAQRVSIAKW LAIGPRLLIL
DSPTVGVDIA NKAGIYGIIS DLAAHGIAVL MICDEIEEAW YQSHRILVMQ KGQITHSFLP
DSSSQARIAE VVNG