Gene ECH74115_3780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3780 
Symbol 
ID6971122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3501468 
End bp3502979 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content56% 
IMG OID643387567 
Productputative sugar ABC transporter, ATP-binding protein 
Protein accessionYP_002272020 
Protein GI209396711 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCACGG CAACAGAGGC AGTCCCGGTA GCAAAAGTGG TGGCAGGAAA TAAGCGTTAT 
CCCGGCGTCG TTGCGTTGGA TAACGTTAAC TTCACGCTCA ATAAAGGCGA GGTTCGTGCG
CTGTTAGGCA AAAACGGCGC GGGCAAATCG ACCCTCATTC GAATGCTTAC CGGCAGTGAA
CGTCCGGATA GCGGTGAAAT CTGGATTGGC GAGACGCGAC TGGAAGGTGA CGAAGCTACG
CTGACTCGCC GTGCCGCTGA ACTGGGGGTA CGCGCGGTTT ATCAGGAGTT AAGTCTGGTG
GAAGGGCTGA CGGTGGCGGA AAACCTCTGC CTCGGTCAGT GGCCCCGCCG CAACGGCATG
ATTGATTACC TGCAAATGGC GCAGGATGCC CAACGTTGCT TACAGGCGCT GGGCGTTGAC
GTTAGTCCTG AACAACTTGT TTCAACACTA AGCCCGGCGC ATAAGCAGCT GGTGGAAATT
GCGCGGGTGA TGAAGGGCGA GCCGCGCGTG GTCATTCTTG ATGAACCTAC CAGTTCGCTT
GCGAGTGCGG AAGTTGAACT GGTGATCAGA GCGGTGAAAA AGATGTCAGC ACTGGGCGTG
GCGGTGATTT ATGTCAGCCA CCGGATGGAA GAAATTCGCC GCATTGCCTC CAGTGCCACC
GTTATGCGCG ACGGGCAGGT GGCTGGCGAT GTGATGCTCG AAAACACTTC CACGCATCAT
ATTGTGTCGC TAATGCTAGG GCGCGATCAC GTTGATATTG CGCCGGTTGC ACCTCAGGAA
ATTGTGGATC AGGCCGTGCT GGAAGTCCGT GCGTTACGCC ATAAGCCCAA GCTGGAGGAT
ATCAGCTTTA CGCTACGTCG TGGCGAAGTG CTCGGCATTG CTGGTCTGCT GGGGGCAGGG
CGCAGTGAAT TGCTGAAGGC GATTGTTGGG CTGGAGACGT ATGAACAGGG CGAAATTGTT
ATCAACGGCG AGAAAATCAC GCGCCCCGAT TACGGCGACA TGCTGAAACG CGGCATTGGC
TATACGCCAG AAAACCGCAA AGAAGCGGGG ATCATTCCCT GGTTGGGCGT TGACGAAAAT
ACAGTGCTGA CCAATCGGCA AAAAATCAGC GCCAACGGAG TGCTGCAATG GTCCACCATC
CGCCGCCTGA CCGAAGAAGT GATGCAGCGG ATGACGGTCA AGGCCGCCAG TAGCGAAACG
CCCATCGGCA CGCTTTCTGG CGGCAATCAG CAAAAAGTAG TGATCGGTCG TTGGGTCTAT
GCCGCCAGCC AGATTTTGTT GCTCGACGAG CCAACGCGCG GCGTCGATAT CGAAGCCAAA
CAGCAGATTT ACCGTATTGT CCGTGAGCTG GCTGCCGAAG GAAAAAGCGT GGTGTTTATC
TCCAGTGAAG TGGAGGAGTT GCCGTTGGTG TGTGACCGCA TCCTGTTATT ACAGCACGGC
ACGTTCTCGC AGGAGTTTCA CGCTCCGGTC AATGTGGATG AGCTGATGTC CGCCATTCTG
TCTGTGCACT GA
 
Protein sequence
MFTATEAVPV AKVVAGNKRY PGVVALDNVN FTLNKGEVRA LLGKNGAGKS TLIRMLTGSE 
RPDSGEIWIG ETRLEGDEAT LTRRAAELGV RAVYQELSLV EGLTVAENLC LGQWPRRNGM
IDYLQMAQDA QRCLQALGVD VSPEQLVSTL SPAHKQLVEI ARVMKGEPRV VILDEPTSSL
ASAEVELVIR AVKKMSALGV AVIYVSHRME EIRRIASSAT VMRDGQVAGD VMLENTSTHH
IVSLMLGRDH VDIAPVAPQE IVDQAVLEVR ALRHKPKLED ISFTLRRGEV LGIAGLLGAG
RSELLKAIVG LETYEQGEIV INGEKITRPD YGDMLKRGIG YTPENRKEAG IIPWLGVDEN
TVLTNRQKIS ANGVLQWSTI RRLTEEVMQR MTVKAASSET PIGTLSGGNQ QKVVIGRWVY
AASQILLLDE PTRGVDIEAK QQIYRIVREL AAEGKSVVFI SSEVEELPLV CDRILLLQHG
TFSQEFHAPV NVDELMSAIL SVH