Gene ECH74115_5634 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5634 
SymbolmelB 
ID6971552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5272994 
End bp5274439 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content46% 
IMG OID643389268 
Productmelibiose:sodium symporter 
Protein accessionYP_002273665 
Protein GI209396194 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTAACAG CGACCCGATA CCCTATGAGC ATTTCAATGA CTACAAAACT CAGTTATGGA 
TTTGGAGCGT TCGGGAAGGA TTTTGCGATC GGCATTGTGT ATATGTACCT CATGTATTAC
TACACCGATG TCGTCGGGCT GTCTGTGGGT TTGGTCGGTA CTTTGTTTCT GGTGGCGAGG
ATCTGGGATG CTATTAACGA TCCGATTATG GGATGGATTG TAAATGCTAC GCGATCGCGA
TGGGGTAAGT TCAAACCCTG GATCCTGATC GGTACGTTGG CAAACTCTGT AATCTTATTT
CTCCTCTTTA GTGCGCATCT GTTTGAAGGT ACTACTCAGA TTGCCTTTGT TTGCGTGACC
TACATCCTCT GGGGCATGAC TTACACCATT ATGGATATTC CCTTCTGGTC GCTGGTTCCA
ACCATCACGC TCGATAAACG TGAGCGTGAA CAACTGGTTC CTTATCCGCG TTTTTTTGCC
AGTCTGGCGG GCTTTGTTAC GGCAGGTGTG ACGCTACCAT TTGTTAATTA TGTCGGCGGT
GGCGATCGGG GATTTGGCTT TCAGATGTTC ACACTGGTAC TGATCGCCTT TTTTATTGTT
TCAACCATCA TCACTCTGCG CAATGTGCAT GAAGTCTTTT CGTCAGACAA TCAACCGTCT
GCTGAAGGAA GCCATCTGAC ACTTAAAGCC ATCGTTGCGC TTATTTATAA AAACGATCAG
CTTTCATGCC TCTTGGGTAT GGCTCTTGCT TATAATGTAG CCAGCAACAT TATTACCGGC
TTTGCTATCT ATTATTTCTC ATATGTTATC GGTGATGCGG ATTTGTTCCC CTATTATCTG
TCGTATGCGG GAGCTGCTAA CCTGGTGACG TTAGTATTCT TCCCACGCTT AGTTAAATCA
TTATCCCGAC GCATTTTATG GGCCGGAGCA TCTATTCTTC CGGTGTTAAG CTGTGGTGTT
CTCCTGTTAA TGGCATTAAT GAGCTATCAC AACGTCGTCC TCATTGTGAT TGCGGGTATT
TTGCTGAATG TGGGAACGGC GCTTTTCTGG GTATTACAGG TCATCATGGT GGCAGATACC
GTTGATTACG GTGAATATAA ACTGCACGTA CGCTGTGAAA GCATCGCTTA CTCCGTGCAG
ACTATGGTGG TGAAGGGCGG TTCAGCCTTT GCGGCTTTTT TCATTGCGGT TGTGTTAGGG
ATGATTGGCT ATGTACCGAA TGTTGAACAG TCTACGCAAG CCCTATTAGG TATGCAGTTT
ATTATGATTG CTCTACCGAC TCTGTTTTTC ATGGTAACGC TGATTCTCTA CTTCCGTTTC
TATCGCCTCA ATGGCGACAC GCTGCGCAGG ATCCAGATTC ATCTGCTGGA TAAATATCGC
AAAATACCGC CCGAGCCTGT TCATGCTGAT ATTCCGGTCG GTGCAGTGAG TGATGTGAAA
GCCTGA
 
Protein sequence
MVTATRYPMS ISMTTKLSYG FGAFGKDFAI GIVYMYLMYY YTDVVGLSVG LVGTLFLVAR 
IWDAINDPIM GWIVNATRSR WGKFKPWILI GTLANSVILF LLFSAHLFEG TTQIAFVCVT
YILWGMTYTI MDIPFWSLVP TITLDKRERE QLVPYPRFFA SLAGFVTAGV TLPFVNYVGG
GDRGFGFQMF TLVLIAFFIV STIITLRNVH EVFSSDNQPS AEGSHLTLKA IVALIYKNDQ
LSCLLGMALA YNVASNIITG FAIYYFSYVI GDADLFPYYL SYAGAANLVT LVFFPRLVKS
LSRRILWAGA SILPVLSCGV LLLMALMSYH NVVLIVIAGI LLNVGTALFW VLQVIMVADT
VDYGEYKLHV RCESIAYSVQ TMVVKGGSAF AAFFIAVVLG MIGYVPNVEQ STQALLGMQF
IMIALPTLFF MVTLILYFRF YRLNGDTLRR IQIHLLDKYR KIPPEPVHAD IPVGAVSDVK
A