Gene ECH74115_0700 details


Gene Information

Locus tag: ECH74115_0700
Symbol:
ID: 6968287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 729746
End bp: 731176
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 51%
IMG OID: 643384735
Product: sodium:sulfate symporter family protein
Protein accession: YP_002269248
Protein GI: 209400659
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0471] Di- and tricarboxylate transporters
TIGRFAM ID: [TIGR00785] anion transporter
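
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 729746
end_bp = 731176

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1431
print(protein_length)   # 476
```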


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCCCCAC TGGTGGTGAT GGGTGTCATG TTTCTTATCC CTGTCCCCGA CGGTATGCCG 
CCGCAGGCAT GGCATTACTT CGCTGTGTTT GTGGCAATGA TTGTCGGCAT GATCCTCGAG
CCAATTCCGG CAACAGCGAT CAGTTTTATT GCGGTTACTA TTTGCGTTAT TGGCAGTAAT
TACCTGCTCT TTGATGCCAA AGAATTAGCT GACCCAGCGT TTAATGCGCA AAAACAGGCG
CTGAAATGGG GCCTGGCTGG TTTTTCCAGC ACTACGGTAT GGCTGGTATT TGGCGCATTT
ATTTTTGCAT TAGGGTATGA AGTTTCCGGG TTAGGTCGTC GCATTGCCCT TTTCCTGGTG
AAATTCATGG GCAAACGCAC GCTGACGTTG GGTTATGCGA TTGTCATTAT CGACATTCTG
CTGGCACCGT TTACACCGTC CAACACCGCG CGTACCGGGG GTACGGTTTT TCCGGTCATT
AAAAACCTGC CGCCGTTGTT TAAATCATTC CCGAACGATC CGTCCGCGCG TCGTATTGGC
GGCTATTTGA TGTGGATGAT GGTCATTAGT ACCAGTCTGA GTTCGTCCAT GTTTGTCACC
GGTGCGGCAC CAAACGTGCT GGGTCTGGAG TTCGTCAGCA AAATTGCCGG TATCCAGATT
AGCTGGTTGC AGTGGTTCCT CTGCTTCCTG CCGGTTGGGG TTATCTTGCT TATCATTGCG
CCGTGGCTTT CCTACGTGCT GTACAAACCG GAAATCACAC ACAGTGAAGA AGTGGCAACC
TGGGCGGGTG ATGAACTAAA AACCATGGGT GCGCTGACAC GCAGAGAGTG GACGCTGATT
GGCCTTGTAT TGCTCAGCTT AGGTTTGTGG GTATTTGGCA GTGAAGTCAT TAATGCTACT
GCGGTTGGTC TGCTGGCAGT TTCGCTAATG CTGGCTCTGC ACGTTGTGCC GTGGAAAGAC
ATTACCCGCT ATAACAGCGC ATGGAACACG CTGGTCAACC TGGCAACTCT GGTTGTGATG
GCTAACGGCC TGACTCGTTC TGGTTTTATT GACTGGTTCG CCGGTACCAT GAGTACGCAC
CTGGAAGGAT TCTCACCAAA CGCAACGGTG ATTGTACTGG TTCTGGTGTT CTACTTTGCA
CACTACCTGT TTGCCAGCCT GTCTGCGCAC ACCGCAACCA TGCTGCCGGT TATTCTGGCC
GTCGGTAAAG GTATTCCGGG CGTACCAATG GAACAACTGT GTATCCTGCT GGTGCTGTCT
ATCGGTATCA TGGGCTGTCT GACGCCGTAT GCAACCGGTC CTGGGGTGAT TATTTACGGC
TGTGGCTATG TGAAATCAAA AGATTACTGG CGTCTTGGCG CAATCTTCGG GGTGATTTAC
ATCTCTATGT TGCTGTTGGT TGGCTGGCCG ATTCTCGCCA TGTGGAACTA A
 
Protein sequence
MAPLVVMGVM FLIPVPDGMP PQAWHYFAVF VAMIVGMILE PIPATAISFI AVTICVIGSN 
YLLFDAKELA DPAFNAQKQA LKWGLAGFSS TTVWLVFGAF IFALGYEVSG LGRRIALFLV
KFMGKRTLTL GYAIVIIDIL LAPFTPSNTA RTGGTVFPVI KNLPPLFKSF PNDPSARRIG
GYLMWMMVIS TSLSSSMFVT GAAPNVLGLE FVSKIAGIQI SWLQWFLCFL PVGVILLIIA
PWLSYVLYKP EITHSEEVAT WAGDELKTMG ALTRREWTLI GLVLLSLGLW VFGSEVINAT
AVGLLAVSLM LALHVVPWKD ITRYNSAWNT LVNLATLVVM ANGLTRSGFI DWFAGTMSTH
LEGFSPNATV IVLVLVFYFA HYLFASLSAH TATMLPVILA VGKGIPGVPM EQLCILLVLS
IGIMGCLTPY ATGPGVIIYG CGYVKSKDYW RLGAIFGVIY ISMLLLVGWP ILAMWN
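
The protein sequence above can be re-derived from the gene sequence using translation table 11, under which the initial TTG codon is read as methionine. The following is a minimal sketch rather than the method used to generate this record: the function names `translate_table11` and `gc_fraction` are hypothetical, the codon assignments are the standard ones enumerated in TCAG order, and the start-codon set is the one listed for NCBI table 11. The GC fraction it reports for the full gene can be compared against the 51% given in the record.

```python
# Standard codon-to-amino-acid assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons listed for NCBI translation table 11 (bacterial, archaeal, plastid).
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the display layout."""
    return "".join(seq.split()).upper()


def translate_table11(dna):
    """Translate a coding sequence, reading an alternative start codon as M."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in TABLE_11_STARTS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First ten codons of the gene sequence, as laid out in the record.
print(translate_table11("TTGGCCCCAC TGGTGGTGAT GGGTGTCATG"))  # MAPLVVMGVM
```

Passing the complete gene sequence to `translate_table11` in place of the ten-codon excerpt allows a residue-by-residue comparison with the protein sequence listed above.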