Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0700 |
Symbol | |
ID | 6968287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 729746 |
End bp | 731176 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643384735 |
Product | sodium:sulfate symporter family protein |
Protein accession | YP_002269248 |
Protein GI | 209400659 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCCCAC TGGTGGTGAT GGGTGTCATG TTTCTTATCC CTGTCCCCGA CGGTATGCCG CCGCAGGCAT GGCATTACTT CGCTGTGTTT GTGGCAATGA TTGTCGGCAT GATCCTCGAG CCAATTCCGG CAACAGCGAT CAGTTTTATT GCGGTTACTA TTTGCGTTAT TGGCAGTAAT TACCTGCTCT TTGATGCCAA AGAATTAGCT GACCCAGCGT TTAATGCGCA AAAACAGGCG CTGAAATGGG GCCTGGCTGG TTTTTCCAGC ACTACGGTAT GGCTGGTATT TGGCGCATTT ATTTTTGCAT TAGGGTATGA AGTTTCCGGG TTAGGTCGTC GCATTGCCCT TTTCCTGGTG AAATTCATGG GCAAACGCAC GCTGACGTTG GGTTATGCGA TTGTCATTAT CGACATTCTG CTGGCACCGT TTACACCGTC CAACACCGCG CGTACCGGGG GTACGGTTTT TCCGGTCATT AAAAACCTGC CGCCGTTGTT TAAATCATTC CCGAACGATC CGTCCGCGCG TCGTATTGGC GGCTATTTGA TGTGGATGAT GGTCATTAGT ACCAGTCTGA GTTCGTCCAT GTTTGTCACC GGTGCGGCAC CAAACGTGCT GGGTCTGGAG TTCGTCAGCA AAATTGCCGG TATCCAGATT AGCTGGTTGC AGTGGTTCCT CTGCTTCCTG CCGGTTGGGG TTATCTTGCT TATCATTGCG CCGTGGCTTT CCTACGTGCT GTACAAACCG GAAATCACAC ACAGTGAAGA AGTGGCAACC TGGGCGGGTG ATGAACTAAA AACCATGGGT GCGCTGACAC GCAGAGAGTG GACGCTGATT GGCCTTGTAT TGCTCAGCTT AGGTTTGTGG GTATTTGGCA GTGAAGTCAT TAATGCTACT GCGGTTGGTC TGCTGGCAGT TTCGCTAATG CTGGCTCTGC ACGTTGTGCC GTGGAAAGAC ATTACCCGCT ATAACAGCGC ATGGAACACG CTGGTCAACC TGGCAACTCT GGTTGTGATG GCTAACGGCC TGACTCGTTC TGGTTTTATT GACTGGTTCG CCGGTACCAT GAGTACGCAC CTGGAAGGAT TCTCACCAAA CGCAACGGTG ATTGTACTGG TTCTGGTGTT CTACTTTGCA CACTACCTGT TTGCCAGCCT GTCTGCGCAC ACCGCAACCA TGCTGCCGGT TATTCTGGCC GTCGGTAAAG GTATTCCGGG CGTACCAATG GAACAACTGT GTATCCTGCT GGTGCTGTCT ATCGGTATCA TGGGCTGTCT GACGCCGTAT GCAACCGGTC CTGGGGTGAT TATTTACGGC TGTGGCTATG TGAAATCAAA AGATTACTGG CGTCTTGGCG CAATCTTCGG GGTGATTTAC ATCTCTATGT TGCTGTTGGT TGGCTGGCCG ATTCTCGCCA TGTGGAACTA A
|
Protein sequence | MAPLVVMGVM FLIPVPDGMP PQAWHYFAVF VAMIVGMILE PIPATAISFI AVTICVIGSN YLLFDAKELA DPAFNAQKQA LKWGLAGFSS TTVWLVFGAF IFALGYEVSG LGRRIALFLV KFMGKRTLTL GYAIVIIDIL LAPFTPSNTA RTGGTVFPVI KNLPPLFKSF PNDPSARRIG GYLMWMMVIS TSLSSSMFVT GAAPNVLGLE FVSKIAGIQI SWLQWFLCFL PVGVILLIIA PWLSYVLYKP EITHSEEVAT WAGDELKTMG ALTRREWTLI GLVLLSLGLW VFGSEVINAT AVGLLAVSLM LALHVVPWKD ITRYNSAWNT LVNLATLVVM ANGLTRSGFI DWFAGTMSTH LEGFSPNATV IVLVLVFYFA HYLFASLSAH TATMLPVILA VGKGIPGVPM EQLCILLVLS IGIMGCLTPY ATGPGVIIYG CGYVKSKDYW RLGAIFGVIY ISMLLLVGWP ILAMWN
|
| |