Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1687 |
Symbol | ychM |
ID | 6970448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1626502 |
End bp | 1628181 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385646 |
Product | putative sulfate transporter YchM |
Protein accession | YP_002270140 |
Protein GI | 209397423 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000554758 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.520634 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACAAAA TATTTTCCTC ACATGTGATG CCTTTCCGCG CTCTGATCGA CGCTTGCTGG AAAGAAAAAT ATACTGCCGC ACGGTTTACC CGTGACCTGA TTGCCGGGAT AACCGTCGGG ATTATTGCTA TCCCGCTGGC GATGGCGTTG GCTATTGGTA GTGGTGTGGC ACCCCAGTAC GGTTTATATA CCGCAGCTGT TGCGGGGATT GTCATTGCTC TGACGGGTGG GTCACGCTTT AGCGTTTCCG GTCCGACTGC GGCATTTGTG GTAATTCTCT ATCCCGTTTC GCAACAGTTT GGGCTGGCAG GACTGCTGGT TGCGACCTTG CTGTCGGGGA TCTTTTTGAT TCTGATGGGT CTGGCACGCT TTGGTCGCCT GATTGAGTAT ATTCCGGTTT CCGTCACCTT AGGTTTCACC TCGGGTATCG GGATCACCAT CGGTACCATG CAGATTAAAG ATTTTCTCGG TCTGCAAATG GCCCATGTCC CGGAACATTA TCTACAAAAA GTCGGCGCAT TATTTATGGC GCTGCCGACC ATTAATGTGG GTGATGCTGC CATTGGCATT GTGACGCTAG GTATTCTTGT TTTCTGGCCG CGTCTGGGCA TTCGTTTACC CGGTCACCTT CCGGCCTTGC TGGCTGGTTG CGCGGTGATG GGGATTGTTA ACCTGCTCGG CGGACATGTT GCTACCATCG GTTCGCAATT CCACTACGTC CTGGCCGATG GTTCTCAGGG TAACGGTATT CCGCAACTGC TACCGCAACT GGTGCTGCCG TGGGATCTGC CTAATTCAGA ATTCACGCTA ACCTGGGATT CTATTCGCAC ACTGCTGCCT GCGGCATTCT CAATGGCAAT GCTCGGCGCA ATCGAATCTC TGCTCTGCGC CGTGGTGCTG GATGGTATGA CCGGGACGAA GCACAAGGCG AATAGCGAAC TGGTTGGACA GGGGCTGGGG AATATCATCG CTCCGTTCTT TGGTGGTATT ACCGCTACCG CTGCCATCGC GCGTTCTGCC GCTAACGTCC GTGCCGGGGC AACTTCCCCT ATCTCGGCGG TGATCCACTC TATTCTGGTT ATTCTTGCCC TGCTGGTACT GGCACCGCTG CTCTCCTGGC TGCCGCTTTC CGCTATGGCA GCCCTGCTGT TGATGGTGGC GTGGAACATG AGTGAAGCGC ATAAAGTGGT CGACTTGCTG CGTCATGCAC CGAAAGATGA CATCATTGTC ATGCTGCTGT GCATGTCGCT GACCGTGCTG TTTGATATGG TTATTGCCAT CAGCGTGGGG ATCGTGCTGG CATCGCTGCT GTTTATGCGT CGTATCGCAC GTATGACTCG CCTGGCACCG GTAGTCGTAG ATGTTCCAGA CGATGTCCTG GTTCTGCGCG TTATTGGCCC GCTGTTTTTT GCTGCTGCTG AAGGCTTATT CACGGACCTG GAGTCACGTC TTGAAGGCAA ACGGATTGTG ATTCTGAAGT GGGATGCCGT TCCGGTACTT GATGCTGGTG GTCTTGATGC GTTCCAGCGT TTTGTGAAGC GTCTGCCCGA AGGATGTGAA CTGCGCGTGT GCAACGTGGA ATTCCAGCCA CTGCGCACTA TGGCTCGCGC AGGCATTCAA CCGATCCCGG GACGCCTCGC GTTCTTCCCG AATCGTCGCG CGGCGATGGC GGATTTATAA
|
Protein sequence | MNKIFSSHVM PFRALIDACW KEKYTAARFT RDLIAGITVG IIAIPLAMAL AIGSGVAPQY GLYTAAVAGI VIALTGGSRF SVSGPTAAFV VILYPVSQQF GLAGLLVATL LSGIFLILMG LARFGRLIEY IPVSVTLGFT SGIGITIGTM QIKDFLGLQM AHVPEHYLQK VGALFMALPT INVGDAAIGI VTLGILVFWP RLGIRLPGHL PALLAGCAVM GIVNLLGGHV ATIGSQFHYV LADGSQGNGI PQLLPQLVLP WDLPNSEFTL TWDSIRTLLP AAFSMAMLGA IESLLCAVVL DGMTGTKHKA NSELVGQGLG NIIAPFFGGI TATAAIARSA ANVRAGATSP ISAVIHSILV ILALLVLAPL LSWLPLSAMA ALLLMVAWNM SEAHKVVDLL RHAPKDDIIV MLLCMSLTVL FDMVIAISVG IVLASLLFMR RIARMTRLAP VVVDVPDDVL VLRVIGPLFF AAAEGLFTDL ESRLEGKRIV ILKWDAVPVL DAGGLDAFQR FVKRLPEGCE LRVCNVEFQP LRTMARAGIQ PIPGRLAFFP NRRAAMADL
|
| |