Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1936 |
Symbol | ychM |
ID | 6144803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1956292 |
End bp | 1957971 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616812 |
Product | putative sulfate transporter YchM |
Protein accession | YP_001743988 |
Protein GI | 170679649 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000016191 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.11461 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACAAAA TATTTTCCTC ACATGTGATG CCTTTCCGCG CTCTGATCGA CGCTTGCTGG AAAGAAAAAT ATACTGCCGC ACGGTTTACC CGTGACCTGA TTGCCGGGAT AACCGTCGGG ATTATTGCTA TCCCGCTGGC GATGGCGTTG GCTATTGGTA GTGGTGTGGC ACCCCAGTAC GGTTTATATA CCGCAGCTGT TGCGGGGATT GTCATTGCTC TGACGGGGGG GTCACGCTTT AGCGTTTCCG GTCCGACTGC GGCATTTGTG GTAATTCTCT ATCCCGTTTC GCAACAGTTT GGACTGGCAG GACTGCTGGT TGCGACCTTG CTGTCGGGGA TCTTTTTGAT TCTGATGGGG CTGGCACGCT TTGGGCGCCT GATTGAGTAT ATTCCGGTTT CCGTCACCTT AGGTTTCACC TCGGGTATCG GGATCACCAT CGGTACCATG CAGATTAAAG ATTTTCTCGG TCTGCAAATG GCCCATGTCC CGGAACATTA TCTACAGAAA GTCGGCGCAT TATTTATGGC GCTGCCGACC ATTAATGTGG GTGATGCAGC CATTGGCATT GTGACGCTTG GTATTCTGGT TTTCTGGCCG CGTCTGGGCA TTCGTTTACC CGGTCATCTT CCGGCCTTGC TGGCTGGTTG CGCGGTGATG GGGATTGTTA ATCTGCTCGG CGGACATGTT GCTACCATCG GTTCGCAATT CCACTACGTC CTGGCCGATG GTTCTCAGGG TAACGGTATT CCGCAACTGC TGCCGCAACT TGTGCTGCCG TGGGATCTGC CTGATTCAGA ATTCACGCTA ACCTGGGATT CTATTCGCAC ACTGCTGCCT GCGGCATTCT CAATGGCAAT GCTCGGCGCA ATCGAATCTC TGCTCTGCGC CGTGGTACTG GATGGTATGA CCGGAACGAA ACACAAAGCG AACAGCGAAC TGGTTGGACA GGGACTGGGG AATATTATCG CTCCGTTCTT TGGTGGCATT ACCGCTACAG CTGCCATCGC GCGTTCTGCC GCTAACGTCC GTGCCGGGGC AACTTCCCCT ATCTCGGCGG TGATCCACTC TATTCTGGTT ATTCTTGCCC TGCTGGTACT GGCACCGCTG CTCTCCTGGC TGCCGCTTTC CGCCATGGCA GCCCTGCTGT TGATGGTGGC GTGGAACATG AGTGAAGCGC ACAAAGTGGT CGACTTGCTG CGTCATGCAC CGAAAGATGA CATCATTGTC ATGCTGTTGT GCATGTCGCT GACCGTGCTG TTTGATATGG TTATTGCCAT CAGCGTGGGG ATCGTGCTGG CATCGCTGCT GTTTATGCGT CGTATCGCAC GTATGACTCG CCTGGCACCG GTAGTCGTAG ATGTTCCAGA CGATGTCCTG GTACTGCGCG TTATTGGCCC GCTGTTTTTT GCTGCTGCTG AAGGCTTATT CACAGACCTG GAGTCACGTC TTGAAGGCAA ACGGATTGTG ATTCTGAAGT GGGATGCCGT TCCGGTACTT GATGCTGGTG GTCTTGATGC GTTCCAGCGT TTTGTGAAGC GTCTACCCGA AGGATGTGAA CTGCGCGTGT GCAACGTGGA ATTCCAGCCA CTGCGCACGA TGGCTCGCGC AGGCATTCAA CCGATCCCGG GCCGCCTTGC TTTCTTCCCG AATCGTCGCG CGGCGATGGC GGATTTATAA
|
Protein sequence | MNKIFSSHVM PFRALIDACW KEKYTAARFT RDLIAGITVG IIAIPLAMAL AIGSGVAPQY GLYTAAVAGI VIALTGGSRF SVSGPTAAFV VILYPVSQQF GLAGLLVATL LSGIFLILMG LARFGRLIEY IPVSVTLGFT SGIGITIGTM QIKDFLGLQM AHVPEHYLQK VGALFMALPT INVGDAAIGI VTLGILVFWP RLGIRLPGHL PALLAGCAVM GIVNLLGGHV ATIGSQFHYV LADGSQGNGI PQLLPQLVLP WDLPDSEFTL TWDSIRTLLP AAFSMAMLGA IESLLCAVVL DGMTGTKHKA NSELVGQGLG NIIAPFFGGI TATAAIARSA ANVRAGATSP ISAVIHSILV ILALLVLAPL LSWLPLSAMA ALLLMVAWNM SEAHKVVDLL RHAPKDDIIV MLLCMSLTVL FDMVIAISVG IVLASLLFMR RIARMTRLAP VVVDVPDDVL VLRVIGPLFF AAAEGLFTDL ESRLEGKRIV ILKWDAVPVL DAGGLDAFQR FVKRLPEGCE LRVCNVEFQP LRTMARAGIQ PIPGRLAFFP NRRAAMADL
|
| |