Gene Information |
Locus tag | EcHS_A1311 |
Symbol | ychM |
ID | 5592231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1307563 |
End bp | 1309215 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640920468 |
Product | putative sulfate transporter YchM |
Protein accession | YP_001458029 |
Protein GI | 157160711 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0659] Sulfate permease and related transporters (MFS superfamily) |
TIGRFAM ID | [TIGR00815] high affinity sulphate transporter 1 |
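The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1307563
END_BP = 1309215
GENE_LENGTH_BP = 1653
PROTEIN_LENGTH_AA = 550


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Because the gene lies on the minus strand, the listed gene sequence would correspond to the reverse complement of the replicon between these coordinates.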
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0000228855 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTTTCC GCGCTCTGAT CGACGCTTGC TGGAAAGAAA AATATACTGC CGCACGGTTT ACCCGTGACC TGATTGCCGG GATAACCGTC GGGATTATTG CTATCCCGCT GGCGATGGCG TTGGCTATTG GTAGTGGTGT GGCACCCCAG TACGGTTTAT ATACCGCAGC TGTTGCGGGG ATTGTCATTG CTCTGACGGG TGGGTCACGC TTTAGCGTTT CCGGTCCGAC TGCGGCATTT GTGGTAATTC TCTATCCCGT GTCGCAACAG TTTGGACTGG CAGGACTGCT GGTTGCGACC TTGCTGTCGG GGATCTTTTT GATTCTGATG GGTCTGGCAC GCTTTGGTCG CCTGATTGAG TATATTCCGG TTTCCGTCAC CTTAGGTTTC ACCTCGGGTA TCGGGATCAC CATCGGTACC ATGCAGATTA AAGATTTTCT CGGTCTGCAA ATGGCCCATG TCCCGGAACA TTATCTACAA AAAGTCGGCG CATTATTTAT GGCGCTGCCG ACCATTAATG TGGGTGATGC TGCCATTGGC ATTGTGACGC TAGGTATTCT TGTTTTTTGG CCGCGTCTGG GCATTCGTTT ACCCGGTCAC CTTCCGGCCT TGCTGGCTGG TTGCGCGGTG ATGGGGATTG TTAACCTGCT CGGCGGACAT GTTGCTACCA TCGGTTCGCA ATTCCACTAC GTCCTGGCCG ATGGTTCTCA GGGTAACGGT ATTCCGCAAC TGCTGCCGCA ACTGGTGCTG CCGTGGGATC TGCCTAATTC AGAATTCACG CTAACCTGGG ATTCTATTCG CACACTGCTG CCTGCGGCAT TCTCAATGGC AATGCTCGGC GCAATCGAAT CTCTGCTCTG CGCCGTGGTA CTGGATGGTA TGACCGGGAC GAAACACAAG GCGAACAGCG AACTGGTTGG ACAGGGACTG GGGAATATTA TCGCTCCGTT CTTTGGTGGT ATTACCGCTA CAGCTGCCAT CGCGCGTTCT GCCGCTAACG TCCGTGCCGG GGCAACTTCC CCTATCTCGG CGGTGATCCA CTCTATTCTG GTTATTCTTG CCCTGCTGGT ACTGGCACCG CTGCTCTCCT GGCTGCCGCT TTCCGCTATG GCAGCCCTGC TGTTGATGGT GGCGTGGAAC ATGAGTGAAG CGCATAAAGT GGTCGACTTG CTGCGTCATG CACCGAAAGA TGACATCATT GTCATGCTGC TGTGCATGTC GCTGACCGTG CTGTTTGATA TGGTTATTGC CATCAGCGTG GGGATCGTGC TGGCATCGCT GCTGTTTATG CGTCGTATCG CACGTATGAC TCGCCTGGCA CCGGTAGTCG TAGATGTTCC AGACGATGTT CTGGTACTGC GCGTTATTGG CCCGCTGTTT TTTGCTGCTG CTGAAGGCTT GTTCACGGAC CTGGAGTCAC GTCTTGAAGG CAAACGGATT GTGATTCTGA AGTGGGATGC CGTTCCGGTA CTTGATGCTG GTGGTCTTGA TGCGTTCCAG CGTTTTGTGA AGCGTCTGCC CGAAGGATGT GAACTGCGCG TGTGCAACGT GGAATTCCAG CCACTGCGCA CTATGGCTCG CGCAGGCATT CAACCGATCC CGGGACGCCT CGCGTTCTTC CCGAATCGTC GCGCGGCGAT GGCGGATTTA TAA
|
Protein sequence | MPFRALIDAC WKEKYTAARF TRDLIAGITV GIIAIPLAMA LAIGSGVAPQ YGLYTAAVAG IVIALTGGSR FSVSGPTAAF VVILYPVSQQ FGLAGLLVAT LLSGIFLILM GLARFGRLIE YIPVSVTLGF TSGIGITIGT MQIKDFLGLQ MAHVPEHYLQ KVGALFMALP TINVGDAAIG IVTLGILVFW PRLGIRLPGH LPALLAGCAV MGIVNLLGGH VATIGSQFHY VLADGSQGNG IPQLLPQLVL PWDLPNSEFT LTWDSIRTLL PAAFSMAMLG AIESLLCAVV LDGMTGTKHK ANSELVGQGL GNIIAPFFGG ITATAAIARS AANVRAGATS PISAVIHSIL VILALLVLAP LLSWLPLSAM AALLLMVAWN MSEAHKVVDL LRHAPKDDII VMLLCMSLTV LFDMVIAISV GIVLASLLFM RRIARMTRLA PVVVDVPDDV LVLRVIGPLF FAAAEGLFTD LESRLEGKRI VILKWDAVPV LDAGGLDAFQ RFVKRLPEGC ELRVCNVEFQ PLRTMARAGI QPIPGRLAFF PNRRAAMADL
|
| |
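The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of doing that and of computing a GC fraction comparable to the GC content field; the helper names `join_blocks` and `gc_fraction` are hypothetical, and the sample string is only the first two blocks of the gene sequence, not the full record.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence listed above (illustration only).
sample = join_blocks("ATGCCTTTCC GCGCTCTGAT")
sample_gc = gc_fraction(sample)
```

Passing the full gene sequence through the same two functions gives a value that can be compared with the rounded GC content reported in the record.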