Gene EcE24377A_1353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1353 
SymbolychM 
ID5586517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1349896 
End bp1351548 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content55% 
IMG OID640925049 
Productputative sulfate transporter YchM 
Protein accessionYP_001462458 
Protein GI157158365 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000101083 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTCC GCGCTCTGAT CGACGCTTGC TGGAAAGAAA AATATACTGC CGCACGGTTT 
ACCCGTGACC TGATTGCCGG GATAACCGTC GGGATTATTG CTATCCCGCT GGCGATGGCG
TTGGCTATTG GTAGTGGTGT GGCACCCCAG TACGGTTTAT ATACCGCAGC TGTTGCGGGG
ATTGTCATTG CTCTGACGGG TGGGTCACGC TTTAGCGTTT CCGGTCCGAC TGCGGCATTT
GTGGTAATTC TCTATCCCGT GTCGCAACAG TTTGGACTGG CAGGACTGCT GGTTGCGACC
TTGCTGTCGG GGATTTTTTT GATTCTGATG GGTCTGGCAC GCTTTGGTCG CCTGATTGAG
TATATTCCGG TTTCCGTCAC CTTAGGTTTC ACCTCGGGTA TCGGGATCAC CATCGGTACC
ATGCAGATTA AAGATTTTCT CGGTCTGCAA ATGGCCCATG TCCCGGAACA TTATCTACAA
AAAGTCGGCG CATTATTTAT GGCGCTGCCG ACCATTAATG TGGGTGATGC TGCCATTGGC
ATTGTGACGC TAGGTATTCT TGTTTTCTGG CCGCGTCTGG GCATTCGTTT ACCCGGTCAC
CTTCCGGCCT TGCTGGCTGG TTGCGCGGTG ATGGGGATTG TTAACCTGCT CGGCGGACAT
GTTGCTACCA TCGGTTCGCA ATTCCACTAC GTCCTGGCCG ATGGTTCTCA GGGTAACGGT
ATTCCGCAAC TGCTGCCGCA ACTGGTGCTG CCGTGGGATC TGCCTAATTC AGAATTCACG
CTAACCTGGG ATTCTATTCG CACACTGCTG CCTGCGGCAT TCTCAATGGC AATGCTCGGC
GCAATCGAAT CTCTGCTCTG CGCCGTGGTG CTGGATGGTA TGACCGGGAC GAAACACAAG
GCGAATAGCG AACTGGTTGG ACAGGGACTG GGGAATATCA TCGCTCCGTT CTTTGGTGGC
ATTACCGCTA CAGCTGCCAT CGCGCGTTCT GCCGCTAACG TCCGTGCCGG GGCAACTTCC
CCTATCTCGG CGGTGATCCA CTCTATTCTG GTTATTCTTG CCCTGCTGGT ACTGGCACCG
CTGCTCTCCT GGCTGCCGCT TTCCGCCATG GCTGCCCTGC TGTTGATGGT GGCGTGGAAC
ATGAGTGAAG CGCACAAAGT GGTCGACTTG CTGCGTCATG CGCCGAAAGA TGACATCATT
GTCATGCTGC TGTGCATGTC GCTGACCGTG CTGTTTGATA TGGTTATTGC CATCAGCGTG
GGGATCGTGC TGGCATCGCT GCTGTTTATG CGTCGTATCG CACGTATGAC TCGCCTGGCA
CCGGTAGTCG TAGATGTTCC AGACGATGTC CTGGTTCTGC GCGTTATTGG CCCGCTGTTT
TTTGCTGCTG CTGAAGGCTT GTTCACGGAC CTGGAGTCAC GTCTTGAAGG CAAACGGATT
GTGATTCTGA AGTGGGATGC GGTTCCGGTA CTTGATGCTG GTGGTCTTGA TGCGTTCCAG
CGTTTTGTGA AGCGTCTGCC CGAGGGATGT GAACTGCGCG TGTGCAACGT GGAATTCCAG
CCACTGCGCA CTATGGCTCG CGCTGGCATT CAACCGATCC CGGGACGCCT GGCGTTCTTC
CCGAATCGTC GCGCGGCGAT GGCGGATTTA TAA
 
Protein sequence
MPFRALIDAC WKEKYTAARF TRDLIAGITV GIIAIPLAMA LAIGSGVAPQ YGLYTAAVAG 
IVIALTGGSR FSVSGPTAAF VVILYPVSQQ FGLAGLLVAT LLSGIFLILM GLARFGRLIE
YIPVSVTLGF TSGIGITIGT MQIKDFLGLQ MAHVPEHYLQ KVGALFMALP TINVGDAAIG
IVTLGILVFW PRLGIRLPGH LPALLAGCAV MGIVNLLGGH VATIGSQFHY VLADGSQGNG
IPQLLPQLVL PWDLPNSEFT LTWDSIRTLL PAAFSMAMLG AIESLLCAVV LDGMTGTKHK
ANSELVGQGL GNIIAPFFGG ITATAAIARS AANVRAGATS PISAVIHSIL VILALLVLAP
LLSWLPLSAM AALLLMVAWN MSEAHKVVDL LRHAPKDDII VMLLCMSLTV LFDMVIAISV
GIVLASLLFM RRIARMTRLA PVVVDVPDDV LVLRVIGPLF FAAAEGLFTD LESRLEGKRI
VILKWDAVPV LDAGGLDAFQ RFVKRLPEGC ELRVCNVEFQ PLRTMARAGI QPIPGRLAFF
PNRRAAMADL