Gene EcHS_A1311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1311 
SymbolychM 
ID5592231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1307563 
End bp1309215 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content55% 
IMG OID640920468 
Productputative sulfate transporter YchM 
Protein accessionYP_001458029 
Protein GI157160711 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0000228855 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTCC GCGCTCTGAT CGACGCTTGC TGGAAAGAAA AATATACTGC CGCACGGTTT 
ACCCGTGACC TGATTGCCGG GATAACCGTC GGGATTATTG CTATCCCGCT GGCGATGGCG
TTGGCTATTG GTAGTGGTGT GGCACCCCAG TACGGTTTAT ATACCGCAGC TGTTGCGGGG
ATTGTCATTG CTCTGACGGG TGGGTCACGC TTTAGCGTTT CCGGTCCGAC TGCGGCATTT
GTGGTAATTC TCTATCCCGT GTCGCAACAG TTTGGACTGG CAGGACTGCT GGTTGCGACC
TTGCTGTCGG GGATCTTTTT GATTCTGATG GGTCTGGCAC GCTTTGGTCG CCTGATTGAG
TATATTCCGG TTTCCGTCAC CTTAGGTTTC ACCTCGGGTA TCGGGATCAC CATCGGTACC
ATGCAGATTA AAGATTTTCT CGGTCTGCAA ATGGCCCATG TCCCGGAACA TTATCTACAA
AAAGTCGGCG CATTATTTAT GGCGCTGCCG ACCATTAATG TGGGTGATGC TGCCATTGGC
ATTGTGACGC TAGGTATTCT TGTTTTTTGG CCGCGTCTGG GCATTCGTTT ACCCGGTCAC
CTTCCGGCCT TGCTGGCTGG TTGCGCGGTG ATGGGGATTG TTAACCTGCT CGGCGGACAT
GTTGCTACCA TCGGTTCGCA ATTCCACTAC GTCCTGGCCG ATGGTTCTCA GGGTAACGGT
ATTCCGCAAC TGCTGCCGCA ACTGGTGCTG CCGTGGGATC TGCCTAATTC AGAATTCACG
CTAACCTGGG ATTCTATTCG CACACTGCTG CCTGCGGCAT TCTCAATGGC AATGCTCGGC
GCAATCGAAT CTCTGCTCTG CGCCGTGGTA CTGGATGGTA TGACCGGGAC GAAACACAAG
GCGAACAGCG AACTGGTTGG ACAGGGACTG GGGAATATTA TCGCTCCGTT CTTTGGTGGT
ATTACCGCTA CAGCTGCCAT CGCGCGTTCT GCCGCTAACG TCCGTGCCGG GGCAACTTCC
CCTATCTCGG CGGTGATCCA CTCTATTCTG GTTATTCTTG CCCTGCTGGT ACTGGCACCG
CTGCTCTCCT GGCTGCCGCT TTCCGCTATG GCAGCCCTGC TGTTGATGGT GGCGTGGAAC
ATGAGTGAAG CGCATAAAGT GGTCGACTTG CTGCGTCATG CACCGAAAGA TGACATCATT
GTCATGCTGC TGTGCATGTC GCTGACCGTG CTGTTTGATA TGGTTATTGC CATCAGCGTG
GGGATCGTGC TGGCATCGCT GCTGTTTATG CGTCGTATCG CACGTATGAC TCGCCTGGCA
CCGGTAGTCG TAGATGTTCC AGACGATGTT CTGGTACTGC GCGTTATTGG CCCGCTGTTT
TTTGCTGCTG CTGAAGGCTT GTTCACGGAC CTGGAGTCAC GTCTTGAAGG CAAACGGATT
GTGATTCTGA AGTGGGATGC CGTTCCGGTA CTTGATGCTG GTGGTCTTGA TGCGTTCCAG
CGTTTTGTGA AGCGTCTGCC CGAAGGATGT GAACTGCGCG TGTGCAACGT GGAATTCCAG
CCACTGCGCA CTATGGCTCG CGCAGGCATT CAACCGATCC CGGGACGCCT CGCGTTCTTC
CCGAATCGTC GCGCGGCGAT GGCGGATTTA TAA
 
Protein sequence
MPFRALIDAC WKEKYTAARF TRDLIAGITV GIIAIPLAMA LAIGSGVAPQ YGLYTAAVAG 
IVIALTGGSR FSVSGPTAAF VVILYPVSQQ FGLAGLLVAT LLSGIFLILM GLARFGRLIE
YIPVSVTLGF TSGIGITIGT MQIKDFLGLQ MAHVPEHYLQ KVGALFMALP TINVGDAAIG
IVTLGILVFW PRLGIRLPGH LPALLAGCAV MGIVNLLGGH VATIGSQFHY VLADGSQGNG
IPQLLPQLVL PWDLPNSEFT LTWDSIRTLL PAAFSMAMLG AIESLLCAVV LDGMTGTKHK
ANSELVGQGL GNIIAPFFGG ITATAAIARS AANVRAGATS PISAVIHSIL VILALLVLAP
LLSWLPLSAM AALLLMVAWN MSEAHKVVDL LRHAPKDDII VMLLCMSLTV LFDMVIAISV
GIVLASLLFM RRIARMTRLA PVVVDVPDDV LVLRVIGPLF FAAAEGLFTD LESRLEGKRI
VILKWDAVPV LDAGGLDAFQ RFVKRLPEGC ELRVCNVEFQ PLRTMARAGI QPIPGRLAFF
PNRRAAMADL