Gene EcolC_2420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2420 
Symbol 
ID6066168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2664831 
End bp2666483 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content55% 
IMG OID641601829 
Productputative sulfate transporter YchM 
Protein accessionYP_001725381 
Protein GI170020427 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0659] Sulfate permease and related transporters (MFS superfamily) 
TIGRFAM ID[TIGR00815] high affinity sulphate transporter 1 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.643639 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00551985 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTTTCC GCGCTCTGAT CGACGCTTGC TGGAAAGAAA AATATACTGC CGCACGGTTT 
ACCCGTGACC TGATTGCCGG GATAACCGTC GGGATTATTG CTATCCCGCT GGCGATGGCG
TTGGCTATTG GTAGTGGTGT GGCACCCCAG TACGGTTTAT ATACCGCAGC TGTTGCGGGG
ATTGTCATTG CTCTGACGGG TGGGTCACGC TTTAGCGTTT CCGGTCCGAC TGCGGCATTT
GTGGTAATTC TCTATCCCGT GTCGCAACAG TTTGGACTGG CAGGACTGCT GGTTGCGACC
TTGCTGTCGG GGATCTTTTT GATTCTGATG GGTCTGGCAC GCTTTGGTCG CCTGATTGAG
TATATTCCGG TTTCCGTCAC CTTAGGTTTC ACCTCGGGTA TCGGGATCAC CATCGGTACC
ATGCAGATTA AAGATTTTCT CGGTCTGCAA ATGGCCCATG TCCCGGAACA TTATCTACAA
AAAGTCGGCG CATTATTTAT GGCGCTGCCG ACCATTAATG TGGGTGATGC TGCCATTGGC
ATTGTGACGC TAGGTATTCT TGTTTTTTGG CCGCGTCTGG GCATTCGTTT ACCCGGTCAC
CTTCCGGCCT TGCTGGCTGG TTGCGCGGTG ATGGGGATTG TTAACCTGCT CGGCGGACAT
GTTGCTACCA TCGGTTCGCA ATTCCACTAC GTCCTGGCCG ATGGTTCTCA GGGTAACGGT
ATTCCGCAAC TGCTGCCGCA ACTGGTGCTG CCGTGGGATC TGCCTAATTC AGAATTCACG
CTAACCTGGG ATTCTATTCG CACACTGCTG CCTGCGGCAT TCTCAATGGC AATGCTCGGC
GCAATCGAAT CTCTGCTCTG CGCCGTGGTA CTGGATGGTA TGACCGGGAC GAAACACAAG
GCGAACAGCG AACTGGTTGG ACAGGGACTG GGGAATATTA TCGCTCCGTT CTTTGGTGGT
ATTACCGCTA CAGCTGCCAT CGCGCGTTCT GCCGCTAACG TCCGTGCCGG GGCAACTTCC
CCTATCTCGG CGGTGATCCA CTCTATTCTG GTTATTCTTG CCCTGCTGGT ACTGGCACCG
CTGCTCTCCT GGCTGCCGCT TTCCGCTATG GCAGCCCTGC TGTTGATGGT GGCGTGGAAC
ATGAGTGAAG CGCATAAAGT GGTCGACTTG CTGCGTCATG CACCGAAAGA TGACATCATT
GTCATGCTGC TGTGCATGTC GCTGACCGTG CTGTTTGATA TGGTTATTGC CATCAGCGTG
GGGATCGTGC TGGCATCGCT GCTGTTTATG CGTCGTATCG CACGTATGAC TCGCCTGGCA
CCGGTAGTCG TAGATGTTCC AGACGATGTT CTGGTACTGC GCGTTATTGG CCCGCTGTTT
TTTGCTGCTG CTGAAGGCTT GTTCACGGAC CTGGAGTCAC GTCTTGAAGG CAAACGGATT
GTGATTCTGA AGTGGGATGC CGTTCCGGTA CTTGATGCTG GTGGTCTTGA TGCGTTCCAG
CGTTTTGTGA AGCGTCTGCC CGAAGGATGT GAACTGCGCG TGTGCAACGT GGAATTCCAG
CCACTGCGCA CTATGGCTCG CGCAGGCATT CAACCGATCC CGGGACGCCT CGCGTTCTTC
CCGAATCGTC GCGCGGCGAT GGCGGATTTA TAA
 
Protein sequence
MPFRALIDAC WKEKYTAARF TRDLIAGITV GIIAIPLAMA LAIGSGVAPQ YGLYTAAVAG 
IVIALTGGSR FSVSGPTAAF VVILYPVSQQ FGLAGLLVAT LLSGIFLILM GLARFGRLIE
YIPVSVTLGF TSGIGITIGT MQIKDFLGLQ MAHVPEHYLQ KVGALFMALP TINVGDAAIG
IVTLGILVFW PRLGIRLPGH LPALLAGCAV MGIVNLLGGH VATIGSQFHY VLADGSQGNG
IPQLLPQLVL PWDLPNSEFT LTWDSIRTLL PAAFSMAMLG AIESLLCAVV LDGMTGTKHK
ANSELVGQGL GNIIAPFFGG ITATAAIARS AANVRAGATS PISAVIHSIL VILALLVLAP
LLSWLPLSAM AALLLMVAWN MSEAHKVVDL LRHAPKDDII VMLLCMSLTV LFDMVIAISV
GIVLASLLFM RRIARMTRLA PVVVDVPDDV LVLRVIGPLF FAAAEGLFTD LESRLEGKRI
VILKWDAVPV LDAGGLDAFQ RFVKRLPEGC ELRVCNVEFQ PLRTMARAGI QPIPGRLAFF
PNRRAAMADL