Gene EcDH1_2441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2441 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2617702 
End bp2619354 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content55% 
IMG OID 
Productsulfate transporter 
Protein accessionACX40081 
Protein GI260449659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.445001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTCC GCGCTCTGAT CGACGCTTGC TGGAAAGAAA AATATACTGC CGCACGGTTT 
ACCCGTGACC TGATTGCCGG GATAACCGTC GGGATTATTG CTATCCCGCT GGCGATGGCG
TTGGCTATTG GTAGTGGTGT GGCACCCCAG TACGGTTTAT ATACCGCAGC TGTTGCGGGG
ATTGTCATTG CTCTGACGGG TGGGTCACGC TTTAGCGTTT CCGGTCCGAC TGCGGCATTT
GTGGTAATTC TCTATCCCGT GTCGCAACAG TTTGGACTGG CAGGACTGCT GGTTGCGACC
TTGCTGTCGG GGATCTTTTT GATTCTGATG GGTCTGGCAC GCTTTGGTCG CCTGATTGAG
TATATTCCGG TTTCCGTCAC CTTAGGTTTC ACCTCGGGTA TCGGGATCAC CATCGGTACC
ATGCAGATTA AAGATTTTCT CGGTCTGCAA ATGGCCCATG TCCCGGAACA TTATCTACAA
AAAGTCGGCG CATTATTTAT GGCGCTGCCG ACCATTAATG TGGGTGATGC TGCCATTGGC
ATTGTGACGC TAGGTATTCT TGTTTTTTGG CCGCGTCTGG GCATTCGTTT ACCCGGTCAC
CTTCCGGCCT TGCTGGCTGG TTGCGCGGTG ATGGGGATTG TTAACCTGCT CGGCGGACAT
GTTGCTACCA TCGGTTCGCA ATTCCACTAC GTCCTGGCCG ATGGTTCTCA GGGTAACGGT
ATTCCGCAAC TGCTGCCGCA ACTGGTGCTG CCGTGGGATC TGCCTAATTC AGAATTCACG
CTAACCTGGG ATTCTATTCG CACACTGCTG CCTGCGGCAT TCTCAATGGC AATGCTCGGC
GCAATCGAAT CTCTGCTCTG CGCCGTGGTG CTGGATGGTA TGACCGGGAC GAAACACAAG
GCGAACAGCG AACTGGTTGG ACAGGGACTG GGGAATATTA TCGCTCCGTT CTTTGGTGGT
ATTACCGCTA CAGCTGCCAT CGCGCGTTCT GCCGCTAACG TCCGTGCCGG GGCAACGTCC
CCTATCTCGG CGGTGATCCA CTCTATTCTG GTTATTCTTG CCCTGCTGGT ACTGGCACCG
CTGCTCTCCT GGCTGCCGCT TTCCGCCATG GCAGCCCTGC TGTTGATGGT GGCGTGGAAC
ATGAGTGAAG CGCACAAAGT GGTCGACTTG CTGCGTCATG CGCCGAAAGA TGACATCATC
GTCATGCTGC TGTGCATGTC GCTGACCGTG TTGTTTGATA TGGTTATTGC CATCAGCGTG
GGGATCGTGC TGGCATCGCT GCTGTTTATG CGTCGTATCG CACGTATGAC TCGCCTGGCA
CCGGTAGTCG TAGATGTTCC AGACGATGTC CTGGTTCTGC GCGTTATTGG CCCGCTGTTT
TTTGCTGCTG CTGAAGGCTT ATTCACGGAC CTGGAGTCAC GTCTTGAAGG CAAACGGATT
GTGATTCTGA AGTGGGATGC CGTTCCGGTA CTTGATGCTG GTGGTCTTGA TGCGTTCCAG
CGTTTTGTGA AGCGTCTGCC CGAGGGATGT GAACTGCGCG TGTGCAACGT GGAATTCCAG
CCACTGCGCA CTATGGCTCG CGCTGGCATT CAACCGATCC CGGGACGCCT GGCGTTCTTC
CCGAATCGTC GCGCGGCGAT GGCGGATTTA TAA
 
Protein sequence
MPFRALIDAC WKEKYTAARF TRDLIAGITV GIIAIPLAMA LAIGSGVAPQ YGLYTAAVAG 
IVIALTGGSR FSVSGPTAAF VVILYPVSQQ FGLAGLLVAT LLSGIFLILM GLARFGRLIE
YIPVSVTLGF TSGIGITIGT MQIKDFLGLQ MAHVPEHYLQ KVGALFMALP TINVGDAAIG
IVTLGILVFW PRLGIRLPGH LPALLAGCAV MGIVNLLGGH VATIGSQFHY VLADGSQGNG
IPQLLPQLVL PWDLPNSEFT LTWDSIRTLL PAAFSMAMLG AIESLLCAVV LDGMTGTKHK
ANSELVGQGL GNIIAPFFGG ITATAAIARS AANVRAGATS PISAVIHSIL VILALLVLAP
LLSWLPLSAM AALLLMVAWN MSEAHKVVDL LRHAPKDDII VMLLCMSLTV LFDMVIAISV
GIVLASLLFM RRIARMTRLA PVVVDVPDDV LVLRVIGPLF FAAAEGLFTD LESRLEGKRI
VILKWDAVPV LDAGGLDAFQ RFVKRLPEGC ELRVCNVEFQ PLRTMARAGI QPIPGRLAFF
PNRRAAMADL