Gene EcSMS35_1575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1575 
Symbol 
ID6143838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1558354 
End bp1559394 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content54% 
IMG OID641616452 
Productputative oxidoreductase 
Protein accessionYP_001743630 
Protein GI170680179 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.544728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ACATCCGTGT TGGGTTGATT GGGTATGGTT ATGCGAGCAA AACCTTCCAT 
GCGCCCCTGA TTGCGGGCAC GCCCGGGCTG GAACTGGCGG TAATCTCCAG CAGCGATGAA
ACAAAAGTAA AAGCCGACTG GCCAACGGTT GCGGTTGTCT CTGAGCCGAA GCATCTGTTT
AACGATCCCA ACATAGACCT GATTGTCATT CCTACACCCA ACGATACCCA TTTCCCGTTA
GCCAAAGCGG CGCTTGAGGC GGGTAAACAT GTGGTCGTTG ATAAACCCTT TACCGTGACA
CTGTCACAAG CGCGAGAGCT GGAAGCGCTG GCAAAAAGCC TGGGGCGTGT GCTGTCTGTA
TTCCATAACC GTCGCTGGGA TAGCGATTTC CTGACGCTAA AAGGTTTGCT CGTGGAAGGC
GTACTGGGTG AAGTTGCTTA CTTTGAGTCT CATTTTGACC GCTTCCGTCC GCAGGTGCGC
GATCGTTGGC GTGAACAGGG CGGTCCTGGC AGCGGTATCT GGTACGATTT AGCACCGCAT
CTTCTTGATC AGGCCATTAC GCTATTTGGT TTACCGGTCA GCATGACGGT TGATTTGGCA
CAGTTACGGC CCGGAGCGCA GTCGACCGAT TATTTCCACG CCATCTTGTC CTATCCGCAG
CGGCGAGTCA TTTTACACGG TACCATGCTG GCAGCTGCTG AGTCAGCACG TTATATCGTG
CATGGATCCC GAGGCAGTTA TGTGAAATAT GGCCTCGATC CACAGGAAGA ACGTCTGAAA
AATGGCGAGC GTCTGCCGCA GGAAGACTGG GGCTACGATA TGCGTGATGG CGTACTTACC
CGCGTGGAAG GTGAGGAACG TGTCGAAGAA ACGCTGTTGA CAGTACCAGG GAATTATCCG
GCTTACTATG CGGCTATTCG TGATGCGTTA AATGGCGATG GTGAAAATCC GGTTCCGGCA
AGTCAGGCAA TCCAGGTAAT GGAGTTGATT GAGCAGGGCA TCGAATCCGC CAAACATCGC
GCGACGCTGT GCCTTGCGTG A
 
Protein sequence
MSDNIRVGLI GYGYASKTFH APLIAGTPGL ELAVISSSDE TKVKADWPTV AVVSEPKHLF 
NDPNIDLIVI PTPNDTHFPL AKAALEAGKH VVVDKPFTVT LSQARELEAL AKSLGRVLSV
FHNRRWDSDF LTLKGLLVEG VLGEVAYFES HFDRFRPQVR DRWREQGGPG SGIWYDLAPH
LLDQAITLFG LPVSMTVDLA QLRPGAQSTD YFHAILSYPQ RRVILHGTML AAAESARYIV
HGSRGSYVKY GLDPQEERLK NGERLPQEDW GYDMRDGVLT RVEGEERVEE TLLTVPGNYP
AYYAAIRDAL NGDGENPVPA SQAIQVMELI EQGIESAKHR ATLCLA