Gene EcSMS35_1548 details


Gene Information

Locus tag: EcSMS35_1548
Symbol: nemA
ID: 6144423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1535100
End bp: 1536197
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 54%
IMG OID: 641616425
Product: N-ethylmaleimide reductase
Protein accession: YP_001743603
Protein GI: 170683580
COG category: [C] Energy production and conversion
COG ID: [COG1902] NADH:flavin oxidoreductases, Old Yellow Enzyme family
TIGRFAM ID:
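
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 1098 bp) and that the protein length excludes the terminal stop codon. The function names are hypothetical, not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1535100, 1536197)
print(length)                           # 1098
print(expected_protein_length(length))  # 365
```

Both results agree with the Gene Length and Protein Length fields reported for this gene.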


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.284503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.889457
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATCTG AAAAACTGTA TTCCCCACTG AAAGTGGGCG CGATCACGGC GGCAAACCGT 
ATTTTTATGG CACCGCTGAC GCGTCTTCGC AGTATTGAAC CGGGTGACAT CCCTACCCCG
TTGATGGCGG AATACTATCG CCAACGTGCC AGTGCCGGTT TGATTATTAG CGAAGCCACG
CAAATTTCTG CCCAGGCAAA AGGGTATGCT GGTGCACCTG GCATCCATAG TCCGGAACAA
ATTGCCGCAT GGAAAAAAAT TACCGCTGGC GTTCATGCTG AAAATGGTCA TATGGCCGTA
CAGCTGTGGC ACACCGGACG CATTTCTCAC GCCAGCCTGC AACCTGGCGG TCAGGCACCG
GTAGCGCCTT CCGCACTTAG CGCGGGAACA CGTACTTCTC TGCGCGATGA AAATGGTCAG
GCGATCCGTG TTGAAACATC CATGCCGCGT GCGCTTGAAC TGGAAGAGAT TCCGGGTATC
GTCAATGATT TCCGTCAGGC CATTTCTAAC GCGCGTGAAG CCGGTTTTGA TCTGGTAGAG
CTCCACTCTG CTCATGGTTA TTTGCTGCAT CAGTTCCTTT CTCCTTCTTC AAACCATCGT
ACCGATCAGT ACGGCGGCAG CGTGGAAAAT CGCGCACGTC TGGTGCTGGA AGTGGTCGAT
GCAGGGATTG AAGAATGGGG TGCCGATCGC ATTGGCATTC GCGTTTCGCC AATCGGTACT
TTCCAGAACA CAGATAACGG CCCGAATGAA GAAGCCGATG CACTGTATCT GATTGAACAA
CTGGGTAAAC GCGGCATTGC TTATCTGCAT ATGTCAGAAC CAGATTGGGC GGGGGGTGAG
CCGTATACTG ATGCGTTCCG CGAAAAAGTA CGCGCCCGTT TCCACGGTCC GATTATCGGC
GCAGGTGCAT ACACGGTAGA AAAAGCCGAA ACGCTGATCG GCAAAGGGTT AATTGATGCG
GTGGCATTTG GTCGTGACTG GATTGCGAAC CCGGATCTGG TCGCCCGCTT GCAACGCAAA
GCTGAGCTTA ACCCACAGCG TGCCGAAAGT TTTTACGGTG GCGGCGCGGA AGGCTACACG
GATTACCCGA CGCTGTAA
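
The gene sequence is listed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is used. The sketch below shows one way to clean such a listing and run basic sanity checks (length, GC fraction, start and stop codons). The function name `summarize_cds` is hypothetical. The stop codons TAA, TAG and TGA are those of translation table 11, and the start check tests only for ATG, although table 11 also permits alternative start codons. The example input is a toy sequence, not the gene above.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def summarize_cds(raw: str) -> dict:
    """Strip whitespace from a block-formatted sequence and summarize it."""
    seq = "".join(raw.split()).upper()
    invalid = set(seq) - set("ACGT")
    if invalid:
        raise ValueError(f"unexpected characters: {sorted(invalid)}")
    gc = (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0
    return {
        "length_bp": len(seq),
        "gc_fraction": gc,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        "whole_codons": len(seq) % 3 == 0,
    }


# Toy input in the same block layout as the listing above.
summary = summarize_cds("ATGGCC GCATAA")
print(summary["length_bp"])    # 12
print(summary["gc_fraction"])  # 0.5
```

Pasting the full gene sequence into `summarize_cds` should give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.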
 
Protein sequence
MSSEKLYSPL KVGAITAANR IFMAPLTRLR SIEPGDIPTP LMAEYYRQRA SAGLIISEAT 
QISAQAKGYA GAPGIHSPEQ IAAWKKITAG VHAENGHMAV QLWHTGRISH ASLQPGGQAP
VAPSALSAGT RTSLRDENGQ AIRVETSMPR ALELEEIPGI VNDFRQAISN AREAGFDLVE
LHSAHGYLLH QFLSPSSNHR TDQYGGSVEN RARLVLEVVD AGIEEWGADR IGIRVSPIGT
FQNTDNGPNE EADALYLIEQ LGKRGIAYLH MSEPDWAGGE PYTDAFREKV RARFHGPIIG
AGAYTVEKAE TLIGKGLIDA VAFGRDWIAN PDLVARLQRK AELNPQRAES FYGGGAEGYT
DYPTL