Gene EcolC_1979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1979 
Symbol 
ID6068191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2188919 
End bp2190016 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content54% 
IMG OID641601393 
ProductN-ethylmaleimide reductase 
Protein accessionYP_001724952 
Protein GI170019998 
COG category[C] Energy production and conversion 
COG ID[COG1902] NADH:flavin oxidoreductases, Old Yellow Enzyme family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0402402 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00127595 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCATCTG AAAAACTGTA TTCCCCACTG AAAGTGGGCG CGATCACGGC GGCAAACCGT 
ATTTTTATGG CACCGCTGAC GCGTCTGCGC AGTATTGAAC CGGGTGACAT TCCTACCCCG
TTGATGGCGG AATACTATCG CCAACGTGCC AGTGCCGGTT TGATTATTAG TGAAGCCACG
CAAATTTCTG CCCAGGCAAA AGGATATGCA GGTGCGCCTG GCATCCATAG TCCGGAGCAA
ATTGCCGCAT GGAAAAAAAT CACCGCTGGC GTTCATGCTG AAAATGGTCA TATGGCCGTG
CAGCTGTGGC ACACCGGACG CATTTCTCAC GCCAGCCTGC AACCTGGCGG TCAGGCACCG
GTAGCGCCTT CAGCACTTAG CGCGGGAACA CGTACTTCTC TGCGCGATGA AAATGGTCAG
GCGATCCGTG TTGAAACATC CATGCCGCGT GCGCTTGAAC TGGAAGAGAT TCCAGGTATC
GTCAATGATT TCCGTCAGGC CATTGCTAAC GCGCGTGAAG CCGGTTTTGA TCTGGTAGAG
CTCCACTCTG CTCACGGTTA TTTGCTGCAT CAGTTCCTTT CTCCTTCTTC AAACCATCGT
ACCGATCAGT ACGGCGGCAG CGTGGAAAAT CGCGCACGTT TGGTACTGGA AGTGGTCGAT
GCCGGGATTG AAGAATGGGG TGCCGATCGC ATTGGCATTC GCGTTTCGCC AATCGGTACT
TTCCAGAACA CAGATAACGG CCCGAATGAA GAAGCCGATG CACTGTATCT GATTGAACAA
CTGGGTAAAC GCGGCATTGC TTATCTGCAT ATGTCAGAAC CAGATTGGGC GGGGGGTGAA
CCGTATACTG ATGCGTTCCG CGAAAAAGTA CGCGCCCGTT TCCACGGTCC GATTATCGGC
GCAGGTGCAT ACACAGTAGA AAAAGCTGAA ACGCTGATCG GCAAAGGGTT AATTGATGCG
GTGGCATTTG GTCGTGACTG GATTGCGAAC CCGGATCTGG TCGCCCGCTT GCAGCGCAAA
GCTGAGCTTA ACCCACAGCG TGCCGAAAGT TTCTACGGTG GCGGCGCGGA AGGCTATACC
GATTACCCGA CGTTGTAA
 
Protein sequence
MSSEKLYSPL KVGAITAANR IFMAPLTRLR SIEPGDIPTP LMAEYYRQRA SAGLIISEAT 
QISAQAKGYA GAPGIHSPEQ IAAWKKITAG VHAENGHMAV QLWHTGRISH ASLQPGGQAP
VAPSALSAGT RTSLRDENGQ AIRVETSMPR ALELEEIPGI VNDFRQAIAN AREAGFDLVE
LHSAHGYLLH QFLSPSSNHR TDQYGGSVEN RARLVLEVVD AGIEEWGADR IGIRVSPIGT
FQNTDNGPNE EADALYLIEQ LGKRGIAYLH MSEPDWAGGE PYTDAFREKV RARFHGPIIG
AGAYTVEKAE TLIGKGLIDA VAFGRDWIAN PDLVARLQRK AELNPQRAES FYGGGAEGYT
DYPTL