Gene EcSMS35_2431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2431 
SymbolnuoM 
ID6142650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2478388 
End bp2479917 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content55% 
IMG OID641617303 
ProductNADH dehydrogenase subunit M 
Protein accessionYP_001744475 
Protein GI170683852 
COG category[C] Energy production and conversion 
COG ID[COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) 
TIGRFAM ID[TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACTAC CCTGGCTAAT ATTAATTCCC TTTATCGGCG GCTTCCTGTG CTGGCAGACC 
GAACGCTTTG GCGTCAAGGT GCCGCGCTGG ATCGCGCTGA TCACCATGGG ATTGACGCTG
GCGCTGTCGC TGCAACTGTG GTTGCAGGGC GGTTATTCAC TGACGCAATC CGCCGGAATT
CCGCAGTGGC AGTCTGAATT CGACATGCCG TGGATCCCGC GTTTTGGTAT CTCTATCCAT
CTCGCCATTG ACGGGCTATC GCTGCTGATG GTCGTGCTGA CCGGTCTGCT CGGTGTGCTG
GCGGTACTCT GTTCGTGGAA AGAGATCGAA AAATACCAGG GCTTCTTCCA CCTCAACCTG
ATGTGGATCC TGGGCGGCGT TATCGGCGTG TTCCTTGCCA TCGACATGTT CCTGTTCTTC
TTCTTCTGGG AAATGATGCT GGTGCCGATG TACTTCCTGA TCGCACTGTG GGGGCATAAA
GCCTCTGACG GTAAAACGCG TATCACGGCG GCAACCAAGT TCTTCATTTA CACCCAGGCG
AGTGGTCTGG TGATGTTGAT CGCCATCCTG GCGCTGGTAT TTGTTCACTA CAATGCGACT
GGCGTCTGGA CCTTCAACTA TGAAGAGCTG TTGAATACGC CAATGTCCAG TGGTGTGGAA
TATCTGTTGA TGCTGGGCTT CTTCATCGCG TTTGCGGTGA AAATGCCGGT GGTACCGCTG
CATGGCTGGC TGCCGGATGC GCACTCACAG GCTCCGACCG CCGGTTCCGT GGACCTCGCG
GGGATCTTGC TGAAAACCGC TGCTTACGGT CTGCTGCGTT TCTCCCTGCC GCTGTTCCCG
AACGCGTCGG CAGAGTTCGC GCCAATCGCT ATGTGGCTGG GTGTTATCGG CATCTTCTAC
GGTGCGTGGA TGGCCTTCGC CCAGACCGAT ATCAAACGTC TGATCGCCTA CACCTCGGTT
TCCCACATGG GCTTCGTGCT GATTGCTATC TATACCGGCA GCCAGTTGGC CTACCAGGGC
GCGGTAATCC AGATGATTGC GCACGGCTTG TCAGCGGCGG GTCTGTTTAT TCTTTGTGGT
CAGCTTTATG AACGTATCCA TACCCGCGAC ATGCGCATGA TGGGCGGTCT GTGGAGCAAG
ATGAAATGGC TGCCAGCACT GTCGCTGTTC TTTGCGGTGG CAACGCTTGG GATGCCAGGC
ACCGGTAACT TCGTCGGCGA ATTTATGATT CTGTTCGGCA GCTTCCAGGT TGTCCCGGTG
ATTACCGTTA TCTCTACCTT TGGGCTGGTC TTTGCATCTG TTTATTCGCT GGCGATGTTG
CATCGCGCTT ACTTCGGTAA AGCGAAAAGC CAGATTGCCA GCCAGGAACT GCCAGGGATG
TCGCTGCGTG AGCTGTTTAT GATCCTGTTG CTGGTGGTGC TGCTGGTACT GCTGGGCTTC
TATCCGCAGC CGATTCTGGA TACCTCGCAC TCCGCGATTG GCAATATCCA GCAGTGGTTT
GTTAATTCCG TTACTACTAC AAGGCCGTAA
 
Protein sequence
MLLPWLILIP FIGGFLCWQT ERFGVKVPRW IALITMGLTL ALSLQLWLQG GYSLTQSAGI 
PQWQSEFDMP WIPRFGISIH LAIDGLSLLM VVLTGLLGVL AVLCSWKEIE KYQGFFHLNL
MWILGGVIGV FLAIDMFLFF FFWEMMLVPM YFLIALWGHK ASDGKTRITA ATKFFIYTQA
SGLVMLIAIL ALVFVHYNAT GVWTFNYEEL LNTPMSSGVE YLLMLGFFIA FAVKMPVVPL
HGWLPDAHSQ APTAGSVDLA GILLKTAAYG LLRFSLPLFP NASAEFAPIA MWLGVIGIFY
GAWMAFAQTD IKRLIAYTSV SHMGFVLIAI YTGSQLAYQG AVIQMIAHGL SAAGLFILCG
QLYERIHTRD MRMMGGLWSK MKWLPALSLF FAVATLGMPG TGNFVGEFMI LFGSFQVVPV
ITVISTFGLV FASVYSLAML HRAYFGKAKS QIASQELPGM SLRELFMILL LVVLLVLLGF
YPQPILDTSH SAIGNIQQWF VNSVTTTRP