Gene EcSMS35_2440 details


Gene Information

Locus tag: EcSMS35_2440
Symbol: nuoC
ID: 6143204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2488931
End bp: 2490733
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table: 11
GC content: 55%
IMG OID: 641617312
Product: bifunctional NADH:ubiquinone oxidoreductase subunit C/D
Protein accession: YP_001744484
Protein GI: 170682181
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
        [COG0852] NADH:ubiquinone oxidoreductase 27 kD subunit
TIGRFAM ID: [TIGR01961] NADH (or F420H2) dehydrogenase, subunit C
            [TIGR01962] NADH dehydrogenase I, D subunit
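
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2488931, 2490733)
print(length, protein_length(length))  # 1803 600
```

Under these assumptions the computed values agree with the Gene Length (1803 bp) and Protein Length (600 aa) fields listed in the record.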


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 53
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGAACA ATATGACCGA CTTAACCGCG CAAGAACCCG CCTGGCAGAC CCGCGATCAT 
CTTGATGATC CGGTGATTGG CGAACTGCGC AACCGTTTTG GGCCGGATGC CTTTACTGTT
CAGGCGACTC GCACCGGGGT TCCCGTTGTG TGGATCAAGC GTGAACAATT ACTGGAAGTT
GGCGATTTCT TAAAGAAACT GCCGAAACCT TACGTCATGC TGTTTGACTT ACACGGCATG
GACGAACGTC TGCGCACACA CCGCGAAGGG TTACCTGCCG CGGATTTTTC CGTTTTCTAC
CATCTGATTT CTATCGATCG TAACCGCGAC ATCATGCTGA AGGTGGCGCT GGCAGAAAAC
GACCTGCACG TACCGACCTT CACCAAACTG TTCCCGAACG CTAACTGGTA TGAGCGTGAG
ACCTGGGATC TGTTTGGCAT TACTTTCGAC GGTCATCCTA ATTTGCGCCG CATCATGATG
CCGCAAACCT GGAAAGGTCA TCCGCTGCGT AAAGATTACC CGGCACGCGC TACCGAATTC
TCGCCGTTTG AGCTGACCAA AGCCAAACAG GATCTGGAAA TGGAAGCCCT GACCTTCAAA
CCGGAAGAGT GGGGGATGAA GCGCGGCACC GAAAACGAGG ACTTCATGTT CCTCAACCTC
GGTCCGAACC ACCCGTCTGC GCACGGTGCT TTCCGTATCG TTTTGCAGCT CGATGGCGAA
GAGATTGTCG ACTGCGTACC AGACATCGGC TACCACCACC GTGGTGCGGA GAAAATGGGC
GAACGCCAGT CCTGGCATAG CTACATTCCG TATACCGACC GTATCGAATA CCTCGGCGGC
TGCGTTAACG AAATGCCTTA CGTGCTGGCG GTAGAGAAAC TGGCCGGGAT CACCGTGCCG
GATCGCGTTA ACGTCATTCG CGTAATGCTT TCTGAGCTGT TCCGCATCAA CAGCCATCTG
CTGTATATCT CCACCTTTAT TCAGGACGTC GGCGCGATGA CGCCAGTGTT CTTCGCCTTT
ACCGATCGTC AGAAAATTTA CGATCTGGTG GAAGCAATCA CGGGTTTTCG TATGCACCCG
GCGTGGTTCC GTATTGGCGG CGTAGCGCAC GACCTGCCGC GCGGCTGGGA TCGCCTGCTG
CGTGAGTTCC TCGACTGGAT GCCGAAACGT CTGGCGTCTT ACGAGAAAGC GGCGCTGCAA
AATACCATTC TGAAAGGTCG TTCCCAGGGC GTTGCCGCCT ATGGCGCGAA AGAGGCACTG
GAGTGGGGCA CCACTGGCGC GGGCCTGCGT GCTACCGGGA TCGACTTCGA CGTGCGTAAG
GCGCGTCCGT ATTCTGGCTA TGAAAACTTC GACTTTGAAA TCCCGGTGGG TGGTGGTGTT
TCTGACTGCT ACACCCGCGT AATGCTGAAA GTGGAAGAGC TGCGCCAGAG TCTGCGCATT
CTTGAGCAGT GCCTCAACAA CATGCCGGAA GGCCCGTTCA AAGCGGATCA CCCGCTGACC
ACGCCGCCGC CGAAAGAGCG CACGTTGCAA CATATCGAAA CCCTGATCAC CCACTTCCTG
CAAGTGTCGT GGGGTCCGGT GATGCCAGCC AATGAATCTT TCCAGATGAT TGAGGCGACC
AAGGGGATCA ACAGTTACTA CCTGACCAGC GACGGTAGCA CCATGAGTTA TCGCACCCGT
ATCCGTACGC CGAGTTATGC GCATTTGCAG CAAATTCCGG CGGCGATCCG CGGCAGCCTG
GTGTCTGACC TGATTGTTTA TCTGGGCAGT ATCGATTTTG TTATGTCAGA TGTGGACCGC
TAA
 
Protein sequence
MVNNMTDLTA QEPAWQTRDH LDDPVIGELR NRFGPDAFTV QATRTGVPVV WIKREQLLEV 
GDFLKKLPKP YVMLFDLHGM DERLRTHREG LPAADFSVFY HLISIDRNRD IMLKVALAEN
DLHVPTFTKL FPNANWYERE TWDLFGITFD GHPNLRRIMM PQTWKGHPLR KDYPARATEF
SPFELTKAKQ DLEMEALTFK PEEWGMKRGT ENEDFMFLNL GPNHPSAHGA FRIVLQLDGE
EIVDCVPDIG YHHRGAEKMG ERQSWHSYIP YTDRIEYLGG CVNEMPYVLA VEKLAGITVP
DRVNVIRVML SELFRINSHL LYISTFIQDV GAMTPVFFAF TDRQKIYDLV EAITGFRMHP
AWFRIGGVAH DLPRGWDRLL REFLDWMPKR LASYEKAALQ NTILKGRSQG VAAYGAKEAL
EWGTTGAGLR ATGIDFDVRK ARPYSGYENF DFEIPVGGGV SDCYTRVMLK VEELRQSLRI
LEQCLNNMPE GPFKADHPLT TPPPKERTLQ HIETLITHFL QVSWGPVMPA NESFQMIEAT
KGINSYYLTS DGSTMSYRTR IRTPSYAHLQ QIPAAIRGSL VSDLIVYLGS IDFVMSDVDR
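
The gene and protein sequences are printed in blocks of 10 characters, so the spacing has to be removed before either can be used. The sketch below, in Python, strips the block formatting and translates DNA with the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. It is an illustration under stated assumptions rather than the database's own method: the helper names are hypothetical, and alternative start codons (which table 11 reads as Met at initiation) are not handled, although this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGGTGAACA ATATGACCGA CTTAACCGCG")
print(translate(first_blocks))  # MVNNMTDLTA
```

The output matches the first block of the protein sequence listed above. Applying the same two functions to the full gene sequence would allow a residue-by-residue comparison with the full protein listing.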