Gene EcolC_1366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1366 
Symbol 
ID6068139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1497067 
End bp1498869 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content56% 
IMG OID641600788 
Productbifunctional NADH:ubiquinone oxidoreductase subunit C/D 
Protein accessionYP_001724359 
Protein GI170019405 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
[COG0852] NADH:ubiquinone oxidoreductase 27 kD subunit 
TIGRFAM ID[TIGR01961] NADH (or F420H2) dehydrogenase, subunit C
[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAACA ATATGACCGA CTTAACCGCG CAAGAACCCG CCTGGCAGAC CCGCGATCAT 
CTTGATGATC CGGTAATTGG CGAACTGCGC AACCGTTTTG GGCCGGATGC CTTTACTGTT
CAGGCGACTC GCACCGGGGT TCCCGTTGTG TGGATCAAGC GTGAACAATT ACTGGAAGTT
GGCGATTTCT TAAAGAAACT GCCGAAACCT TACGTCATGC TGTTTGACTT ACACGGCATG
GACGAACGTC TGCGCACACA CCGCGAAGGG TTACCTGCCG CGGATTTTTC CGTTTTCTAC
CATCTGATTT CTATCGATCG TAACCGCGAC ATCATGCTGA AGGTGGCGCT GGCAGAAAAC
GACCTGCACG TACCGACCTT CACCAAACTG TTCCCGAACG CTAACTGGTA TGAGCGTGAA
ACCTGGGATC TGTTTGGCAT TACTTTCGAC GGTCACCCGA ACCTGCGCCG CATCATGATG
CCGCAAACCT GGAAAGGTCA CCCGCTGCGT AAAGATTACC CGGCGCGCGC TACCGAATTC
TCGCCGTTTG AGCTGACCAA AGCCAAACAG GATCTGGAGA TGGAAGCCCT GACCTTCAAA
CCGGAAGAGT GGGGGATGAA GCGCGGCACC GAAAACGAGG ACTTCATGTT CCTCAACCTC
GGTCCGAACC ACCCGTCGGC GCACGGGGCT TTCCGTATCG TTTTACAACT CGACGGCGAA
GAGATTGTCG ACTGCGTACC AGACATCGGC TACCACCACC GTGGTGCGGA GAAAATGGGC
GAGCGCCAGT CCTGGCACAG CTACATTCCG TATACCGACC GTATTGAATA CCTCGGCGGC
TGCGTTAACG AAATGCCTTA CGTGCTGGCG GTAGAGAAAC TGGCCGGGAT CACCGTGCCG
GATCGCGTTA ACGTCATTCG CGTTATGCTC TCCGAACTGT TCCGTATCAA CAGCCACCTG
CTGTACATCT CGACCTTTAT TCAGGACGTC GGCGCAATGA CGCCCGTGTT CTTCGCCTTT
ACCGATCGTC AGAAAATTTA CGATCTGGTG GAAGCGATCA CGGGTTTCCG TATGCACCCG
GCGTGGTTCC GTATTGGCGG CGTAGCGCAC GACCTGCCGC GCGGCTGGGA TCGCCTGCTG
CGTGAGTTCC TCGACTGGAT GCCGAAACGT CTGGCGTCTT ACGAGAAAGC GGCGCTGCAA
AATACCATTC TGAAAGGTCG TTCCCAGGGC GTTGCCGCCT ATGGCGCGAA AGAGGCGCTG
GAGTGGGGCA CCACTGGCGC GGGCCTGCGT GCTACCGGGA TCGACTTCGA CGTGCGTAAG
GCGCGTCCTT ATTCTGGCTA TGAAAACTTC GACTTTGAAA TCCCGGTTGG TGGTGGTGTT
TCTGACTGCT ACACCCGCGT AATGCTGAAA GTGGAAGAGC TGCGCCAGAG TCTGCGCATT
CTTGAGCAGT GCCTCAACAA CATGCCGGAA GGCCCGTTCA AAGCGGATCA CCCGCTGACC
ACGCCGCCGC CGAAAGAGCG CACGCTGCAA CATATCGAAA CCCTGATCAC CCACTTCCTG
CAAGTGTCGT GGGGGCCGGT GATGCCTGCC AATGAATCTT TCCAGATGAT TGAGGCGACC
AAGGGGATCA ACAGTTACTA CCTGACCAGC GACGGTAGCA CCATGAGTTA TCGCACTCGT
ATCCGCACGC CGAGTTATGC GCATTTGCAG CAAATTCCGG CGGCGATCCG CGGCAGCCTG
GTGTCTGACC TGATTGTTTA TCTGGGCAGT ATCGATTTTG TTATGTCAGA TGTGGACCGC
TAA
 
Protein sequence
MVNNMTDLTA QEPAWQTRDH LDDPVIGELR NRFGPDAFTV QATRTGVPVV WIKREQLLEV 
GDFLKKLPKP YVMLFDLHGM DERLRTHREG LPAADFSVFY HLISIDRNRD IMLKVALAEN
DLHVPTFTKL FPNANWYERE TWDLFGITFD GHPNLRRIMM PQTWKGHPLR KDYPARATEF
SPFELTKAKQ DLEMEALTFK PEEWGMKRGT ENEDFMFLNL GPNHPSAHGA FRIVLQLDGE
EIVDCVPDIG YHHRGAEKMG ERQSWHSYIP YTDRIEYLGG CVNEMPYVLA VEKLAGITVP
DRVNVIRVML SELFRINSHL LYISTFIQDV GAMTPVFFAF TDRQKIYDLV EAITGFRMHP
AWFRIGGVAH DLPRGWDRLL REFLDWMPKR LASYEKAALQ NTILKGRSQG VAAYGAKEAL
EWGTTGAGLR ATGIDFDVRK ARPYSGYENF DFEIPVGGGV SDCYTRVMLK VEELRQSLRI
LEQCLNNMPE GPFKADHPLT TPPPKERTLQ HIETLITHFL QVSWGPVMPA NESFQMIEAT
KGINSYYLTS DGSTMSYRTR IRTPSYAHLQ QIPAAIRGSL VSDLIVYLGS IDFVMSDVDR