Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_1375 |
Symbol | |
ID | 6068083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1507883 |
End bp | 1509412 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641600797 |
Product | NADH dehydrogenase subunit M |
Protein accession | YP_001724368 |
Protein GI | 170019414 |
COG category | [C] Energy production and conversion |
COG ID | [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) |
TIGRFAM ID | [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.905412 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACTAC CCTGGCTAAT ATTAATTCCC TTTATTGGCG GCTTCCTGTG CTGGCAGACC GAACGCTTTG GCGTCAAGGT GCCGCGCTGG ATCGCGCTGA TCACCATGGG ATTGACGCTG GCGCTGTCGC TGCAACTGTG GTTGCAGGGC GGTTATTCAC TGACGCAATC CGCCGGAATT CCGCAGTGGC AGTCTGAATT CGACATGCCG TGGATCCCGC GTTTTGGTAT CTCTATCCAT CTCGCCATTG ACGGGCTGTC GCTGCTGATG GTCGTGCTGA CCGGTCTGCT CGGTGTGCTG GCGGTACTCT GCTCGTGGAA AGAGATCGAA AAATATCAGG GCTTCTTCCA CCTCAACCTG ATGTGGATCC TGGGCGGCGT TATCGGCGTG TTCCTTGCCA TCGACATGTT CCTGTTCTTC TTCTTCTGGG AAATGATGCT GGTGCCGATG TACTTCCTGA TCGCACTGTG GGGGCATAAA GCCTCTGACG GTAAAACGCG TATCACGGCG GCAACCAAGT TCTTCATTTA CACCCAGGCG AGTGGTCTGG TGATGTTGAT CGCCATCCTG GCGCTGGTTT TTGTTCACTA CAATGCGACC GGCGTCTGGA CCTTCAACTA TGAAGAGCTG CTGAATACGC CAATGTCCAG TGGTGTGGAA TACCTGTTAA TGCTGGGCTT CTTCATCGCC TTCGCAGTCA AAATGCCGGT GGTTCCGCTG CATGGCTGGC TGCCGGATGC GCACTCCCAG GCTCCGACCG CCGGTTCCGT TGACCTCGCG GGGATCTTGC TGAAAACTGC CGCTTACGGT TTGCTGCGTT TCTCCCTGCC GCTGTTCCCG AACGCGTCGG CAGAGTTCGC GCCAATTGCT ATGTGGCTGG GTGTTATCGG CATCTTCTAC GGTGCGTGGA TGGCCTTCGC CCAGACCGAT ATCAAACGTC TGATCGCCTA CACCTCGGTT TCCCACATGG GCTTCGTGCT GATTGCAATC TACACCGGCA GCCAGTTGGC CTACCAGGGC GCGGTAATCC AGATGATTGC GCACGGCTTG TCGGCGGCGG GTCTGTTTAT TCTCTGCGGT CAGCTTTATG AACGTATCCA TACCCGCGAC ATGCGCATGA TGGGCGGTCT GTGGAGCAAG ATGAAATGGC TGCCAGCACT GTCGCTGTTC TTTGCGGTGG CAACGCTTGG GATGCCTGGC ACCGGTAACT TCGTCGGCGA ATTTATGATT CTGTTCGGCA GCTTCCAGGT TGTCCCGGTG ATTACCGTTA TCTCTACCTT TGGGCTGGTC TTTGCATCTG TTTATTCGCT GGCGATGTTA CATCGCGCTT ACTTCGGTAA AGCGAAAAGC CAGATTGCCA GCCAGGAACT GCCAGGGATG TCGCTGCGTG AGCTGTTTAT GATCCTGTTG CTGGTGGTGC TGCTGGTACT GCTGGGCTTC TATCCGCAGC CGATTCTGGA TACCTCGCAC TCCGCGATTG GCAATATCCA GCAGTGGTTT GTTAATTCCG TTACTACTAC AAGGCCGTAA
|
Protein sequence | MLLPWLILIP FIGGFLCWQT ERFGVKVPRW IALITMGLTL ALSLQLWLQG GYSLTQSAGI PQWQSEFDMP WIPRFGISIH LAIDGLSLLM VVLTGLLGVL AVLCSWKEIE KYQGFFHLNL MWILGGVIGV FLAIDMFLFF FFWEMMLVPM YFLIALWGHK ASDGKTRITA ATKFFIYTQA SGLVMLIAIL ALVFVHYNAT GVWTFNYEEL LNTPMSSGVE YLLMLGFFIA FAVKMPVVPL HGWLPDAHSQ APTAGSVDLA GILLKTAAYG LLRFSLPLFP NASAEFAPIA MWLGVIGIFY GAWMAFAQTD IKRLIAYTSV SHMGFVLIAI YTGSQLAYQG AVIQMIAHGL SAAGLFILCG QLYERIHTRD MRMMGGLWSK MKWLPALSLF FAVATLGMPG TGNFVGEFMI LFGSFQVVPV ITVISTFGLV FASVYSLAML HRAYFGKAKS QIASQELPGM SLRELFMILL LVVLLVLLGF YPQPILDTSH SAIGNIQQWF VNSVTTTRP
|
| |