Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0744 |
Symbol | |
ID | 4076153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 801379 |
End bp | 802590 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638006041 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_612739 |
Protein GI | 99080585 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGCT CCAAATTCGA CGACGCCCAG ACGGGCGAAC AGAAAATCCG TAACTTCAAC ATCAACTTCG GCCCGCAGCA CCCTGCGGCG CACGGCGTGC TGCGTCTGGT GCTGGAACTG GATGGCGAGA TCGTGGAACG CTGCGACCCG CACATCGGTC TTCTGCACCG TGGCACCGAA AAGCTGATGG AAAGCCGCAC CTACCTGCAG AACCTGCCGT ATTTCGACCG CCTCGACTAT GTGGCGCCGA TGAACCAGGA GCACGCTTGG TGTCTGGCAA TCGAAAAGCT GACTGGCGTG GAGGTCCCCC GCCGTGCGCA GCTGATCCGA GTGCTCTATT CTGAGATCGG CCGTATCCTC AATCACCTCT TGAACATCAC CACTCAGGCG ATGGACGTGG GCGCGCTGAC GCCGCCGCTC TGGGGCTTTG AGGAACGCGA GAAGCTGATG ATCTTCTACG AGCGGGCCTG TGGTGCACGC TTGCACGCGG CCTACTTCCG CCCTGGTGGC GTGCATCAGG ATCTGCCGGA CGAGCTGCTG GATGATATCG ACCTCTGGGC GATGGAATTT CCGAAGGTCA TGGACGACAT CGACGGCCTC TTGACCGAGA ACCGGATCTT CAAGCAGCGC AACTGCGACA TTGGCGTAGT CACCGAGGAT GACATCCAGA AGTATGGCTT CTCCGGTGTG ATGGTGCGCG GGTCTGGCCT GGCTTGGGAT TTGCGCCGCG CGCAGCCCTA TGAATGCTAC GACGAGTTCG ATTTCCAGAT CCCGGTCGGC AAGAACGGCG ACTGCTACGA TCGCTATCTG GTGCGGATGG AAGAGATGCG TCAGTCGCTC TCGATCATCC GTCAGGCTAT CGCAAAATTG CGCGAGGCCA CCGGTGACGT TCTGGCCCGT GGCAAGCTCA CCCCGCCTAA GCGCGGCGAT ATGAAGACCT CGATGGAGAG CCTGATCCAC CACTTCAAGC TCTACACCGA AGGCTTCCAT GTTCCCGAGG GCGAGGTCTA TGCCGCTGTC GAGGCGCCCA AAGGCGAATT TGGCGTCTAT CTCGTGGCGG ATGGCAGCAA CAAGCCCTAC CGCGCCAAGC TGCGCGCACC GGGGTTCTTG CATCTTCAAG CGATGGATTA CGTCGCCAAG GGCCACCAGC TTGCGGATGT CGCTGCAATT ATTGGAACCA TGGACATCGT GTTTGGAGAG ATTGACCGAT GA
|
Protein sequence | MDGSKFDDAQ TGEQKIRNFN INFGPQHPAA HGVLRLVLEL DGEIVERCDP HIGLLHRGTE KLMESRTYLQ NLPYFDRLDY VAPMNQEHAW CLAIEKLTGV EVPRRAQLIR VLYSEIGRIL NHLLNITTQA MDVGALTPPL WGFEEREKLM IFYERACGAR LHAAYFRPGG VHQDLPDELL DDIDLWAMEF PKVMDDIDGL LTENRIFKQR NCDIGVVTED DIQKYGFSGV MVRGSGLAWD LRRAQPYECY DEFDFQIPVG KNGDCYDRYL VRMEEMRQSL SIIRQAIAKL REATGDVLAR GKLTPPKRGD MKTSMESLIH HFKLYTEGFH VPEGEVYAAV EAPKGEFGVY LVADGSNKPY RAKLRAPGFL HLQAMDYVAK GHQLADVAAI IGTMDIVFGE IDR
|
| |