Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0753 |
Symbol | |
ID | 4076162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 809184 |
End bp | 810221 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638006050 |
Product | NADH dehydrogenase subunit H |
Protein accession | YP_612748 |
Protein GI | 99080594 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.112017 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAAT TCTTTAACAC CCCAGGCGGC ATTGCTGTCA TCATCCTGGC GCAGACCCTG GCGGTTGTCG CCTTTGTGAT GATCTCGCTT CTGTTCCTTG TCTACGGCGA CCGCAAGATC TGGGCCGCCG TCCAGATGCG GCGCGGCCCC AACGTGGTTG GGGTTTACGG TCTGTTGCAG ACCGTCGCGG ATGCGCTCAA ATATGTTGTG AAAGAGGTGG TCATCCCGGC CGGCTCTGAC CGGACAGTCT TCATTCTGGC ACCGCTCACC TCCTTTGTGC TGGCGATGAT CGCCTGGGCG GTGATCCCGT TCAACGACAC TTGGGTGCTC TCGGACATCA ACGTCGCCAT CCTGTATGTA TTCGCGGTCT CCTCGCTTGA GGTTTACGGC GTCATCATGG GCGGCTGGGC TTCGAACTCC AAGTATCCGT TCCTCGGCTC CTTGCGGTCG GCGGCGCAGA TGATTTCCTA CGAGGTCTCC ATCGGCCTCA TCATCATCGG TGTGATCCTC TCGACCGGGT CCATGAACTT TGGCGATATC GTGCGCGCGC AGGACGGGGA TGCTGGCCTC TTCAACTGGT ACTGGCTGCC GCATTTCCCG ATGGTGTTCC TGTTCTTCAT CTCCTGCCTT GCGGAAACCA ACCGCCCGCC GTTTGACCTT CCCGAAGCGG AGTCGGAACT GGTGGCAGGC TACCAGGTGG AATACTCCTC AACGCCGTTC CTGTTGTTCA TGGCCGGTGA ATACATTGCC ATCTTCCTGA TGTGCGCGCT CACCTCGCTT CTGTTTTTCG GCGGCTGGCT CTCGCCGGTA CCGTTCCTGC CGGACAGCCC GCTGTGGATG GTCGCAAAGA TGGCGTTCTT CTTCTTCCTC TTTGCCATGG TCAAAGCCAT CACCCCGCGC TACCGCTACG ATCAGTTGAT GCGTCTGGGC TGGAAAGTCT TCCTGCCGTT CTCGCTGATC TGGGTGGTGT TCGTGGCCTT TGCCGCACGT TTCGAATGGT TCTGGGGTGC GTTTGCACGC TGGAGCACAG GAGGCTGA
|
Protein sequence | MAEFFNTPGG IAVIILAQTL AVVAFVMISL LFLVYGDRKI WAAVQMRRGP NVVGVYGLLQ TVADALKYVV KEVVIPAGSD RTVFILAPLT SFVLAMIAWA VIPFNDTWVL SDINVAILYV FAVSSLEVYG VIMGGWASNS KYPFLGSLRS AAQMISYEVS IGLIIIGVIL STGSMNFGDI VRAQDGDAGL FNWYWLPHFP MVFLFFISCL AETNRPPFDL PEAESELVAG YQVEYSSTPF LLFMAGEYIA IFLMCALTSL LFFGGWLSPV PFLPDSPLWM VAKMAFFFFL FAMVKAITPR YRYDQLMRLG WKVFLPFSLI WVVFVAFAAR FEWFWGAFAR WSTGG
|
| |