Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0746 |
Symbol | |
ID | 4076155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 803081 |
End bp | 804277 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006043 |
Product | NADH dehydrogenase subunit E |
Protein accession | YP_612741 |
Protein GI | 99080587 |
COG category | [C] Energy production and conversion [S] Function unknown |
COG ID | [COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit [COG3743] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01958] NADH-quinone oxidoreductase, E subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.331264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGTC GTCTTCATCA CGAACAGCCT GACAGCTTTG CGTTCACCCC GGCCAATCAG GCCTGGGCCG AAGCTCAGAT GACCAAATAC CCCGAAGGCC GTCAGGCCTC GGCGGTCATT CCGATTCTGT GGCGCGCTCA GGAGCAAGAG GGCTGGATTT CCAAACCCGC GATCGAATAT GTGGCCGATA TGCTCGGCAT GGCCTACATC CGCGTGCTGG AAGTGGCCTC TTTCTATTTC ATGTTCCAGC TGCAGCCCAC GGGTTCGGTT GCGCATATCC AGATCTGTGG CACCACGTCC TGCATGATCT GTGGTGCCGA GGATCTGGTC GCGATCTGCA AGGACAAGAT CTCTGCCAAG CCGCATACGC TGTCCGAGGA CGGCAAGTTC TCTTGGGAAG AGGTAGAATG CCTTGGCTCC TGCGCCAATG CGCCGATGGC GCAGATCGGC AAGGATTATT ACGAGGATCT GACCGCGGCA TCTTTTACCA AGCTGCTCGA TGATCTGGCG GCGGGTAAAC CCGTCGTACC CGGCCCGCAA AACGGTCGCT ACGCTGCAGA GCCAAAGGCG GGCCTGACCT CGCTCACCGA ATATGAAGCA GGCAAGCCGC AGTATAATGC CTCGGCGGAG CTTGCGACCG AAATCGGTGA CGGTGTGAAG CGTATTCAGG GCGATGAAGT TCCGCTCCTG ACCCCATGGG TCGGCAAGGA TGGCGTGGTT GCAGGGCGTG CCGCTGCAGA TCCGACGCCG CCCGCGCCAG AGCGTCCGCA ACCCGCGGCC AAGCAGGCTG AGACCGCCAA GAAGAAGGCT CCGGCCAAGC CTGCGGTCAA GAAATCGGAT GCGGCAACGC CTGCACAGCC CGAAGCCGCC GCCGCAAAGG TGACGGAACC CAAGGCGGAC TTGGAAGAAC AAGCACCCGA GACGCTGACA GCGGCGCGCG AGGGCGGGGC GGACGATCTC AAGCTCCTCA AAGGTGTGGG GCCAAAGCTC GAACAGACGC TCAATGAGCT GGGCTTTTTC CACTTTGACC AGATTGCCAA ATGGACCGAG GCCGAGGTGG CCTGGGTGGA TGCGCGCCTG AAGTTCAAAG GCCGCATCGA GCGCGACGGC TGGATCGAGC AAGCCAAGCA ACTGGCAGCC GGTGAAGAAA CCGAGTTTGC CAAATCGGCC AAGAAAGACG GCCGCTACAA AGACTAA
|
Protein sequence | MLRRLHHEQP DSFAFTPANQ AWAEAQMTKY PEGRQASAVI PILWRAQEQE GWISKPAIEY VADMLGMAYI RVLEVASFYF MFQLQPTGSV AHIQICGTTS CMICGAEDLV AICKDKISAK PHTLSEDGKF SWEEVECLGS CANAPMAQIG KDYYEDLTAA SFTKLLDDLA AGKPVVPGPQ NGRYAAEPKA GLTSLTEYEA GKPQYNASAE LATEIGDGVK RIQGDEVPLL TPWVGKDGVV AGRAAADPTP PAPERPQPAA KQAETAKKKA PAKPAVKKSD AATPAQPEAA AAKVTEPKAD LEEQAPETLT AAREGGADDL KLLKGVGPKL EQTLNELGFF HFDQIAKWTE AEVAWVDARL KFKGRIERDG WIEQAKQLAA GEETEFAKSA KKDGRYKD
|
| |