Gene TM1040_0744 details

Gene Information

Locus tag: TM1040_0744
Symbol:
ID: 4076153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 801379
End bp: 802590
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 60%
IMG OID: 638006041
Product: NADH dehydrogenase subunit D
Protein accession: YP_612739
Protein GI: 99080585
COG category: [C] Energy production and conversion
COG ID: [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7
TIGRFAM ID: [TIGR01962] NADH dehydrogenase I, D subunit
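
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are chosen here and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 801379
end_bp = 802590
protein_length_aa = 403

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)                                 # 1212
print(gene_length_bp % 3)                             # 0
print(expected_protein_length == protein_length_aa)   # True
```

Under those assumptions the 1212 bp gene length and the 403 aa protein length agree with each other.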


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGCT CCAAATTCGA CGACGCCCAG ACGGGCGAAC AGAAAATCCG TAACTTCAAC 
ATCAACTTCG GCCCGCAGCA CCCTGCGGCG CACGGCGTGC TGCGTCTGGT GCTGGAACTG
GATGGCGAGA TCGTGGAACG CTGCGACCCG CACATCGGTC TTCTGCACCG TGGCACCGAA
AAGCTGATGG AAAGCCGCAC CTACCTGCAG AACCTGCCGT ATTTCGACCG CCTCGACTAT
GTGGCGCCGA TGAACCAGGA GCACGCTTGG TGTCTGGCAA TCGAAAAGCT GACTGGCGTG
GAGGTCCCCC GCCGTGCGCA GCTGATCCGA GTGCTCTATT CTGAGATCGG CCGTATCCTC
AATCACCTCT TGAACATCAC CACTCAGGCG ATGGACGTGG GCGCGCTGAC GCCGCCGCTC
TGGGGCTTTG AGGAACGCGA GAAGCTGATG ATCTTCTACG AGCGGGCCTG TGGTGCACGC
TTGCACGCGG CCTACTTCCG CCCTGGTGGC GTGCATCAGG ATCTGCCGGA CGAGCTGCTG
GATGATATCG ACCTCTGGGC GATGGAATTT CCGAAGGTCA TGGACGACAT CGACGGCCTC
TTGACCGAGA ACCGGATCTT CAAGCAGCGC AACTGCGACA TTGGCGTAGT CACCGAGGAT
GACATCCAGA AGTATGGCTT CTCCGGTGTG ATGGTGCGCG GGTCTGGCCT GGCTTGGGAT
TTGCGCCGCG CGCAGCCCTA TGAATGCTAC GACGAGTTCG ATTTCCAGAT CCCGGTCGGC
AAGAACGGCG ACTGCTACGA TCGCTATCTG GTGCGGATGG AAGAGATGCG TCAGTCGCTC
TCGATCATCC GTCAGGCTAT CGCAAAATTG CGCGAGGCCA CCGGTGACGT TCTGGCCCGT
GGCAAGCTCA CCCCGCCTAA GCGCGGCGAT ATGAAGACCT CGATGGAGAG CCTGATCCAC
CACTTCAAGC TCTACACCGA AGGCTTCCAT GTTCCCGAGG GCGAGGTCTA TGCCGCTGTC
GAGGCGCCCA AAGGCGAATT TGGCGTCTAT CTCGTGGCGG ATGGCAGCAA CAAGCCCTAC
CGCGCCAAGC TGCGCGCACC GGGGTTCTTG CATCTTCAAG CGATGGATTA CGTCGCCAAG
GGCCACCAGC TTGCGGATGT CGCTGCAATT ATTGGAACCA TGGACATCGT GTTTGGAGAG
ATTGACCGAT GA
 
Protein sequence
MDGSKFDDAQ TGEQKIRNFN INFGPQHPAA HGVLRLVLEL DGEIVERCDP HIGLLHRGTE 
KLMESRTYLQ NLPYFDRLDY VAPMNQEHAW CLAIEKLTGV EVPRRAQLIR VLYSEIGRIL
NHLLNITTQA MDVGALTPPL WGFEEREKLM IFYERACGAR LHAAYFRPGG VHQDLPDELL
DDIDLWAMEF PKVMDDIDGL LTENRIFKQR NCDIGVVTED DIQKYGFSGV MVRGSGLAWD
LRRAQPYECY DEFDFQIPVG KNGDCYDRYL VRMEEMRQSL SIIRQAIAKL REATGDVLAR
GKLTPPKRGD MKTSMESLIH HFKLYTEGFH VPEGEVYAAV EAPKGEFGVY LVADGSNKPY
RAKLRAPGFL HLQAMDYVAK GHQLADVAAI IGTMDIVFGE IDR
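
The gene and protein sequences can be checked against each other by translating the DNA. The following is a minimal sketch, not the method the database used: `CODON_TABLE` and `translate` are hypothetical names introduced here. Translation table 11 assigns amino acids to codons the same way as the standard code and differs in the permitted start codons, so the standard assignments are used. Only the first and last lines of the gene sequence are translated, to keep the example short.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Translation table 11 uses these same assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGACGGCT CCAAATTCGA CGACGCCCAG ACGGGCGAAC AGAAAATCCG TAACTTCAAC"
last_line = "ATTGACCGAT GA"

print(translate(first_line))  # MDGSKFDDAQTGEQKIRNFN
print(translate(last_line))   # IDR
```

Both results match the start and end of the protein sequence listed above. The last line can be translated on its own because every full line holds 60 bases, a multiple of three, so it begins in frame.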