Gene Smed_0891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0891 
Symbol 
ID5321732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp962007 
End bp963197 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content62% 
IMG OID640789831 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_001326581 
Protein GI150396114 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.984899 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAC ATAACGTCCG CAACTTCAAC ATCAACTTCG GCCCGCAGCA TCCGGCGGCG 
CACGGCGTTC TGCGTCTGGT GCTGGAGCTT GACGGCGAGA TCGTCGAGCG TGTCGACCCG
CATATAGGCC TTTTGCATCG CGGCACGGAG AAGCTGATCG AGGCCAAGAC CTATCTGCAG
GCGATTCCTT ATTTCGACCG GCTCGACTAT GTCGCGCCGA TGAACCAGGA GCACGCTTTC
GCGCTTGCAG TGGAGAGGTT GACGGGCACG CAGGTGCCGA TCCGCGGTCA GCTGATCCGG
GTTCTCTATT CCGAGATCGG TCGTATCCTT TCGCACCTCC TCAATGTGAC CACCCAGGCC
ATGGATGTCG GCGCGCTTAC CCCGCCGCTC TGGGGCTTCG AGGAGCGCGA GAAGCTGATG
GTCTTTTACG AGCGCGCCTG CGGCGCGCGC ATGCATGCCG CATATTTCCG CCCGGGCGGC
GTGCACCAGG ATCTGCCGCA TCAACTGGTC GAGGATATCG GCAAGTGGAT CGACCCGTTC
CTGAAGACCG TCGACGATAT CGACGAGCTT CTGACCGGCA ACCGCATCTT CAAGCAGCGC
AACGTCGATA TCGGCGTCGT CAGCCTGGAA GACGCCTGGG CCTGGGGTTT CTCGGGCGTC
ATGGTGCGCG GCTCGGGCGC GGCCTGGGAC CTGCGCCGTT CGCAGCCCTA TGAATGCTAT
TCCGACCTGG AGTTCGACAT TCCGATCGGC AAGAACGGCG ATTGCTTCGA CCGTTATCTC
ATCCGGATGA TCGAGATGCG GGAGTCCGCC CGCATCATGC GCCAATGCGT CGATCGCCTG
CTGGGCGATG CCAAGGTCGG TCCGGTCTCC TCGCTCGACG GCAAGATCGT GCCGCCGAAG
CGGGGCGAGA TGAAGCGGTC GATGGAGGCG CTGATCCATC ACTTCAAGCT CTACACCGAG
GGCTATCACG TGCCGGCCGG CGACGTTTAT GCCGCGGTCG AGGCGCCCAA GGGCGAGTTC
GGCGTCTATC TCGTATCCGA CGGCACCAAC AAGCCCTACC GCTGCAAGAT ACGTGCACCG
GGCTACGCCC ATCTTCAGGC GATGGATTTC CTCTGTCGCG GACACCAGCT TGCCGATGTT
TCGGCCGTGC TGGGCTCTCT CGATATCGTT TTCGGCGAGG TGGATCGCTG A
 
Protein sequence
MTEHNVRNFN INFGPQHPAA HGVLRLVLEL DGEIVERVDP HIGLLHRGTE KLIEAKTYLQ 
AIPYFDRLDY VAPMNQEHAF ALAVERLTGT QVPIRGQLIR VLYSEIGRIL SHLLNVTTQA
MDVGALTPPL WGFEEREKLM VFYERACGAR MHAAYFRPGG VHQDLPHQLV EDIGKWIDPF
LKTVDDIDEL LTGNRIFKQR NVDIGVVSLE DAWAWGFSGV MVRGSGAAWD LRRSQPYECY
SDLEFDIPIG KNGDCFDRYL IRMIEMRESA RIMRQCVDRL LGDAKVGPVS SLDGKIVPPK
RGEMKRSMEA LIHHFKLYTE GYHVPAGDVY AAVEAPKGEF GVYLVSDGTN KPYRCKIRAP
GYAHLQAMDF LCRGHQLADV SAVLGSLDIV FGEVDR