Gene Smed_0894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0894 
Symbol 
ID5321735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp964304 
End bp965608 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content63% 
IMG OID640789834 
ProductNADH dehydrogenase I subunit F 
Protein accessionYP_001326584 
Protein GI150396117 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.13206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAAAG ACGAAGATCG CATCTTTACC AACATCTACG GCCTCATGGA CAAGTCGCTC 
AAGGGCGCGA TGGCGCGAGG CCATTGGGAC GGCACGAAGC AGTTCCTGGA AAAGGGCCGC
GACTGGATCA TCAACGAGGT GAAGGCTTCC GGCCTCCGCG GCCGCGGCGG CGCCGGCTTC
CCGACCGGTC TCAAATGGTC CTTCATGCCG AAGGAGAGCG ACGGGCGCCC GCATTACCTC
GTCGTCAATG CCGACGAGTC CGAGCCCGGC ACCTGCAAGG ACCGCGACAT CATGCGCCAC
GATCCGCACA CGCTGATCGA GGGCTGCGTG ATTGCGAGCT TCGCGATGGG TGCGCATGCC
GCCTATATCT ATGTTCGCGG CGAGTTCATC CGCGAGCGCG AAGCGCTGCA GGCTGCGATC
GACGAATGTT ACGCATACGG CCTGCTCGGA AAGAACAACA AGCTCGGCTA CGACATCGAT
ATCTTCGTGC ATCACGGCGC CGGCGCCTAT ATCTGCGGCG AGGAAACCGC GCTGCTCGAG
AGCCTTGAAG GCAAGAAAGG CCAGCCGCGC CTGAAGCCGC CTTTCCCCGC GAATATGGGC
CTTTACGGCT GCCCGACGAC TGTCAACAAC GTCGAGTCGA TCGCGGTTAC GCCGACCATC
CTGCGCCGGG GCGCCGGCTG GTATACGAGC TTCGGCCGCC CGAACAATCA CGGCACCAAG
CTCTATTCGG TTTCCGGACA CGTCAATCGC CCGTGCACGG TCGAGGATGC GATGTCCATC
CCCTTCCATG AGCTTATCGA GAAGCACTGC GGCGGCATTC GCGGCGGCTG GGACAATCTG
CTTGCCGTCA TTCCCGGCGG CTCTTCGGTC CCCTGCGTGC CCGGCGCGCA GATGAAGGAC
GCGATCATGG ATTATGACGG CCTGCGCGAG CTCGGATCGG GTCTCGGAAC GGCTGCCGTC
ATCGTCATGG ACAAGTCGAC CGACATCATC AAGGCGATCT GGCGGCTTTC GGCTTTCTAC
AAGCATGAGA GCTGCGGTCA GTGCACGCCC TGCCGCGAAG GCACCGGCTG GATGATGCGC
GTGATGGAGC GCATGGTGCA GGGCCGTGCC CAGAAGCGCG AGATCGATAT GCTCTTCGAC
GTGACGAAAC AGGTCGAAGG CCACACGATC TGCGCGCTGG GCGATGCGGC GGCCTGGCCG
ATCCAGGGCC TCATCAAGCA TTTCCGCCCG GAAATGGAGA AGCGGATAGA CGAATACACC
CGCAACGCGA CTTCGCAAGG CGCGGTGCTG GAGGCAGCGG AGTAA
 
Protein sequence
MLKDEDRIFT NIYGLMDKSL KGAMARGHWD GTKQFLEKGR DWIINEVKAS GLRGRGGAGF 
PTGLKWSFMP KESDGRPHYL VVNADESEPG TCKDRDIMRH DPHTLIEGCV IASFAMGAHA
AYIYVRGEFI REREALQAAI DECYAYGLLG KNNKLGYDID IFVHHGAGAY ICGEETALLE
SLEGKKGQPR LKPPFPANMG LYGCPTTVNN VESIAVTPTI LRRGAGWYTS FGRPNNHGTK
LYSVSGHVNR PCTVEDAMSI PFHELIEKHC GGIRGGWDNL LAVIPGGSSV PCVPGAQMKD
AIMDYDGLRE LGSGLGTAAV IVMDKSTDII KAIWRLSAFY KHESCGQCTP CREGTGWMMR
VMERMVQGRA QKREIDMLFD VTKQVEGHTI CALGDAAAWP IQGLIKHFRP EMEKRIDEYT
RNATSQGAVL EAAE