Gene Smed_3620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3620 
Symbol 
ID5318151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp55217 
End bp56431 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content61% 
IMG OID640775434 
ProductNADH dehydrogenase I, D subunit 
Protein accessionYP_001312367 
Protein GI150375771 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.761398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAAG TCACCGAGCT CATGAGGCCG GAAGGCGAAG CGCTCAATAC CAAGGAGGTG 
CTTCTCAATC TCGGTCCGCA ACACCCGAGC ACCCATGGGG TTCTGCGGCT CGTGCTGCAA
CTCGACGGCG AATATGTCGA GCGCATCGAC CCCCATATCG GCTATCTGCA CCGTGGCACC
GAAAAGCTGG CGGAGAGCTT CACCTATACG CAAATCTTCC CGCTGACTGA CCGGCTCGAC
TATCTCTGTC CACCTTCGAA CAACCTGGCC TTCGCGCTCG CCGTGGAAAA GCTGCTCGGC
ATAGAGGCCC CGATCCGGGC GCAATACATC CGCGTGATGA TGGCCGAACT CGCAAGGATT
TCCGGCCATC TCCTGATCAC CGGCGCACTG CCGATGGACC TGGGCGCCAT GACCGCGCTG
CTTTACGCCA TGCGAGAGCG CGAAATGATC ATGGACCTCC TGGAAATGAT CACCGGTGCG
CGCATGCACA CGTCCTACTG CCGCGTCGGC GGGGTGCGCG AGGACCTGCC CGACGGGTTC
CTTCCGAAGA TCCGGGAGTT CTGCGAGATA TTCCCGAACA GGATCCGCGA CTATGAGCGC
CTGATAGAGA ACAACCGGGT GTTTCTCAGC CGTACTCAGG GGATCGGCGT GATCTCCGCG
GCGGACGCGG TCGACCTCGG CTTGAGCGGA CCGAACCTGC GTGCCTCCGG CGTCGACTGG
GACATCCGGC GCGACGAACC CTATGAAATC TACGACCGGC TCGATTTTGA CGTCATCACG
CGCGAGGAGG GCGATTGCTA TGCGCGCTGG CTTTGCCGGG TCGACGAGAT GCGAGAGAGC
ATCCGCCTCA TCGAACAATG CATGGAGCAG ATGCCGGAGG GGCCGTTTCA GGTCGATATT
CCGACGATCG CCTTCCCCGT CGATAAAGAG CGCGTGCATT GCTCGATGGA AGCACTGATC
CAGCATTTCG ATCTCTCCGC CTACGGCTTC GACGTACCCG CGGGGGAAGT CTATTCGGTA
ATCGAGGCGC CCAAGGGGGA ACTCGGCTTC TACATCATCA GCGACGGATC GCCAAAGCCG
TTCCGCATGA AGGTGAGGGC CCCGTCCTTC GTCAATCTCC AGGCGCTCTT CGGGGTCACC
AATGCACGTT ACCTCGCCGA TATGATCGCC GTGCTCGGCA GTCTCGACCC GGTGATGGCG
GAGGTGGACA AGTAG
 
Protein sequence
MTEVTELMRP EGEALNTKEV LLNLGPQHPS THGVLRLVLQ LDGEYVERID PHIGYLHRGT 
EKLAESFTYT QIFPLTDRLD YLCPPSNNLA FALAVEKLLG IEAPIRAQYI RVMMAELARI
SGHLLITGAL PMDLGAMTAL LYAMREREMI MDLLEMITGA RMHTSYCRVG GVREDLPDGF
LPKIREFCEI FPNRIRDYER LIENNRVFLS RTQGIGVISA ADAVDLGLSG PNLRASGVDW
DIRRDEPYEI YDRLDFDVIT REEGDCYARW LCRVDEMRES IRLIEQCMEQ MPEGPFQVDI
PTIAFPVDKE RVHCSMEALI QHFDLSAYGF DVPAGEVYSV IEAPKGELGF YIISDGSPKP
FRMKVRAPSF VNLQALFGVT NARYLADMIA VLGSLDPVMA EVDK