Gene Smed_5986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5986 
Symbol 
ID5320288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp942790 
End bp944661 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content60% 
IMG OID640777662 
Productputative dehydrogenase large subunit protein 
Protein accessionYP_001314594 
Protein GI150377999 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.420299 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTTTTC CCGTCGAATC GGCGCTGAAA GCCGATGCCA ACACCGTCGC AGCGACCGCC 
GACTACGATA TCGTGATCGT CGGAACAGGC ATTTCCGGAG CAATCATCGC CAAGCAGGCT
GCGGAAGCGG GCAAGCGTGT CCTCATCCTA GAAGCCGGAA CCGGTGCCAA TAGAACTCTG
GCCGGCTATG ACGATCTGCT GACCACCTTC TATTTGGCAG CCGGCAAGGA TAACCAGTCG
CCCTTCCCGC TGAATGCCAA CGCGGCCATA CCCCGCAGCC CGCAGCTTCG AAAGCTGCAG
GCGGGGGAAA CCGATAGCTC GACGTACATC GTTCAATCCG GCCCTTATGT CAGTGATACG
ACATATACCC GAATTTTCGG CGGAACGACG ATGCACTGGG AGGCGAAAAC CCCGCGCATG
CTTCGCTCGG ATTTCCAGGC ACGCACCATT TTCGGCCAGG GGCTGGACTG GCCACTGAGC
TTTGAGGAAA TCGAGGATGA CTACCGTCTG GCCGAGCGGG AAATAGGCGT ATCGGCGAAC
GTTGAAGACC AGCAATATCT GGGGCAGACC TTCCCGGACG GCTACGTCTT CCCGATGCGC
GGCCTGCCGC TGTCCTACCT GGACCAGCAG GTCAACAAGG GTATCGAAGG CACCAGTGTC
GAGCTTTACG GCGAGACCTA TCCCCTGAAG GTCAGGCCCT ATCCCCAGGG GCGCAACAGC
ATACCAAACC CGGCCTATGA TGGTGGGAAG GGTTATCGTC CAATTGGCGC CGTTAATACG
CATCAGGTCG AAGAGGGTGG TCGCTGCCAG GGTAACACCA ACTGCGTCCC GCTCTGTACT
GTGCAAGCGC GCTACCACTC CGGCAAAACG CTTGCCAAGG CGTTCGCGGT AAACGGGGAA
AGGCGCACGC CGCTTGTTGA ATTCTTGCCG CAGGCGGTCG CATCGAAGGT CAACATTGAT
CCGGACAGCG GGAAAGTGCG GTCTCTGGAG GTGAAGGTTT ACAAAGACCC GGCCTCACCC
GCCTACGAGA CCTTCACCGT GAAGGGCAAG GTTTTCGTGC TTGCGGCAGG CGCCATTGAA
ACGGCGCGTC TCATGCTGGC CTCCGGCCTG CGCAGCACCA GCGGCCTTGT CGGACGCAAT
CTGATGGACC ACGCCTATCT GCTGAATTGG GCGCTGATGC CGCAAATCTG CGGTACGATG
CGCGGAACGA GTTCGACGGG CGGTATCGTG GACCTACGGG ACGGTCCTTT CCGTGAGAGG
CAGGCCGCCT TCGCCATTGA TATCCATAAC GACGGCTGGG GCTGGGCCAC GGGCGCGCCG
ACCTCGGACC TTCTCGAACT GGTGGATGAT CGCAACCTGC ACGGGGGGGA TCTTCGGCGC
GGCGTGATCG ACCGGGTTTC GCGGCAGTTG CTGCTGGCAT TCATGATCGA GGTCATGCCG
GTCGAAAGCA ATCGCATCGA GGTGGACCCG AAGTATAGGG ACGCGTTGGA CAATATGCGG
CCCATCCTGT CCTTCACGGT TCCGGAATAT ACCATGAAGG GTGCCGCGTA TGCCCGCCAG
TTTTCGCGCA CCGTGTTTGC GCGTATGGGC GCGCAGGACC ACACCCATTA CGACCCAAGC
GATTTCGGCT ATGTCGCCTA TGACAAGCAA GGCTATGCAA TCCGAGGCGG CAATCATCTG
GCCGGCACCC ATATCATGGG AACGACGAAG ACCAACTCCG TTGTGGACAA GAACCAGCGC
AGCTGGGACC ACGAAAACCT TTATCTCGTG GGCGGCGGCA GCATGCCGAC GATCGGCACG
GCCAATGTCA CGTTGACGCT GGCCGCCATG TGCTTCCGAA GCAGCCGCGA CATTCTAAAG
TCACTGCATT GA
 
Protein sequence
MLFPVESALK ADANTVAATA DYDIVIVGTG ISGAIIAKQA AEAGKRVLIL EAGTGANRTL 
AGYDDLLTTF YLAAGKDNQS PFPLNANAAI PRSPQLRKLQ AGETDSSTYI VQSGPYVSDT
TYTRIFGGTT MHWEAKTPRM LRSDFQARTI FGQGLDWPLS FEEIEDDYRL AEREIGVSAN
VEDQQYLGQT FPDGYVFPMR GLPLSYLDQQ VNKGIEGTSV ELYGETYPLK VRPYPQGRNS
IPNPAYDGGK GYRPIGAVNT HQVEEGGRCQ GNTNCVPLCT VQARYHSGKT LAKAFAVNGE
RRTPLVEFLP QAVASKVNID PDSGKVRSLE VKVYKDPASP AYETFTVKGK VFVLAAGAIE
TARLMLASGL RSTSGLVGRN LMDHAYLLNW ALMPQICGTM RGTSSTGGIV DLRDGPFRER
QAAFAIDIHN DGWGWATGAP TSDLLELVDD RNLHGGDLRR GVIDRVSRQL LLAFMIEVMP
VESNRIEVDP KYRDALDNMR PILSFTVPEY TMKGAAYARQ FSRTVFARMG AQDHTHYDPS
DFGYVAYDKQ GYAIRGGNHL AGTHIMGTTK TNSVVDKNQR SWDHENLYLV GGGSMPTIGT
ANVTLTLAAM CFRSSRDILK SLH