Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5986 |
Symbol | |
ID | 5320288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 942790 |
End bp | 944661 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640777662 |
Product | putative dehydrogenase large subunit protein |
Protein accession | YP_001314594 |
Protein GI | 150377999 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.420299 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTTTTC CCGTCGAATC GGCGCTGAAA GCCGATGCCA ACACCGTCGC AGCGACCGCC GACTACGATA TCGTGATCGT CGGAACAGGC ATTTCCGGAG CAATCATCGC CAAGCAGGCT GCGGAAGCGG GCAAGCGTGT CCTCATCCTA GAAGCCGGAA CCGGTGCCAA TAGAACTCTG GCCGGCTATG ACGATCTGCT GACCACCTTC TATTTGGCAG CCGGCAAGGA TAACCAGTCG CCCTTCCCGC TGAATGCCAA CGCGGCCATA CCCCGCAGCC CGCAGCTTCG AAAGCTGCAG GCGGGGGAAA CCGATAGCTC GACGTACATC GTTCAATCCG GCCCTTATGT CAGTGATACG ACATATACCC GAATTTTCGG CGGAACGACG ATGCACTGGG AGGCGAAAAC CCCGCGCATG CTTCGCTCGG ATTTCCAGGC ACGCACCATT TTCGGCCAGG GGCTGGACTG GCCACTGAGC TTTGAGGAAA TCGAGGATGA CTACCGTCTG GCCGAGCGGG AAATAGGCGT ATCGGCGAAC GTTGAAGACC AGCAATATCT GGGGCAGACC TTCCCGGACG GCTACGTCTT CCCGATGCGC GGCCTGCCGC TGTCCTACCT GGACCAGCAG GTCAACAAGG GTATCGAAGG CACCAGTGTC GAGCTTTACG GCGAGACCTA TCCCCTGAAG GTCAGGCCCT ATCCCCAGGG GCGCAACAGC ATACCAAACC CGGCCTATGA TGGTGGGAAG GGTTATCGTC CAATTGGCGC CGTTAATACG CATCAGGTCG AAGAGGGTGG TCGCTGCCAG GGTAACACCA ACTGCGTCCC GCTCTGTACT GTGCAAGCGC GCTACCACTC CGGCAAAACG CTTGCCAAGG CGTTCGCGGT AAACGGGGAA AGGCGCACGC CGCTTGTTGA ATTCTTGCCG CAGGCGGTCG CATCGAAGGT CAACATTGAT CCGGACAGCG GGAAAGTGCG GTCTCTGGAG GTGAAGGTTT ACAAAGACCC GGCCTCACCC GCCTACGAGA CCTTCACCGT GAAGGGCAAG GTTTTCGTGC TTGCGGCAGG CGCCATTGAA ACGGCGCGTC TCATGCTGGC CTCCGGCCTG CGCAGCACCA GCGGCCTTGT CGGACGCAAT CTGATGGACC ACGCCTATCT GCTGAATTGG GCGCTGATGC CGCAAATCTG CGGTACGATG CGCGGAACGA GTTCGACGGG CGGTATCGTG GACCTACGGG ACGGTCCTTT CCGTGAGAGG CAGGCCGCCT TCGCCATTGA TATCCATAAC GACGGCTGGG GCTGGGCCAC GGGCGCGCCG ACCTCGGACC TTCTCGAACT GGTGGATGAT CGCAACCTGC ACGGGGGGGA TCTTCGGCGC GGCGTGATCG ACCGGGTTTC GCGGCAGTTG CTGCTGGCAT TCATGATCGA GGTCATGCCG GTCGAAAGCA ATCGCATCGA GGTGGACCCG AAGTATAGGG ACGCGTTGGA CAATATGCGG CCCATCCTGT CCTTCACGGT TCCGGAATAT ACCATGAAGG GTGCCGCGTA TGCCCGCCAG TTTTCGCGCA CCGTGTTTGC GCGTATGGGC GCGCAGGACC ACACCCATTA CGACCCAAGC GATTTCGGCT ATGTCGCCTA TGACAAGCAA GGCTATGCAA TCCGAGGCGG CAATCATCTG GCCGGCACCC ATATCATGGG AACGACGAAG ACCAACTCCG TTGTGGACAA GAACCAGCGC AGCTGGGACC ACGAAAACCT TTATCTCGTG GGCGGCGGCA GCATGCCGAC GATCGGCACG GCCAATGTCA CGTTGACGCT GGCCGCCATG TGCTTCCGAA GCAGCCGCGA CATTCTAAAG TCACTGCATT GA
|
Protein sequence | MLFPVESALK ADANTVAATA DYDIVIVGTG ISGAIIAKQA AEAGKRVLIL EAGTGANRTL AGYDDLLTTF YLAAGKDNQS PFPLNANAAI PRSPQLRKLQ AGETDSSTYI VQSGPYVSDT TYTRIFGGTT MHWEAKTPRM LRSDFQARTI FGQGLDWPLS FEEIEDDYRL AEREIGVSAN VEDQQYLGQT FPDGYVFPMR GLPLSYLDQQ VNKGIEGTSV ELYGETYPLK VRPYPQGRNS IPNPAYDGGK GYRPIGAVNT HQVEEGGRCQ GNTNCVPLCT VQARYHSGKT LAKAFAVNGE RRTPLVEFLP QAVASKVNID PDSGKVRSLE VKVYKDPASP AYETFTVKGK VFVLAAGAIE TARLMLASGL RSTSGLVGRN LMDHAYLLNW ALMPQICGTM RGTSSTGGIV DLRDGPFRER QAAFAIDIHN DGWGWATGAP TSDLLELVDD RNLHGGDLRR GVIDRVSRQL LLAFMIEVMP VESNRIEVDP KYRDALDNMR PILSFTVPEY TMKGAAYARQ FSRTVFARMG AQDHTHYDPS DFGYVAYDKQ GYAIRGGNHL AGTHIMGTTK TNSVVDKNQR SWDHENLYLV GGGSMPTIGT ANVTLTLAAM CFRSSRDILK SLH
|
| |