Gene Information |
Locus tag | Smed_1817 |
Symbol | |
ID | 5322675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1898095 |
End bp | 1899693 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640790755 |
Product | hypothetical protein |
Protein accession | YP_001327487 |
Protein GI | 150397020 |
COG category | |
COG ID | |
TIGRFAM ID | |
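
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the listed gene sequence does end in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Smed_1817.
length = gene_length(1898095, 1899693)
print(length, protein_length(length))  # prints: 1599 532
```

Both results match the Gene Length (1599 bp) and Protein Length (532 aa) fields in the record.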
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0270492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00149176 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGGCCTTT ACGGATCCGC TCCGGAAGCA CCGGACCCGA AGCAAACGGC GTCCGCGCAG ACTGCGACGA ACATCGGAAC CGCGGTTGCC AACAACGTCA TGGGCAACGC CAACCAGGTC ACGCCAGACG GCAACCTGAC CTATACCTAT AACACTCAGA AGTGGACCGA TCCTCTCAGC GGGAAAGAAT ACGACCTCAA GGTTCCGACG GCGACACAGA CACTTTCGCC CGCGCAGCAG GCGATCAAGG ACCAGGAGGA CGCCGCCCAG CTGAACCTGG CGACGCTTGC CAACACCCAA TCGGGAAAGC TCAACGGCCT TCTCGCCAGT AAGTTCGACA TATCCGGCGC TCCAGCGGCC GGAAAGTCGG ACGCGATCGG GCTGCCGCAG TATCAGAGCT TCACGAGCGG TCCGAAGCTG CAGACCAGCC TCGCAAATGC CGGCAACGTT CAAAGCTCGA TTGCAGGTGC CGGTTCCATA CAGAGCCAGG TTGCGGACAG CGGCAAGATC CAGACTTCGC TTGGCAATGC CGGGAACATC ACCGAGAGCT ATGATTTCGA CATCGACACG TCGAAATACG AACAGGCGCT GATGGACCGC CTCAGCCCGC AGATCGAGCG GGACCGCGCC GCCCTTGAAA CGAAGCTGAC CAACCAGGGA CTGCAGCCGG GCTCCGAGGC CTATGACCGT GCGATGGACG AGGCGAACCG CGCGGCGAAC GACGCCCGGA TAGGGGCAAC CCTGAGTGCC GGGCAGGAGC AATCGCGGAT CGCCGGTCTG GCGCAGAACC AGGCGCAGTT CCAGAATTCG GCACAGCAGC AAGCCTACGA CCAGATGACC GGACTGGCCC AGTTCTACAA TTCGGCGCAG GCACAGCAAT ACGCGCAGAA CGCGAACGAC ATGCAGATGG GAAACGCCGC TCAGCAGCAG CAGTTTTCGC AGAACCAGGC GCAGATGCAG GCCAACAATG CAGGTCAGGA GCAGAAATTC AACCAGGGAC TGACCGCGGC GCAGTTCGGA AACGACGCCC TGCAGCAGCA GTACCAGAAC CAGAACACGG CGACGGGCGG CAACAATGCT CTGGCGGATC AGAGATTCAA TTCTCAGCAG GCGAAGTACA ACCTGCAAAA CCAGGAGCGG GCACAATATC TGAACGAGCT TTACGCGCAG CGCAACCAGC CGATCAACGA GATCGTCGGG CTGATGTCCG GGGCGCAGGT CGACAGCCCG AGCTTCGTGC CGACCCAGAG CAACCCCATG CCGACCGTCG ATTATGCCGG GCTGGTGCAA CAGGACTATG CCAACAAGAT GGGCGCCTAC CAGCAGAAGC AAAGCACGAT GCAGAACCTC TTTGGCGGCA TGCTCGGTTT CGGCGGGCAA CTGGCCAGCC TCTCGGACAA GCGCGCGAAG AAGGACATCA AGAAAGTCGG CGGCCTCTAC GAGTACAGGT ACAAAGGTGA AGGCAGGAAC GCTCCCAAGC GGATAGGCGT GATGGCGCAG GAGGTGGAAA AAGTGCGCCC CGACGCTGTC GCCAAGGGCG CCGATGGCCT GCGGCGCGTG GATTACGGAC TGCTCTTCAA CGCAGGGAGA GGCAAATGA
Protein sequence | MGLYGSAPEA PDPKQTASAQ TATNIGTAVA NNVMGNANQV TPDGNLTYTY NTQKWTDPLS GKEYDLKVPT ATQTLSPAQQ AIKDQEDAAQ LNLATLANTQ SGKLNGLLAS KFDISGAPAA GKSDAIGLPQ YQSFTSGPKL QTSLANAGNV QSSIAGAGSI QSQVADSGKI QTSLGNAGNI TESYDFDIDT SKYEQALMDR LSPQIERDRA ALETKLTNQG LQPGSEAYDR AMDEANRAAN DARIGATLSA GQEQSRIAGL AQNQAQFQNS AQQQAYDQMT GLAQFYNSAQ AQQYAQNAND MQMGNAAQQQ QFSQNQAQMQ ANNAGQEQKF NQGLTAAQFG NDALQQQYQN QNTATGGNNA LADQRFNSQQ AKYNLQNQER AQYLNELYAQ RNQPINEIVG LMSGAQVDSP SFVPTQSNPM PTVDYAGLVQ QDYANKMGAY QQKQSTMQNL FGGMLGFGGQ LASLSDKRAK KDIKKVGGLY EYRYKGEGRN APKRIGVMAQ EVEKVRPDAV AKGADGLRRV DYGLLFNAGR GK
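
Both sequences are listed in space-separated groups of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of that cleanup, with a simple GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the stop codons are those of translation table 11 (TAA, TAG, TGA). Note that the gene sequence is listed in coding orientation (it starts with ATG and ends with TGA) even though the gene lies on the minus strand.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character groups."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Only the first three groups of the gene sequence, for illustration.
prefix = clean_sequence("ATGGGCCTTT ACGGATCCGC TCCGGAAGCA")
print(len(prefix), gc_fraction(prefix))
```

Applying `clean_sequence` and `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field of the record, and `ends_with_stop` can be used to confirm that the listed CDS is complete.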