Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4997 |
Symbol | |
ID | 5318718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1511623 |
End bp | 1512624 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640776779 |
Product | KpsF/GutQ family protein |
Protein accession | YP_001313711 |
Protein GI | 150377115 |
COG category | [M] Cell wall/membrane/envelope biogenesis [T] Signal transduction mechanisms |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation [COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0692534 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCGA AAGCGGATGG AAATGCCGGT GTCACCGTGC TGGAGTCGAT CGGCAGGACG TTGGCGACGG CGACGAACGG CATCAGGGCG CTTGCCGACC ATCTGTCGAG CGACGAAACC TTCGCGGACG CCCTTGTCAA TGCGGTCGAA CTGATGGGTG ATGGAGATGG CCGCGTCGTC GTTTCGGGTG TCGGCAAGAG CGGTCACATC GGCCGCAAGA TCGCAGCCAC GCTCGCATCC ACCGGCACCT CGGCCTATTT CGTCCATCCG ACCGAGGCGA GCCATGGCGA TCTCGGCATG ATCACCGCGC AGGATGCATT GGTCCTGCTT TCCTGGTCGG GCGAGACGGC GGAACTCGCC AACATGCTGA CCTATGCCAA GCGTTTCAAG GTGCCGATCA TTTCGATCTG TTCCAACCGC GAGAGCACGC TTGCGCGCAA CTCCGAAGTC GCGCTCGTGC TGCCGAAGGT GCCGGAGGCT TGTCCGCACG GTCTGGCGCC GACGACCTCG GCAATGCTTC AGCTCGCCAT CGGCGATGCA CTGGCAATCG CGCTGCTGGA GCGGCGCGGC TTCTCTGCCG AGGACTTCAA GACCTTCCAT CCGGGCGGCA AGCTGGGCGC GCAGCTGCGC CTCGTCCATG AGCTGGCGCA TGGCGCCGGG CAGATGCCGT TGCTCCCTGT CGGTCGCCCG ATGAGCGAGG CGGTCATCGA GATGTCGGCC AAGGGCTTCG GCGTCGTCGG CATCGTCGAT GAAAGCGGAA AGCTGGTCGG CGTCATCACC GACGGCGATA TGCGCCGCCA CATGACGGCG GACCTCCTGG CGCAACCGGT CGAGGCCATA ATGTCGCACA ACCCGCGTGT CCTCAGCCGC GACGTGCTGG CCAGTGCGGC CATGGAGTTT ATGGAAGAAC ACAAGATCAC CGTGCTCTTC CTCGTCGGCG ATGCGGGCGC ACCGGTCGGC ATCCTGCATA TTCACGATCT GCTGCGCGCC GGAGTCGCCT GA
|
Protein sequence | MQAKADGNAG VTVLESIGRT LATATNGIRA LADHLSSDET FADALVNAVE LMGDGDGRVV VSGVGKSGHI GRKIAATLAS TGTSAYFVHP TEASHGDLGM ITAQDALVLL SWSGETAELA NMLTYAKRFK VPIISICSNR ESTLARNSEV ALVLPKVPEA CPHGLAPTTS AMLQLAIGDA LAIALLERRG FSAEDFKTFH PGGKLGAQLR LVHELAHGAG QMPLLPVGRP MSEAVIEMSA KGFGVVGIVD ESGKLVGVIT DGDMRRHMTA DLLAQPVEAI MSHNPRVLSR DVLASAAMEF MEEHKITVLF LVGDAGAPVG ILHIHDLLRA GVA
|
| |