Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4468 |
Symbol | |
ID | 5318170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 949400 |
End bp | 950611 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776269 |
Product | putative transcriptional regulator protein |
Protein accession | YP_001313201 |
Protein GI | 150376605 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0284418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCTGA CCAAGTCCAG CACCGAACTC GTGCGTCAGC AGAACAGCGC GCTGGTGCTC GCGGCGCTTC GGCGGAAGGG CTCGCTTTCC CACACGGAGA TTGCGGCACA GACCGGGCTT GCCTCGGCCA CGATCTCCGT TATCACGGCC GAGCTGGAGC GCGCCGGGAT CATCGCCAAA ATCGAGAGGC ATGTGCAGGG CGGTCGCGGC CGGCCGCGGG TTCTCTTCGA GCCGAGACGC GATTGCGGTC ACGTGATCGT CGTGCGGATC TCTTCCGACG TGGTGCAATA CTCGCTTGCT GATTATGGCG GCATTCTGCT CGATCGTTTC GAGGAGGTGC GCGGTGACCT CGGCGGGACA GCTGCCTTCG GGGAGGTCCT TGCAGCGGCA CTTGAACGGT TGCTCCTCCG CTCGAATATA TCGAAGGACG AGGTGCTTGC CATCTCGATC AGCAGCAAGG GGCTCGTGGC CGCCGATGGC GCGCGACTTA TCTGGTCCCC CGTCTTCGGC GGCGAGCAAC TCGATTTCGT GAAGCTGCTG CGTCCTGATT GGCGCGCCAG GATCATGCTC AGCAATGAGT CGCTCCTCGT GGCGCACGCG CTTGCCGTAA AGGAAGAGGA AAAACAGGTC GGTTTCCGCG CGCTCGCCGC CGTGTCGCTC GGACACAGCA TCGGCCTCGG CCTTGCCCGA AGTGGCAGCT CCGGTGAGCT CGATGTCTCG GCTCCGAACT TCGGCCATAT GCTGCATCAG GATGCCGCCG GGCTCTGCCG CTGCGGCAGT TTTGGCTGCG TCGAGGCCGC CGCCGGTTTC TACGGGATTC TGCGCACGGC CTTCGAGGTG CCCTCCAATA CCATCCCAGC CAAGTTCGTG CCGCTTTCGG AGATGGACAA GATCGCCGCG CGTGCACGGC AAGGACATCG GATGGCCGCC TATGCCTTCC GACAGGCCGG CGTTGCGCTC GGCAACGGAA TTTCCCGGAT GCTCAGTCTC TACGAGCCGA TGCCGATCTT CGTGACTGGA CCGGGCACGC GCTATTTCGA TCTCCTGCAG AAGGGACTGG AGGAGGGCAT GGCACATTCG CTGCAGGTGC GCCTTGAGGG CATGCCGCAG ATATCGGTGG TTATTGACGA ACAACGGCTC GTTCTGGATG GTCATCTCGA CCGGGCGCTA GGGGCAATCG ACGGCGACAT TGCCGCCTCG GGCCACCCAT GA
|
Protein sequence | MMLTKSSTEL VRQQNSALVL AALRRKGSLS HTEIAAQTGL ASATISVITA ELERAGIIAK IERHVQGGRG RPRVLFEPRR DCGHVIVVRI SSDVVQYSLA DYGGILLDRF EEVRGDLGGT AAFGEVLAAA LERLLLRSNI SKDEVLAISI SSKGLVAADG ARLIWSPVFG GEQLDFVKLL RPDWRARIML SNESLLVAHA LAVKEEEKQV GFRALAAVSL GHSIGLGLAR SGSSGELDVS APNFGHMLHQ DAAGLCRCGS FGCVEAAAGF YGILRTAFEV PSNTIPAKFV PLSEMDKIAA RARQGHRMAA YAFRQAGVAL GNGISRMLSL YEPMPIFVTG PGTRYFDLLQ KGLEEGMAHS LQVRLEGMPQ ISVVIDEQRL VLDGHLDRAL GAIDGDIAAS GHP
|
| |