Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3666 |
Symbol | |
ID | 5318063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 105757 |
End bp | 106824 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640775479 |
Product | hypothetical protein |
Protein accession | YP_001312412 |
Protein GI | 150375816 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00707046 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGGCAAAT CCGTGAACGA AAAGCAAAGC GCTCCACTCA TTCACGGAGC CAGCGATTTG CCGTCAGTGA CGGTCGACGG CTACAATCTG GAACTGCGTG ACGGCGACGG CTTTCTCGGC GACAAGGCCA ACAAATCCGC GTTCCAGGAA AAACTCGACG ACTGGCGCAA ACGGGTGCGC AAAGGCGGCG ACGACCCGCT GGGAGAGGCG CTCACCCAGG ACCTGTCCAA GAAACAGATC GACGCTCTTC TTCGTGGTGA CGACAAGCAG GCGGCCGCCC TCATCATGGG AGCGATGGAC GAATTCGCCG GCGAGCTGGC CGGCGTTCTT GAGAAATTCC TGCAGCAGAA GAGCTGGAAG AACTCGGAAC GTGTCGTGAT CGGAGGCGGC TTCAGAGGAA GTGCCGTAGG TGAGTTCGCG ATTGCACGGG CGATGGTGCT GATGAAGGCA AAGGGTATCA AGATCGAGCT CTCTCCCATC GTTCATCATC CGGATGATGC CGGGCTCATC GGTGCCGCAC ATCTCATGCC AGCCTGGATG CTCAAGGGAC ACAAGACCAT CCTCGCTATC GACATCGGCG GTACCAATAT CCGCGTAGGC ATCGTCGAGC TGCGTCTGAA AGATGATACG GACCTTTCAA GGGCCAAAGT CTGGAAATCG GACATCTGGC GGCATGCGGA CGACAAACCC AACAGAAGCG CCACGATCGA AGCACTCATC GGGATGATCG AAAAGCTCAT AGCCAAGGCG GACAAGGCGG ATCTTGCACC GGCGCCGGTC ATCGGCGTTG CCTGCCCCGG TGTAATCAAT GCGGATGGCT CGATCCTGCG CGGAGGCCAG AACCTGCCCG GCGGGAACTG GGAAAGCGAG CATTTCAACC TGCCTGCCAC GCTCAAGGAC GCCATTCCGC AGATCGGCGA TCATGAGACC TTCGTAATCA TGCACAACGA CGCCGTCGTC CAGGGCCTGT CGCAAGTACC ATTCGTGCAG AATGCTTCGA GCTGGGGTAT CCTGACGATC GGGACCGGTC TCGGCAATGC GCACTTCAGC AACAAAGCCG GAAATTGA
|
Protein sequence | MGKSVNEKQS APLIHGASDL PSVTVDGYNL ELRDGDGFLG DKANKSAFQE KLDDWRKRVR KGGDDPLGEA LTQDLSKKQI DALLRGDDKQ AAALIMGAMD EFAGELAGVL EKFLQQKSWK NSERVVIGGG FRGSAVGEFA IARAMVLMKA KGIKIELSPI VHHPDDAGLI GAAHLMPAWM LKGHKTILAI DIGGTNIRVG IVELRLKDDT DLSRAKVWKS DIWRHADDKP NRSATIEALI GMIEKLIAKA DKADLAPAPV IGVACPGVIN ADGSILRGGQ NLPGGNWESE HFNLPATLKD AIPQIGDHET FVIMHNDAVV QGLSQVPFVQ NASSWGILTI GTGLGNAHFS NKAGN
|
| |