Gene Information |
Locus tag | Smed_4575 |
Symbol | |
ID | 5318022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1067657 |
End bp | 1068898 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776376 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001313308 |
Protein GI | 150376712 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
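The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Smed_4575.
length_bp = gene_length(1067657, 1068898)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1242 413
```

Both results agree with the Gene Length (1242 bp) and Protein Length (413 aa) fields of the record.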
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.161634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.819576 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTATCGT CTGCAGCAGT TGATCTTCGC CAATCAGAAG CGCCCGGCGT AAGCGTTGGC GGCCATCAGG CGGTCGTGTG CTTTCCCTTC ATCGGGGACC TCGTTGGCGG AAGCCACATG TCCTCGCTTG GTCTGATACG AAATCTCCCC CGGGATCGCT TTGTGCCATT GGTCGTGCTG CACCATACCG ATGGACCGGT CGCGGAATTG TTCCGCCGGG AAGGCATCGG CTTCGTCGAG GCTCCCGTTT TGAACCGCCT TGAGCGGGCC GCTCCGCGCA ACGGCGCCGC AGTGGTCAAT GTCGTGCGCA CGGTTCCGGG GCTTGTAAAG TTTCTGCGGG CAAGAAATGC CTCGATCGTA CACACGAATG ATGGCCGCAC GCATCTTATC TGGGGTCTTG CTGCGCGGAT TGCCGGGTCG AAACACCTTT GGCACCATCG GGGCGACGCT ACTTCGTTCG GTCTGCGCCA TGTGGCCCCC TGGCTGCCCA ACCGGCTGGT TGCCGTGTCG AAATTCGCAT CGCCCCGGCC GGGATTTTTC TCGGCCGCGG GCAAATGCAG CGTCGTTCAC AGCCCCTTCG ATGTCATGAA AATGGACGGG TTCGATCGAG TGGAAGCTCG CAACAACGTG CTGGCCGCGA TCGGATGTTC ACCCGACACC AAGCTCCTCG GCTATGTCGG AACGCTGGTT GCGCGCAAAC GGCCGATCCT CTTCGTCGAA GCGATAGCCG CCTTGAAGCG GCTATCGCCT GAGACCAAAG TTGCGGGCCT GTTTTTCGGC GATGCGCTCA ATGGCCTGGA CGAAGCGGCG AGATCCCGCG CCGAGGCACT GGGGGTTGCC GATTGCATTC ACTTCATGGG CTTCCGTTAT CCGGGCGAAG CCTGGATCGC CGGACTGGAT GCCCTGCTGG TAACGGCTGT GAATGAACCT CTCGGAAGAA CACTTGTCGA AGCGATGTTG CTCCGCACAC CGGTTATCGC GGCCGACTCC GGTGGGAATC CGGAGGTGGT CGAAGATGGC AGAACCGGAA TGCTCGTTCC CGCAGACGAC CCGGATGAAT TCGCCAAGGC TTGCCTGGCA CTGTTCAACA ATTCCGGACT TTGCGACCAC CTCGTGGAAA CGGCGCGTGG CGAGGTCCGC TCCCGTTTCA GTTTCGAACG CCACGTGCAC GCAATTACAT CGGTCTACGA AAACCTGATC GGTGGAACCG GAATGCGGCG CTCTGCAAGT GCAGGCCCGT GA
|
Protein sequence | MVSSAAVDLR QSEAPGVSVG GHQAVVCFPF IGDLVGGSHM SSLGLIRNLP RDRFVPLVVL HHTDGPVAEL FRREGIGFVE APVLNRLERA APRNGAAVVN VVRTVPGLVK FLRARNASIV HTNDGRTHLI WGLAARIAGS KHLWHHRGDA TSFGLRHVAP WLPNRLVAVS KFASPRPGFF SAAGKCSVVH SPFDVMKMDG FDRVEARNNV LAAIGCSPDT KLLGYVGTLV ARKRPILFVE AIAALKRLSP ETKVAGLFFG DALNGLDEAA RSRAEALGVA DCIHFMGFRY PGEAWIAGLD ALLVTAVNEP LGRTLVEAML LRTPVIAADS GGNPEVVEDG RTGMLVPADD PDEFAKACLA LFNNSGLCDH LVETARGEVR SRFSFERHVH AITSVYENLI GGTGMRRSAS AGP
|
| |
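The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids in NCBI order (T, C, A, G at each codon position); "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence from the record.
first_bases = "ATGGTATCGT CTGCAGCAGT TGATCTTCGC"
last_bases = "GCAGGCCCGT GA"

print(translate(first_bases))  # MVSSAAVDLR
print(translate(last_bases))   # AGP
```

The two outputs match the first ten and last three residues of the listed protein sequence. The same function can be applied to the full gene sequence to check all 413 residues.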