Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4945 |
Symbol | |
ID | 5318272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1457081 |
End bp | 1458349 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776728 |
Product | polysaccharide export protein |
Protein accession | YP_001313660 |
Protein GI | 150377064 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.330229 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATCGA ACAGACTACC AGGGGCATCC GCTGGTTCCC GCATGGCTTC CTGTTTAACC CGCCTGGTGC TCATGGCGGC TCTCGCCGCT TCCGCGACAT CGCTGGCGCG CGCCGACGAA TATCGGCTCG GCGTTATGGA CAAACTCCGG GTTCGCGTCG CCGAGTGGCA GACCGCCGAA GGTGCGGTCC GCGATTGGTC CGCCGTCAGC GGCGATTATA CGGTTGGTGC ATCGGGAAGC ATTTCCCTGC CTTTCGTCGG CGAACTGCCT GCCTCAGGCA AGACGACAGC CGAAGTGGCC GAAGAGATCG GCGTAAAGAT GCAGAAGCTC TTCGGTCTCA GGGATCGGCC CTCCGCATCG GTGGAGATGG CGCAATATCG ACCGGTCTAT CTCTCCGGAG AGGTTCAGAC GCCTGGCGAA TATCCTTACG CTCCCAACAT GACGGTCCTG AAGGCCGTCA GTCTCGGGGG AGGGCTGCGG CGGGCCGACA ACGGCCAGCG GTTTGCACGC GACTACATCA ATGCGAGCGG GGAATCGGCT GTACAGGTTG CGGAACGCAG CCGTCTCCTC ATTCGCCGAG CGCGCCTGCA GGCCGAAATC GGCAAGCGTG ACTCGATCGG GATGCCCGAG GAGCTCAAGA ATGTACCGGG TGCCGACGAA CTGCTCGCCA GCGAAACCGC CCTTATGGAA TCGCGCGACA AGCGCCAGAA GCGTCAGCTC GATGCTCTCG CCGATCTGAG GTCCCTTCTT CAAAGCGAGA TCGAAGCGCT CGCAAAGAAG GCTGAAACGC AGGCGCGCCA ATTGGAGCTT GCCACCGAGG ATCGCGACAA GGTCGACAGC CTTGCCGAGA AAGGGCTTGC GCTGAGTCAG CGCAAGCTTT CCCTCGAGCA ACGGGTGGCG GACGTGCAGG CCTCCCTGCT GGATATCGAC ACCGCATCGC TGAAAGCGAA ACAGGATGCG AGCAAGGCGG CGCAGGACGA AACGAATCTG CGCAACGACT GGGATGCGCA ACTTGCGCAG GAACTGCAGA ACACCGAAGC CGAACTCGAT ACGCTGGCCT TGAAGCTCGG CACCAGCCGG GATCTCATGA CAGAGGCGCT GCTGCAGTCG GCGGACGCGG CCCAGCTCGA ACAGCAGGCA GCCCAGATCA CCTATTCCAT CATTCGCGAC AAGGATGGGA AGCCGACCGA GATCGCGGCC GATGAGAACA CCCCAGTGCT GCCTGGGGAC GTCATCAAAG TGAATACGGC ATTGGCGGCG ATGCGGTGA
|
Protein sequence | MQSNRLPGAS AGSRMASCLT RLVLMAALAA SATSLARADE YRLGVMDKLR VRVAEWQTAE GAVRDWSAVS GDYTVGASGS ISLPFVGELP ASGKTTAEVA EEIGVKMQKL FGLRDRPSAS VEMAQYRPVY LSGEVQTPGE YPYAPNMTVL KAVSLGGGLR RADNGQRFAR DYINASGESA VQVAERSRLL IRRARLQAEI GKRDSIGMPE ELKNVPGADE LLASETALME SRDKRQKRQL DALADLRSLL QSEIEALAKK AETQARQLEL ATEDRDKVDS LAEKGLALSQ RKLSLEQRVA DVQASLLDID TASLKAKQDA SKAAQDETNL RNDWDAQLAQ ELQNTEAELD TLALKLGTSR DLMTEALLQS ADAAQLEQQA AQITYSIIRD KDGKPTEIAA DENTPVLPGD VIKVNTALAA MR
|
| |