Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4993 |
Symbol | |
ID | 5318714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1507504 |
End bp | 1508805 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640776775 |
Product | capsule polysaccharide biosynthesis protein |
Protein accession | YP_001313707 |
Protein GI | 150377111 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3563] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0145163 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTC ACGCCGGCGA GGGCTCACGA GCGACGGATT TCGGTAGCGG CGCTGCGCCT CAGGGGCGCC GAACGACCTT TGCCGTCCAG ATCACCGAGT GGAAGCGGGA GCCTTTCGAA CGATATTTCC CCGACAGGCA CTTCCACTTC CTTCCGATGA ATCTCGGCGA ACACGAGTTC GAGCGCGTCT GGAAGCCGCG AATTCTCGCC GAGAGCAGCG CCGAGATACT CGCCTGGGGG CCGGAGCTAC CAGGCCCGCT GGATGCGCTG GCCAAAGCGC GGAACATTCC TGTTACTTTC ATTGAGGACG GCTTCCTTCG TTCGGCCAGG CCGAGCGCCA GCCGCACGCC TCCCCTCTCT CTGGCCCTCG ACAGCAAGGC GATCTATTTC GACTGCCGGC ATCCCTCGCA GCTCGAGGAG CTGTTGGGAA CCTACGATTT CGAGGCGGAT GCGGAATTGA TGACGCGCGC GCGCGCCGGC ATCGCGCTTC TCACCGAAAG CGGCATCAGC AAATATAATG GCGGCCGGCA GCGGACGGCC GAAGAGGTTT ACGGCGAGAA GACGCGCAAG CGTGTTCTGG TTGTCGGCCA GGTGGAGGAC GATGCTTCCG TCCGCTACGG CTGCCTCAGC CGGATGACCA ACAACGACCT GGTGCGGCTC GCGGCCTCAG AGCAGCCGGA CGCCCAGATC CTCTACAAGC CGCATCCGGA CGTGCTGAGC CGCGTGCGGC CGGCAAGGTC CGATCCCGCG GAAGTCGCGC ACCTTTGCAC CCTGGTCACC GAGAGCCTGC CGCTTGCGGA GGCGCTGCGC ACCGTCGATC ACGTCTATAC GATCACCTCG CTCGCGGGTT TCGAGGCGCT TATCCGCGGC ATCGAGGTCA GCACCGCCGG CTGTCCCTTC TATTCCGGAT GGGGGCTGAC GGACGACCGC CAGCCTAACC CGCGCCGCGG GCGGCGCTTG AGCATCGAGG CGCTTTTCGC GGGCGCCTAT CTGCTTTATC CTTGCTATTT CGACCCCGAG ACCGGAGAAC GATCATCATT CGAAGCAACC GTCGCGACAC TCAGAAGTCA GCTGGAAGAG CCGCAGGGCC TTGCCCGGCC GCGGCCGGCC TGGCGCGCCT GGGGGCCCTA TGGCCTCCTC GGCTGGCGCC ACCTGCTGCC GCCCTTCGTC ACGCCCGTGA TCCGCAAGAT AGGCAGTGAC CGCGATGTCG AAGACTTCAG GGCCGATCCG ATCCGCTTCT TCCGCACCCT CTCCGACCGG AAGTTCCGCG TCATCGGCCG TATTCTCTAT CCGTTCGGGT GA
|
Protein sequence | MKPHAGEGSR ATDFGSGAAP QGRRTTFAVQ ITEWKREPFE RYFPDRHFHF LPMNLGEHEF ERVWKPRILA ESSAEILAWG PELPGPLDAL AKARNIPVTF IEDGFLRSAR PSASRTPPLS LALDSKAIYF DCRHPSQLEE LLGTYDFEAD AELMTRARAG IALLTESGIS KYNGGRQRTA EEVYGEKTRK RVLVVGQVED DASVRYGCLS RMTNNDLVRL AASEQPDAQI LYKPHPDVLS RVRPARSDPA EVAHLCTLVT ESLPLAEALR TVDHVYTITS LAGFEALIRG IEVSTAGCPF YSGWGLTDDR QPNPRRGRRL SIEALFAGAY LLYPCYFDPE TGERSSFEAT VATLRSQLEE PQGLARPRPA WRAWGPYGLL GWRHLLPPFV TPVIRKIGSD RDVEDFRADP IRFFRTLSDR KFRVIGRILY PFG
|
| |