Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2628 |
Symbol | |
ID | 5323497 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2728907 |
End bp | 2729977 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640791572 |
Product | putative signal peptide protein |
Protein accession | YP_001328293 |
Protein GI | 150397826 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGGCA GTATCGCTAC ATCTGCTGTC CTCCACGCCC TGGTTCTGAC CTGGGCGCTG GTGTCGCTTG GCAGCCCGGC CGATTTCGAG GTCGCGGATG TCGAGGCCTT GCCGGTCGAC ATCGTACCTG TCGAGTCGAT TACCCAGATA CAGCAGGGCG ACAAGAAGGC TCCGGCCAGG GAAAAGGCTT CGCCGGTTCC GACCAAGAAG CCGACGCCGG TCGAAAACGC CGAGAATGTC GGCGAGAACG ACGTCGACCT GAAAACACCG CCCACGCCCA ATGCCAAGCC GGTCGACAAC GAGTCGGCCG CAGCACCGCA GAAAACCGAA AAGGCTCCGC CGACGCCCGA TCCCGTGAAG GAAGAGATCG AGAAGGTCGA AGAGACGAAG CCTGCGGCAG AGCCCGCGAC GGAAGTGGCG GCACTGCCCG AACCCAAGCA GGAAGTGAAG CCGGACACGA AGCCGGAGCC CGCGCCGGCG GAAGAGCAGC CTACGGAAAA TCCCGAGGCT GAGGCTCTGC CGGACAAGGT GCCGACGCCG CAGGTAAAGC CGAAGGTCGA AAAGCCGGCG CAGACGGCCA AGACCCCCGA GCGCAAGAAG GATGAAGTCC AGAAGGAGCA GAAGAAGGCG TCCTCGCAGA AGGAGAGCGA CTTCAACGCC GACGAAATCG CGGCGCTGCT CAACAAGCAG GAGTCTTCGG GCGGTGGCGC CAAGCGCTCG ACCGAGGAAG CGGCTCTCGG TGGCAAGAAG ACGACGACGG GAAACACCCT TTCGCAAAGC GAAATGGATG CACTTCGCGG CCAGATCCAA AACAACTGGT CGATCATCCC CGGCATGGCC GACGCGGCGG ATGTCCGCAT CAAGGTGAGG ATGCGGCTCG ACCCGAACGG GGAGCTGATC GGCGATCCGG AGGTGGAGGC CAGCGGCGGT TCGGATTCGG CGCGGCGGGC GCTCATGGGT GGTGCGCGTC GCGCCATCCT GAAATCGGCC CCCTTCAAGG GACTGCCGGC TGAAAAATAT GATTCGTGGA GCGAGGTCGT CGTCAACTTT GACCCGAGCT CGATGCTCTA G
|
Protein sequence | MKGSIATSAV LHALVLTWAL VSLGSPADFE VADVEALPVD IVPVESITQI QQGDKKAPAR EKASPVPTKK PTPVENAENV GENDVDLKTP PTPNAKPVDN ESAAAPQKTE KAPPTPDPVK EEIEKVEETK PAAEPATEVA ALPEPKQEVK PDTKPEPAPA EEQPTENPEA EALPDKVPTP QVKPKVEKPA QTAKTPERKK DEVQKEQKKA SSQKESDFNA DEIAALLNKQ ESSGGGAKRS TEEAALGGKK TTTGNTLSQS EMDALRGQIQ NNWSIIPGMA DAADVRIKVR MRLDPNGELI GDPEVEASGG SDSARRALMG GARRAILKSA PFKGLPAEKY DSWSEVVVNF DPSSML
|
| |