Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2666 |
Symbol | |
ID | 5323535 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2771514 |
End bp | 2772659 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640791610 |
Product | Sel1 domain-containing protein |
Protein accession | YP_001328331 |
Protein GI | 150397864 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0163222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.379262 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAATCC GCTCCCTCCT GAATTCCAGC CGGCCCGTAG CCTTTGCCGC AGCGATGCTT TCAGCAGTCG CGCCGGCACT GGCACAGCAA CCGATACCGC CAGCGCAGAC CGACGAGGGC AGTGCGCAGA AGCGCGGGCG GATCACGCCT TTCAACGGCG CGGCACTACC GGAAGATGGC GAGCGAAAGC CGGCAGCGGC CGGGCCGGAA AAGCCGAAGC CCAGCGATGG CAGTGCGCCT TCGAAGGGCG TCAACGTGAT CGATCGCATG GGAGCAGAAT TGCCTGATCT TCCTGAGGAG AAACCCTTCA CAGGCAAGAT CGACGAAGCT TATGGTGCCT TCCAGCGGGG CTATTACCTC ACGGCGATGG ACCTTGCCTT GCCGCGGGCC CAGCTCGGCG ATCCAGCCGC CCAGACGCTC GTCGCGGGAA TCCTCGAGCA GGGACTCGGT GTTGCGCGCG ACGCCAAGGC CGCCGCCTTC TGGTACGGCC AGGCGGCCAC CAATGGCGAT CCGGCGGCCA TGTTCAAATA TGCGCTGATC CTGATGGAAG GCCGCCACGT CAAGCGCGAC CGGAAGAAAG CAGACGAGCT CATGAAAAAG GCTGCCGATC TCGGCAATGC CTCGGCTCAG TTCAACTACG GACAAACGCT GGTGGCCGAC ATGCCCGGCG AGCGCGGCCT GAAGGCCGCC ATGCCCTATT ACGAGAAATC GGCCGAACAA GGCATTGCGG ACGCGCAATA TGCGCTGTCC CAGATCTACG TCAATGTCGA CGGCGTCGAG GACGACAAAC GCGCCCGCGC CCGCGAGTGG CTGCTCAGGG CGGCGCGCGC GGGCTATGAT ACGGCGCAGC TCGACATTGC GATCTGGCTG ATAGAAGGGA TCGCCGGCGA CCGCAACCTC GAAGAGGGCT TTGCCTGGAT GAAGCGCGCC GCCGAAAGCG GCAACGTCGT CGCCCAGAAC CGACTCTCCC ACCTCTATGT GAATGCCATA GGTACGCGTC CGGACCCCGT CGAAGCGGCA AAATGGTACG TCCTGTCGCG CCGGGCCGGC CTCAAAGATG ACGCGCTCGA GGATTTCTAT CTCGGCCTCA ACGAAACGCA GCAGAAGTCG GCGCTCGCGG CGGCCAACAA ATACCGCTCG TCCTGA
|
Protein sequence | MVIRSLLNSS RPVAFAAAML SAVAPALAQQ PIPPAQTDEG SAQKRGRITP FNGAALPEDG ERKPAAAGPE KPKPSDGSAP SKGVNVIDRM GAELPDLPEE KPFTGKIDEA YGAFQRGYYL TAMDLALPRA QLGDPAAQTL VAGILEQGLG VARDAKAAAF WYGQAATNGD PAAMFKYALI LMEGRHVKRD RKKADELMKK AADLGNASAQ FNYGQTLVAD MPGERGLKAA MPYYEKSAEQ GIADAQYALS QIYVNVDGVE DDKRARAREW LLRAARAGYD TAQLDIAIWL IEGIAGDRNL EEGFAWMKRA AESGNVVAQN RLSHLYVNAI GTRPDPVEAA KWYVLSRRAG LKDDALEDFY LGLNETQQKS ALAAANKYRS S
|
| |