Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3799 |
Symbol | |
ID | 5318097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 249410 |
End bp | 250405 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640775612 |
Product | hypothetical protein |
Protein accession | YP_001312545 |
Protein GI | 150375949 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.314785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATTT CTCTGCCCGC ATTCGGAAGA ACGAAGGAAA TCCTACGTCG GGCCGCCGGC GCTTCAGACG GTGCCTACGT TTCCGTCGAC AATCTCGTGG CGCTCGAATC GATGGCCGCC GACCTGACAT TCCTGCGCAA GGCCCCCGTC CGCCGCTTCC TTGCCGGGCG CCATGAATCG CGCATGCGCG GCCGTGGGCT GAGCTTCGAG GAGCTTCGGA CCTACATGCC GGGCGACGAT ATCCGCACGA TCGACTGGCG CGTCACCGCC CGAACGGGTC AACCTTTCGT GCGCGTCTAT AACGAAGAGA AGGACCGACC CGCCCTCGTC ATCGTCGACC AGCGCACCAA TATGTTCTTC GGCAGCCGCC GGTCGATGAA ATCGGTGGCC GCCGCTGAAG CGGCGGCGCT CTGCGCCTGG CGCGTTATGG CGCTTGGCGA TCGCGTCGGC GGCGTGGTCT TCAACGACCT GAAGCAGGAG TCCATCCGGC CGCACCGCAG CCGCGGATCC GTCATCCGCT TCGCCGAAAC GATTTCGCTC CAGAACAAGG CACTTAGCGC GGGCTCGGAT ATCGAGAGGG CGCCCGGTCA ACTCAATGCC GTCCTGGGCA ACGTCGCAGC CGTCGCGCAG CACGACCATC TGATTATCGT CATCAGCGAT TTCGATGGGC ACGGGCCGGA GACACGCGAT CTTCTCTTGC GGATGTCCGT CTCAAACGAC GTCATTGCCA TCCTCATCTA CGATCCATTC CTCCTGGACC TGCCGCGCCA GGGCGACATG GTGGTGAGCG GCGGCGCTCT GCAGGCCGAA CTGCAGTTCG GCCGTAGCAA TGTTCGCGAT GCGGTCGACA GCTTTGCGCG CAACAGAGGC CGAGAGCTGC TTTCTTGGCA GGAGGAGATG GGCCTGCCCA TGCTGCCCGT TTCCGCTGCC GAGGAGGTCG CCCCGCAATT GCGCACGCTT CTCGCTCAAC TCGCCTGGCG GCAAAGGAGG CGATAG
|
Protein sequence | MAISLPAFGR TKEILRRAAG ASDGAYVSVD NLVALESMAA DLTFLRKAPV RRFLAGRHES RMRGRGLSFE ELRTYMPGDD IRTIDWRVTA RTGQPFVRVY NEEKDRPALV IVDQRTNMFF GSRRSMKSVA AAEAAALCAW RVMALGDRVG GVVFNDLKQE SIRPHRSRGS VIRFAETISL QNKALSAGSD IERAPGQLNA VLGNVAAVAQ HDHLIIVISD FDGHGPETRD LLLRMSVSND VIAILIYDPF LLDLPRQGDM VVSGGALQAE LQFGRSNVRD AVDSFARNRG RELLSWQEEM GLPMLPVSAA EEVAPQLRTL LAQLAWRQRR R
|
| |