Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3761 |
Symbol | |
ID | 5319053 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 204582 |
End bp | 205883 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640775574 |
Product | hypothetical protein |
Protein accession | YP_001312507 |
Protein GI | 150375911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0660829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTCG GCGCATCCCA ACGTTTGAAT GACGGCGTTG CGGCCGTGGA AACCATGACG CGTTTCGCGC TTGCAGTATT GGCGCTTGCC TCGGGCGTCT ACACCTATCT CGGCGTCCGC AGTATCCTTG ATGGTTCGCC GACGGCAGTG TTCTTCGCAG CGATCATCTA CTCGGCCTCG GTCTCCGTCG GCATCTACGC CTTCTGGTCC TACATGGCCC GCTTCTATCC GCATGTGACC ACCCATGCCG GCCGGGTCGC CATGCTGGGC GTGATGGCCC TCGGTGCTGC CATGATCATC GCCATGTCCA GCTGGCTCAA CGCGGCGGCG CTGGCGGGGT CGGCCGCGCT CGAACAGCAC CTCGCGGAGA CGGTGGAAGA CTATACTGCC GACCTCGACC AGGCGCATCA GAACGCACTT GCAGCCCAGA GCCTGCTGCC CGATATCCAA CGCGCATCAG AACGCTTTGC TCAGCTTGCC GCCTCGGAGC GGCAATCGGG CGCGCTCACC GGCACCACGG GTTCGGGAAG CGTGGTGCAG CTTCTGTCGC AGATGTCGGC CCAGATGAAG GATCTGGAAA ACGGCATCAA TGCTTCTCGG GAACAGGTCG CGATGCTGTT CAATCAGGGA CAGGAGCGGC TCGAGACCAT GCGGACGCTG GTATCCGCGC CTGGTGCGGT CACGCCACGT GCCGATCAGT TCTCCTCCGA AGTGGTGGCG CTTACGGGGG TGATAACATC GCTCGGACAG ACTTCGATCG CGCCTTCGAT CCGCCGCGCC GCAGACGACC TGTCCCTCGG CTTCATCGCG CCGGTGGCCG ATGGCGGCGA TGCCGATCTC GTCACCCGCC AGGACCAGGT GATGGAGACG GTGCGGGCTT CGGTGGCAGC GCAGTCCAAG GTTCTCTCGG ATGCGGCAGA CGAAATACTG GGTCGGATGC CGGTGGCGGA GCGACGCTTC GTTCCGCTTT CATCCGCCGA GGCGGTACTG CGCTACGCGG CCGATTTTAT TCCCGCCTGG GCCGGTGCCA TTTCCATTGA CCTGCTGCCG GGCGTACTGG TCTTCATCCT CGCGACCGTG CACGGGGCGA TCCGCAGGCA GGAGGAGAAA CTGCCCTTTG CCGAGCGCAT CACGGCCGCC GAGCTTCTGC AGGCCCTGGA GGTCCAGCGC GCGGTGATGG CGAATGGGGG CCAGAACGGC GAGGCAGGCG ATTCGGTGGA AATGGAGGGC GATGAGCCGA ACAACATCAC CAGCCTCGAC CCGAGGGTGC GCGTAAAGGA CCGGTCGCAT GAGGATCGAT GA
|
Protein sequence | MALGASQRLN DGVAAVETMT RFALAVLALA SGVYTYLGVR SILDGSPTAV FFAAIIYSAS VSVGIYAFWS YMARFYPHVT THAGRVAMLG VMALGAAMII AMSSWLNAAA LAGSAALEQH LAETVEDYTA DLDQAHQNAL AAQSLLPDIQ RASERFAQLA ASERQSGALT GTTGSGSVVQ LLSQMSAQMK DLENGINASR EQVAMLFNQG QERLETMRTL VSAPGAVTPR ADQFSSEVVA LTGVITSLGQ TSIAPSIRRA ADDLSLGFIA PVADGGDADL VTRQDQVMET VRASVAAQSK VLSDAADEIL GRMPVAERRF VPLSSAEAVL RYAADFIPAW AGAISIDLLP GVLVFILATV HGAIRRQEEK LPFAERITAA ELLQALEVQR AVMANGGQNG EAGDSVEMEG DEPNNITSLD PRVRVKDRSH EDR
|
| |