Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4880 |
Symbol | |
ID | 5318042 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1387380 |
End bp | 1388714 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776665 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_001313597 |
Protein GI | 150377001 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.729214 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCAATC GACGCAGCCG CGATTTTCTT GTCGCCTCAT CAGTTCCTGT GGTTCGCCTT CAGCGGATGA TCCTTTGCCT GGCAATTGCC GCTCTTTCCG GCTGCGGCGG GCATCCCAAA GGCGTGCTGA CGCCTGTCGC CGACAGCGCG CCCACAGCGA GCCGGGTCGA CATGCTGATC ACCACCACCC GCGGCCGTTC GGAGGTGGCC GGAGAGATGT TTACGGGAGA ACGAGCCCGC GCACCGGCCT TCGCGAACAT CACCGTCTCG ATCCCGCCTG TCCGCAAGGC CGGAGAGGTT GCCTGGCCGA AGAAATTGCC GTCCAATCCA GCCACCGATT TCGCGACTTT GAAGGCGGAC GACCTGACCA GGGATGGAGC CAAGGACTGG CTCAACACCA CAGTCCGGAA AAGCCCCGAC CGCAGTGTGC TCGTGTTCAT CCACGGCTTC AACAACCGCT TCGAGGACTC CGTCTACCGC TTCGCTCAGA TCGTCCATGA TTCCGGCGTC CACAGCGCCC CTGTCCTGGT GACATGGCCG TCGCGGGGCA GCCTGCTTGC CTATGGCTAC GACCGCGAAA GCACCAACTA CACCCGCAAC GCACTCGAAT CGCTTTTCCA GTATCTGGCC GCGGATAAAG AGGTGAAGGA GGTATCGATC CTCGCGCATT CCATGGGGAA CTGGCTCACG CTCGAGGCGC TTCGCCAGAT GGCCATCCGC AATGACGGCC TGCCGGAAAA ATTCAAGAAC GTGATGCTTG CGGCTCCGGA TGTCGATGTC GACGTCTTCC GTTCGCAAAT CGAGGACATG GGCAGGCAGC ATCCGCGGTT TACCCTGTTT GTATCCCGCG ACGACCGGGC GCTAGCCTTC TCTCGCAGGG TCTGGGGCGA CATTCCCCGG CTTGGTTCGA TCGACCCGGA GGCCGATCCC TACAAGCAGG AACTGGCGGA CAACGAGATC ACCGTTATCG ATCTGACCAA GGTGAAGGCC GGCGACGGCA TGCATCACGG CAAGTTCGCA GAATCCCCGG AAGTCGTCCG GCTCATCGGT GCACGCATCT CCGAAGGTCA GCCGTTGACC GACAGCCGGA TGGGCCTCGG GGACCATCTC ATTGCGGGAA CGACGGGAGC AGCCGCTGCG GCCGGCAGCG CGGCCGGCTT GATCCTTGCC GCTCCGGTCG CCGCCATCGA CCCGCACAGC AGAGACAATT ATGCGAACCA CGTCGGTGCG GCGATGGGAC AGTCGCATGG CAAGCAGCAG ATCGCGGTGA AAGACTGTTC GAAACAGGCG CGCGAGCGCG ATGCGGCGTC AACTTCACCG TGTCGAAGCT GGTGA
|
Protein sequence | MANRRSRDFL VASSVPVVRL QRMILCLAIA ALSGCGGHPK GVLTPVADSA PTASRVDMLI TTTRGRSEVA GEMFTGERAR APAFANITVS IPPVRKAGEV AWPKKLPSNP ATDFATLKAD DLTRDGAKDW LNTTVRKSPD RSVLVFIHGF NNRFEDSVYR FAQIVHDSGV HSAPVLVTWP SRGSLLAYGY DRESTNYTRN ALESLFQYLA ADKEVKEVSI LAHSMGNWLT LEALRQMAIR NDGLPEKFKN VMLAAPDVDV DVFRSQIEDM GRQHPRFTLF VSRDDRALAF SRRVWGDIPR LGSIDPEADP YKQELADNEI TVIDLTKVKA GDGMHHGKFA ESPEVVRLIG ARISEGQPLT DSRMGLGDHL IAGTTGAAAA AGSAAGLILA APVAAIDPHS RDNYANHVGA AMGQSHGKQQ IAVKDCSKQA RERDAASTSP CRSW
|
| |