Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3785 |
Symbol | |
ID | 5317953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 231748 |
End bp | 233016 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640775598 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_001312531 |
Protein GI | 150375935 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.658591 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGTAATG TAAACTCGTG CGGAGGCAGG CTCCGTGTCG CGAAAACCGG AAGCACGACC GTGTCGGCGC TTGTCGCCCT CGTTCTTCTG GCGGGCTGCG GAGGCCACGC GAAAGGCGTG ATGGCTCCTG TGGCTCTGGC GCAGCCGTCG GCCACCTCGC AGGTCGACAT GCTGGTTGCG ACGACCCGTG AACCCTCGGG AGACGCGGCG ACATTGTTCT CGGGAGAGCG CAGCCCGACA CTGTCTATGA CCGATGTTGC GGTTTCGATT CCGCCGGACT CGCGCCGCAA GCCCGGTACG GTGCAGTGGC CGCGGAAGCT TCCTCCGAAT CCGGAAACCG ACTTCGCTGT CACGCGGGTG CGCAAGCTGG CATCGAATGA CGAAGCGCGC GACTGGTTCC AGGTTCACAA TGAGGGCGGC CACGTGTTGC TCTTCGTGCA TGGTTTCAAC AACCGCTATG AGGATGCTGT TTTCCGGCTG GCGCAGATTG TTCACGATTC GGGCGCCCAG GCGACCCCGA TCCTGTTCAC CTGGCCATCG CGGGCACGGC TGTTCGACTA TAATTACGAC AAGGAGAGCA CCAATTACTC GCGCACTGCA CTCGAGGATA CGCTGCGCAC GCTGGCGTCC GCGCCGCGCG TCAAGGACAT CACCATCCTT GCCCATTCTA TGGGAACCTG GCTGACGATG GAGTCGCTGC GCCAGATGGG GATCCGCGAC GGCGGCATCG CGCCAAAGAT CGAGAACGTG ATTCTCGCTT CGCCCGACAT CGATCTCGAC GTCTTCGCCA AGCAATGGGT CGATATGGGC AAGGCACGCC CGAAGTTTAC GATCTTCGTC TCACAGGACG ACCGGGCGCT CGCGGTATCG CGGCTGATCT CCGGCGACGT GTCTCGACTC GGCGCGATCG ATCCGACCGC CGAGCCCTAC CGTACACAGT TGGAGACTGC CGGCATCACG GCGATCGATC TCACCAAGGT CCAGACGGAT GACGGCCTGC ATCATGGAAA ATTCGCCGAA AGCCCGGAGA TCGTGCAACT GATCGGGCAG CGGATCATCA AAGGGCAGAC GCTGACCGAT TCCGACATCT CGCTCGGCGA AGGCATCACT GCCGTCGTCG CAGGCACGGC CAAGAATGTC GGTACCGTTG CGGCGGCCAC GATCACCGCG CCGGTCACCA TCATCGAGCA GCGTGGAACG CCGCGCAAGA AGGTCAATCT GGAGGAGACG CTGACGAGCA GCGAGAATGC GGGAAACACG GCCCGTTGA
|
Protein sequence | MGNVNSCGGR LRVAKTGSTT VSALVALVLL AGCGGHAKGV MAPVALAQPS ATSQVDMLVA TTREPSGDAA TLFSGERSPT LSMTDVAVSI PPDSRRKPGT VQWPRKLPPN PETDFAVTRV RKLASNDEAR DWFQVHNEGG HVLLFVHGFN NRYEDAVFRL AQIVHDSGAQ ATPILFTWPS RARLFDYNYD KESTNYSRTA LEDTLRTLAS APRVKDITIL AHSMGTWLTM ESLRQMGIRD GGIAPKIENV ILASPDIDLD VFAKQWVDMG KARPKFTIFV SQDDRALAVS RLISGDVSRL GAIDPTAEPY RTQLETAGIT AIDLTKVQTD DGLHHGKFAE SPEIVQLIGQ RIIKGQTLTD SDISLGEGIT AVVAGTAKNV GTVAAATITA PVTIIEQRGT PRKKVNLEET LTSSENAGNT AR
|
| |