Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5871 |
Symbol | |
ID | 5320173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 833475 |
End bp | 834380 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640777566 |
Product | hypothetical protein |
Protein accession | YP_001314498 |
Protein GI | 150377903 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.789532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCCA CCAAATGCCA GGATCAGGTC GTCCTCGACC TGTGGCACCC GCTTGCAGCG CTCGCCGAGG TACCTGCGGG TTTCGTCCAG GAAACGGTGC TGCTGGAAGA ACGTGTCAGT TACGCGACGG ATGGCGGCGG CAACGCCGCA GTGTGGCTTT CGCGCCCGGA CCTGGTAGCC GGCTGTCCCT TTGAACCCGG CGCACTGACG AGGGCGCTGC CGGCCAAGGT CGCCTACGGC TACATCTGGA CATCGCTCGG CACTCCGCCG ACCGAACTGT TTTTCATTCC TGAATATGCC GAGCCCGACC GCCGGAGGCT CAACGCTGCC ACCGTCGGTG TGAACGTCTC TGCGCCACGC GCCATCGAGA ACTTCCTCGA CATGGGCCAC TTTCCCTATG TCCATACTGA CATTCTCGGG GCGGAACCGC ACACAGAGGT CAAGGAATAC GATGTCGAGC TCTCGGTCGA GCGCGACGAG ATTGTCGCGA CGCGCTGTCG CTTCTTTCAG CCCAAAGCCT CGACCGCTTC GACAGAAGGT GCCGACGTCG AATACATCTA CCGGGTGCCG CATCCCTACT GCTCGATCCT CTACAAATCG AGCCCGGTAG ACGAGACGCG GCTCGACGTC ATAGCCGTTT TCCTCCAGCC GATGGATCAG GAGCACATCC GCGCACATAT GATGCTCTGC GTCCTTGACG ATGAGAATGA AGACAAGGTG ATCAAGCGCT TCCAGCAGAC CATTTTCGGC CAGGACAAGC CGATCCTGGA GAACCAGTTC CCGAAGCGAC TGCCGCTCGA TCCGCGCGCC GAAACCCCGA TCCGAGCCGA CAAGTCGGCG ATCGCCTACC GCCGATGGCT CAGCCAGAAG GGCGTGACCT ACGGGGTCAT CCCGGCCGCA ACCTGA
|
Protein sequence | MTPTKCQDQV VLDLWHPLAA LAEVPAGFVQ ETVLLEERVS YATDGGGNAA VWLSRPDLVA GCPFEPGALT RALPAKVAYG YIWTSLGTPP TELFFIPEYA EPDRRRLNAA TVGVNVSAPR AIENFLDMGH FPYVHTDILG AEPHTEVKEY DVELSVERDE IVATRCRFFQ PKASTASTEG ADVEYIYRVP HPYCSILYKS SPVDETRLDV IAVFLQPMDQ EHIRAHMMLC VLDDENEDKV IKRFQQTIFG QDKPILENQF PKRLPLDPRA ETPIRADKSA IAYRRWLSQK GVTYGVIPAA T
|
| |