Gene Information |
Locus tag | Smed_0115 |
Symbol | |
ID | 5320944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 127699 |
End bp | 128709 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640789048 |
Product | 2-nitropropane dioxygenase NPD |
Protein accession | YP_001325810 |
Protein GI | 150395343 |
COG category | [R] General function prediction only |
COG ID | [COG2070] Dioxygenases related to 2-nitropropane dioxygenase |
TIGRFAM ID | |
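
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (both are assumptions, though they agree with the listed values). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(127699, 128709)
print(length)                  # 1011
print(protein_length(length))  # 336
```

Both results match the Gene Length (1011 bp) and Protein Length (336 aa) fields in the record.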
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.459319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTGC CGCCAATCCT GTCCGGAAAA GTGAGACTGC CTGTCGTAGG GGCGCCGCTT TTCATCGTCT CTCATCCCAA GTTGACGATC GCGCAATGCA AGGCCGGGAT TATCGGTGCC TTTCCGGCCC TCAACGCCCG CCCGCAATCG CAACTCGACG AATGGTTGTC GGAGATCACC GAAACGCTTG CGGCCCACGA CGCCGACTAT CCCGAACGCC CCGCCGCGCC TTTTGCGGTG AACCAGATCG TGCACAAGTC GAACGCCCGT CTCGAACAGG ACCTGATGCT TTGCGTCAAG TACAAGGTGC CCGTCGTCAT CTCCTCGCTC GGCGCCGTCC CTGAGGTGAA TGCCGCGATC CATTCCTATG GGGGTATCGT GCTCCACGAT GTCATCAACA ACCGTCATGC AAATTCCGCC ATCCGCAAGG GCGCCGACGG GCTCATCGCA GTCGCGGCAG GTGCCGGCGG CCATGCCGGA ACTCTCTCGC CCTTCGCGCT CGTCCAGGAG ATCCGCGCCT GGTTCGACGG GCCGCTGCTG CTTTCCGGCG CGATCGCCAC CGGCGACGCG ATCCTTGCAG CGCAGGCAAT GGGCGCCGAC ATGGCCTATA TCGGCTCTCC CTTCATCGCC ACCGAGGAGG CTCGCGCGAG CGACGCCTAC AAGCAGATGA TCGTCGACAG CAATGCCGCA GATATCGTTT ATTCCAATTT CTTCACCGGC ATTCACGGCA ACTATCTGAA GCCCTCGATC CTCGCCGCCG GAATGGACCC GTCCAATCTG CCCGAAGCAG ACCCCTCCAA GATGGACTTC GGCCAGGCCG CCGAGGGCGC CAAGGCATGG AAGGACATCT GGGGCTGCGG CCAGGGGATA GGCGCCGTGC ATGAAATCGG CACCGTCGCA CGCCTGGTCG ACCGGCTCGA ACAGGAATAC GACACCGCGC GCGCCCGCCT CGGCCTCGCC TCCGGCGCTG ACATCCGGCG GGGTAAGGAA TTTTCAACTT CTCGAAACTG A
|
Protein sequence | MSLPPILSGK VRLPVVGAPL FIVSHPKLTI AQCKAGIIGA FPALNARPQS QLDEWLSEIT ETLAAHDADY PERPAAPFAV NQIVHKSNAR LEQDLMLCVK YKVPVVISSL GAVPEVNAAI HSYGGIVLHD VINNRHANSA IRKGADGLIA VAAGAGGHAG TLSPFALVQE IRAWFDGPLL LSGAIATGDA ILAAQAMGAD MAYIGSPFIA TEEARASDAY KQMIVDSNAA DIVYSNFFTG IHGNYLKPSI LAAGMDPSNL PEADPSKMDF GQAAEGAKAW KDIWGCGQGI GAVHEIGTVA RLVDRLEQEY DTARARLGLA SGADIRRGKE FSTSRN
|
| |
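
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own pipeline: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the sequence is assumed to be given 5' to 3' on the coding strand, as the leading ATG suggests. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard codon table is used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; only the allowed start codons differ.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final eight codons of the gene sequence.
print(translate("ATGTCGTTGC CGCCAATCCT GTCCGGAAAA"))  # MSLPPILSGK
print(translate("GAATTTTCAACTTCTCGAAACTGA"))          # EFSTSRN
```

The two printed fragments correspond to the first ten and last seven residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.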