Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4186 |
Symbol | |
ID | 8014976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4282086 |
End bp | 4283645 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826756 |
Product | sulfatase |
Protein accession | YP_002977966 |
Protein GI | 241206870 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.360689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAATC CAACCAGCAT TGAGCCAACC AGCATCGAGA AAGCGCGCCG GCCGAACGTC CTGCTGATCA CGGCGGATCA GTGGCGCGGC GATTGCCTGT CGGCTGTCGG TCATGCCTGT GTAAAGACAC CTGGCGTCGA TGCTCTGGCG CGGGAAGGCA CCCTGTTCCG GCGGCACTAT GCCGGCGCGG CGCCCTGCTC ACCGGCGCGG GCCACGCTCT ATACCGGTCT CTATCAGATG AACCACCGGG TCTGCCGCAA CGGTTCGCCG CTCGATGCCC GCTTCGACAA TCTGGCCCTG GCCGCCCGGC GGGCGGGATA CGACCCGACG CTGTTCGGCT ACACCGATAC GGCGCCCGAT CCGCGCGGGA TGGATGGCGG CGATCCGCAT CTAACGAGCT ACGAAGGCGT GCTGCCGGGC TTTACGGCCC GCCAGCTTCT GCCCGAGCAC GAAAGACAGT GGCTCTCCTG GCTCAGGTCT CGCGGCCATG CCGATGCCGT CAGCCGCGAT ATCCATATTC CTGTTGGTGC TGGAGCCGGA GAAATTTCAA ATGCGGCGCC GGCCTATTCG CGCGACGAGA CCCAAACGGC TTTCCTGGCC GGCGAGTTCA TCCGCTGGAT GGGCGAACAG GACAGGCCGT GGTTTGCGCA TGTCTCTTTC TTACGTCCGC ATCCGCCGTT CTCCGTGCCG GAGCCGTTCA ACCGGATGGT CAAGCCAGGC GAGGGACCGG CTTTTGCCCG CGCGACAAAC CGCGAAGCGG AACCGGTCAG CCATCCCTAC CTCGCCTATG CCATGCCGCG CGCTGACAAG GGCAGCTTTA TCCACGGCGC AACGGGGCCG CTCAGCGGCT GGAACGCCGA GGATTTCGCC GCGATCCGGG CGATCTATTA CGGCATGATA TCAGAGGTCG ATGCGCAGCT CGGCCGCATC TGGCAGGCCC TCAAGGATGC GGGCGCCTGG GACGATACGC TTGTTGTCTT CACCTCTGAT CACGCCGAGA TGGCGGGCGA TCACTGGATG CTCGGCAAGG GCGGTTTCTT CGACGGCAGC TATCATATTC CTCTGGTGAT CCGCGATCCC GCAAGCAGTG CTGCGGGTGG GGTCGTCGAC AAATTCACCA GCGCTGCGGA TATTTTTCCG ACACTTTGCC AAAGGTTTGG CATCGACGCG AAGAACGGGC TCGATGGCCG GTCGCTCATG CCGTTCGTCA GGGGTGGCAG CGGAAAGGGC TGGCGGGACG CGGCATTTTG GGAATTCGAT TTCCGCGACA TTGCGCATGG TGAGGCCGAG CAGCATTTCC GGCTGCGGTC CAACGAATGC AATCTCGCGG TGATCCGCGA CGCGCGGTTC AAATATGTGC ATTTCACCGC CTTGCCGCCG CTGCTCTTCA ATCTTGCCGA CGACCCGATG GAACTCGACA ATGTCGCGGC GGATCCGGCC TATGCGGCGA TACGGCTCGA CTATGCCGAG AAGCTGCTGT CGCTCCGCGC ACGCCATCTC GATCAGACTC TCGCCTATAC TGAGCTGACG GAAAGAGGGC CGGTGACGCA CCGGCCCTGA
|
Protein sequence | MQNPTSIEPT SIEKARRPNV LLITADQWRG DCLSAVGHAC VKTPGVDALA REGTLFRRHY AGAAPCSPAR ATLYTGLYQM NHRVCRNGSP LDARFDNLAL AARRAGYDPT LFGYTDTAPD PRGMDGGDPH LTSYEGVLPG FTARQLLPEH ERQWLSWLRS RGHADAVSRD IHIPVGAGAG EISNAAPAYS RDETQTAFLA GEFIRWMGEQ DRPWFAHVSF LRPHPPFSVP EPFNRMVKPG EGPAFARATN REAEPVSHPY LAYAMPRADK GSFIHGATGP LSGWNAEDFA AIRAIYYGMI SEVDAQLGRI WQALKDAGAW DDTLVVFTSD HAEMAGDHWM LGKGGFFDGS YHIPLVIRDP ASSAAGGVVD KFTSAADIFP TLCQRFGIDA KNGLDGRSLM PFVRGGSGKG WRDAAFWEFD FRDIAHGEAE QHFRLRSNEC NLAVIRDARF KYVHFTALPP LLFNLADDPM ELDNVAADPA YAAIRLDYAE KLLSLRARHL DQTLAYTELT ERGPVTHRP
|
| |