Gene Information |
Locus tag | Rleg2_3858 |
Symbol | |
ID | 6982621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4001251 |
End bp | 4002810 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643398580 |
Product | sulfatase |
Protein accession | YP_002283346 |
Protein GI | 209551429 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
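The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Rleg2_3858.
length = gene_length_bp(4001251, 4002810)
print(length, protein_length_aa(length))  # 1560 519
```

Both results agree with the Gene Length (1560 bp) and Protein Length (519 aa) fields.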
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0969664 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.154261 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAATC CAACCAGCAT TGAGCCCAGA ATTGAGAAGA CGCGCCGGCC GAACATTCTG CTGATCACCG CCGATCAGTG GCGCGGCGAC TGCCTGTCGG CCATCGGTCA TGCTTGTGTG AAAACGCCGA ATGTCGATGC CTTGGCGCGC CAGGGAACGC TCTTTCGGCG GCACTATGCC GGGGCAGCCC CCTGCTCGCC GGCGCGGGCC ACGCTCTATA CCGGCCTCTA CCAGATGAAC CACCGCGTCT GCCGCAATGG TTCGCCGCTC GATGCCCGCT TCGACAATCT GGCGCTGGCG GCCCGGCGGG CGGGATACGA CCCGACGCTG TTCGGTTATA CCGACACGGC GCCCGATCCG CGCGGCATGG ATGCCAATGA TCCGCATCTG ACGAGTTATG AAGGCGTGTT GCCGGGCTTT ACCTCACGCC AGCTTCTGCC CGAGCATGAA AAACAATGGC TCTCCTGGCT GAGATCCCGC GGTCATGCGG ATGCCGTCAG CCGCGACATT CATATTCCCG TCGGCGCCGA AGCCGGAGAC ATTTCCGACG CGGCGCCGGC CTATTCCAGC GACGAGACCC AGACGGCTTT CCTAGCCGGC GAGTTCATCC GCTGGCTGGG TGATCAGGAC AGGCCGTGGT TCGCGCATGT GTCTTTCCTG CGTCCGCATC CACCCTTTTC CGTGCCGGAT CCGTTCAACC GGATGTTCAA GCCGGGCGAG GGGCCGGCTT TTGCGCGTGC GGCAAACCGC GAAGCGGAAG AGGCAAGCCA TCCCTATCTC GCCTACGCCA TGCCGCGCAC CGGCAAGGGC GCCTTCATCC ACGGCGCAAC GGGACCGCTC AGCGACTGGA ACGGCGAGGA TTTCGCCGCG ATCCGGGCGA TCTATTACGG CATGATAGCA GAGGTCGATG CCCAGCTCGG CCGGATCTGG CAGGCCTTGA AGGATGCCGG CGCCTGGGAT AATACGCTTA TTGTCTTCAC CTCCGACCAC GCCGAGATGG CCGGCGATCA CTGGACGCTG GGGAAGGGTG GCTTCTTCGA CGGCAGCTAC CATATTCCGC TCGTCATTCG CGATCCGGCA AGCGGCGCTA CAGGCGGGAT CGTCGATGGT TTCACCAGTG CTGCGGATAT TTTTCCGACG CTTTGCGAAA GGCTTGGCAT CGAGGCGAAG AACGGGCTCG ACGGCCGGTC GCTAATGCCG TTCGTCAATG GCGGGAGCGG ACAGGATTGG CGGGACGCGG CATTCTGGGA GTTCGACTTC CGCGATATCG CCGGGGGCGA GACGGAGCGG TATTTCGGGC TCAAGTCGAA CGAATGCAAT CTCGCGGTGA TCCGCGATGC GCAGTTCAAA TATGTGCATT TTGCCGCCTT GCCGCCGCTG CTCTTCAATC TCAGCGACGA TCCGATGGAG CTCGACAATA TCGCAGGCGA TCCCGCCCAT GCGGCGATCC GGCTTGAGTA TGCTGAAAAG CTGCTGTCGC TGAGGGCGCG GCATCTGGAT CAGACGCTTG CCTATACCGA GCTGACGGAA AAAGGGCCGG TAACGCGCCG GCCCTCATAA
|
Protein sequence | MQNPTSIEPR IEKTRRPNIL LITADQWRGD CLSAIGHACV KTPNVDALAR QGTLFRRHYA GAAPCSPARA TLYTGLYQMN HRVCRNGSPL DARFDNLALA ARRAGYDPTL FGYTDTAPDP RGMDANDPHL TSYEGVLPGF TSRQLLPEHE KQWLSWLRSR GHADAVSRDI HIPVGAEAGD ISDAAPAYSS DETQTAFLAG EFIRWLGDQD RPWFAHVSFL RPHPPFSVPD PFNRMFKPGE GPAFARAANR EAEEASHPYL AYAMPRTGKG AFIHGATGPL SDWNGEDFAA IRAIYYGMIA EVDAQLGRIW QALKDAGAWD NTLIVFTSDH AEMAGDHWTL GKGGFFDGSY HIPLVIRDPA SGATGGIVDG FTSAADIFPT LCERLGIEAK NGLDGRSLMP FVNGGSGQDW RDAAFWEFDF RDIAGGETER YFGLKSNECN LAVIRDAQFK YVHFAALPPL LFNLSDDPME LDNIAGDPAH AAIRLEYAEK LLSLRARHLD QTLAYTELTE KGPVTRRPS
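The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, using only the Python standard library, of how the blocks might be joined, a GC fraction computed, and the reverse complement taken. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA), so the reverse complement would correspond to the forward strand of the replicon for this minus-strand gene. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# First three blocks of the gene sequence from the record, as an example input.
first_blocks = clean_sequence("ATGCAAAATC CAACCAGCAT TGAGCCCAGA")
print(len(first_blocks))  # 30
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.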