Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6904 |
Symbol | |
ID | 8022650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 354900 |
End bp | 356570 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644833765 |
Product | sulfatase |
Protein accession | YP_002984899 |
Protein GI | 241666815 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.430095 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGCA GAATCCTTAT CCGCTGCATC GGAGCACTGG CTTCATCCAC CATTCTTTGG TGTGCGGCCT CGCCCCTGCA AGCGCAGGAC TCCCAACGAA AACCCAACAT CCTGTTCATC GTTTCGGATG ATACCGGCTA CGGTGATCTC GGCCCCTATG GCGGAGGTGA AGGTCGCGGG ATGCCGACCC CGAACATCGA CAAGCTTGCT GAAGACGGCA TGACCTTCTT CTCCTTCTAC GCCCAGCCGA GTTGCACGCC CGGCCGTGCG GCCATGCAGA CGGGGCGAAT ACCAAACCGC AGCGGCATGA CGACTGTCGC CTTTCAAGGT CAGGGCGGTG GCTTGCCCGC GGCCGAATGG ACACTTGCAT CCGTGCTGAA ACGTGGCGGC TATCACACCT ATTTCACCGG CAAATGGCAT CTCGGCGAAG CGGACTACGC CCTACCGACT GCGCAGGGTT ATGATGAGAT GCGGTACGCC GGCCTCTACC ATCTGAATGC CTATACGTAT GCCGATCCCA CCTGGTTCCC GGACATGGAT CCGAAGCTGC GGGAGATGTT CCAGAAGGTG ACCAAGGGGG CTTTGTCCGC CAAGGCAGGA GGACCAGTGA CCGAAGAATT CAAGGTCAAT GGCCAATACG TCGACACACC CATGATCGAC GGTAAGGAGG GCGTTGTCGG CATTCCGTTC TTCGACGGCT ACGTCGAGAA AGCGGCACTG GGCTTTCTGG ACGAGGCTGC CAAAGCACCG GACGAACCCT TCTTCATCAA CGTGAACTTC ATGAAGGTCC ACCAGCCGAA CATGCCGGCC CCAGAGTTCG AGCACAAGTC CATGTCGAAG TCGAAGTATG CGGACTCGAT CGTGGAACTC GACACCCGCA TTGGCCGAAT CATGGACAAA TTGCGGGAAA CCGGCATGGA CCGCAACACG CTGGTTTTCT ACACCACCGA CAATGGGGCA TGGCAGGACG TCTATCCGGA CGCCGGATAC ACCCCGTTCC GCGGAACCAA AGGCACCTTG CGAGAGGGCG GCAACCGTGT TCCTGCGATT GCGGTCTGGC CGGGAAAGAT CAAACCCCGC ACCAAGAACC ACGACATCGT CGGTGGTCTC GATCTGATGG CGACATTCGC CGCCGTCGGT GCGGTTCCGC TACCCGACAA GGATCGCGAA GACAAACCGA TCATATTCGA TAGCTACGAC ATGTCGCCGA TCTTGCTCGG CACCGGTAAA TCGGAACGCA AGTCCTGGTT TTACTTTACT GAAAACGAGC TCTCGCCCGG TGCGATACGC GTCAACAACT ACAAGTTCGC CTTTAATATC CGCGGGGATA ACGGAGCCTC GACGGGCGGA CTGGCGGTCG ACACCAACCT CGGCTGGAAG GGTGAGGAGA AGTATGTCGC TACGGTACCC CAAGTGTTCG ATCTGTGGCA GGACCCGCAG GAACGCTACG ACATTTTCAT GAACAACTTC ACCGAGCGGA CCTGGATGGG CGTCGTCATG GGCGAAGAAT TGAAGAAGAT CATGGCCACC TACGTGGAGT ACCCACCTCG CAAACCCCAG AGCCTGACCT ACAATGGTCC CATCACGCTA TCGGACTACA GTCGTTTTCA GTGGATCCGA GAATCGTTGG CAAAGGAAGG CGTGAGCATT CCTATGCCGA CCGGAAACTA A
|
Protein sequence | MNSRILIRCI GALASSTILW CAASPLQAQD SQRKPNILFI VSDDTGYGDL GPYGGGEGRG MPTPNIDKLA EDGMTFFSFY AQPSCTPGRA AMQTGRIPNR SGMTTVAFQG QGGGLPAAEW TLASVLKRGG YHTYFTGKWH LGEADYALPT AQGYDEMRYA GLYHLNAYTY ADPTWFPDMD PKLREMFQKV TKGALSAKAG GPVTEEFKVN GQYVDTPMID GKEGVVGIPF FDGYVEKAAL GFLDEAAKAP DEPFFINVNF MKVHQPNMPA PEFEHKSMSK SKYADSIVEL DTRIGRIMDK LRETGMDRNT LVFYTTDNGA WQDVYPDAGY TPFRGTKGTL REGGNRVPAI AVWPGKIKPR TKNHDIVGGL DLMATFAAVG AVPLPDKDRE DKPIIFDSYD MSPILLGTGK SERKSWFYFT ENELSPGAIR VNNYKFAFNI RGDNGASTGG LAVDTNLGWK GEEKYVATVP QVFDLWQDPQ ERYDIFMNNF TERTWMGVVM GEELKKIMAT YVEYPPRKPQ SLTYNGPITL SDYSRFQWIR ESLAKEGVSI PMPTGN
|
| |