Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6900 |
Symbol | |
ID | 8022646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 349610 |
End bp | 351247 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644833761 |
Product | sulfatase |
Protein accession | YP_002984895 |
Protein GI | 241666811 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0879565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.386644 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCA AGAGCAACCG GCAAGGTTAC AACGCCACCC GTCGCCAAAT CCTGCTGGCC GGCGGTTCGG CAATCGCTTT AACGGCATTC TGTCCGATCG CCAGCATCCC GGCGCTTGCC CAGGCAGGAG CAAAGAAACC GAACATCCTT GTGATCTTCG GCGATGACAT CGGCTGGTGG AACACCAGCG CTTACAATCG CGGGCAGATG GGATACCAGA CGCCAAATAT CGACCGTATC GCCGACGAAG GTGCGATGTT CACCGATCTC TACGCTCAGC AATCCTGCAC GGCGGGGCGG GCGGCCTTCA TCACAGGCCA AAGCTGTTTT CGCACGGGGC TGCTAAAAGT CGGGCTTCCC GGAGCCAAGG AGGGTCTGTC TGAGAAGGAT CCGACAATCG CCGAACTGCT CAAGCCACAA GGGTACGTTA CCGGCCAGTT CGGCAAGAAC CATCTCGGCG ATCGCAACGA ATTTCTGCCA ACGGTCCATG GCTTCGATGA ATTCTTTGGC AACCTCTATC ACCTCAATGC CGAAGAAGAA CCGGAGAACC CCGATTACCC GAAGGATCCA CAGTTTCTCG CAAAATTCGG ACCGCGTGGC GTGCTGAAAT GCAAAGCAAG CGAGACAGAC GATCCGACCG AGGATCCGCG TTTCGGAAGA GTGGGCAAGC AGACGATTGA GGATACGGGA CCGCTCAATA GAAAACGCAT GGAAACCGTT GATGAGGAAT TCCTAGGCGC TGCCAAGGAC TTCATCGACC GCAGCGCCAA AGCCGACAAA CCGTTCTTTT GCTGGTTCAA CTCGACCCGG ATGCATATCT ACACGCATCT TAAGGCTGAG TCGGAAGGCA AAACGGGGTT GGGGATCGTT GCCGACGGCA TGGCCGAATT TGACGGTATG GTTGGCCAGC TGCTCGACCA GCTCGATGAT CTTGGAATCG CCGAAAACAC CATTGTTGTC TGGACGACCG ATAACGGTGC AGAGGTGTTC TCCTGGCCTG ACGGGGGCAC AACGCCGTTC CATGGCGAAA AGAATACAAA TTGGGAGGGG GGCTACCGCG TGCCCGGGAT GGTGCGCTGG CCGGGCGTTG TCAAACCGGG AACCGAGATC AACGAGATTG TCTCCCACGA AGACTGGCTT CCGACCTTGG TTGCGGCAGC CGGCGAGCCG GACATCGCAG CCAAGCTTCT GAACGGCTAT GAAGCGGCCG GCAAGACATT CAACGTGCAT CTTGACGGCT ACAATCAACG CAAACTGCTT GATGGCACAG GGCCTGGGGC GCGCAAGGAG TATTTTTACT GGACTGATGA CGGAAGCCTG GCCGGATTGC GCTACGACCG CTGGAAGCTG GTGTTCATGG AACAACGAGC AGAGGGGTTG GACGTGTGGC AGGATCCTCT GATCACACTG AGATTTCCGA AGTTAATCGA CCTGCGCGCC GATCCGTTCG AAATTGCCCA GCATGCAGCG GGAGACTATG CAAGATGGCG TGTAGAACAT GCCTTCGCGC TGGTTCCGGC CCAGGCATAT GTGGCCAAAC ATCTTCAAAC CTATGTAAAA TATCCGCCCC GCCAGGCGCC GGGAAGCTTC TCGATGGACC ATGTGCTTGA GAAACTCCAG CGGGGTGGCG GACAGTGA
|
Protein sequence | MSSKSNRQGY NATRRQILLA GGSAIALTAF CPIASIPALA QAGAKKPNIL VIFGDDIGWW NTSAYNRGQM GYQTPNIDRI ADEGAMFTDL YAQQSCTAGR AAFITGQSCF RTGLLKVGLP GAKEGLSEKD PTIAELLKPQ GYVTGQFGKN HLGDRNEFLP TVHGFDEFFG NLYHLNAEEE PENPDYPKDP QFLAKFGPRG VLKCKASETD DPTEDPRFGR VGKQTIEDTG PLNRKRMETV DEEFLGAAKD FIDRSAKADK PFFCWFNSTR MHIYTHLKAE SEGKTGLGIV ADGMAEFDGM VGQLLDQLDD LGIAENTIVV WTTDNGAEVF SWPDGGTTPF HGEKNTNWEG GYRVPGMVRW PGVVKPGTEI NEIVSHEDWL PTLVAAAGEP DIAAKLLNGY EAAGKTFNVH LDGYNQRKLL DGTGPGARKE YFYWTDDGSL AGLRYDRWKL VFMEQRAEGL DVWQDPLITL RFPKLIDLRA DPFEIAQHAA GDYARWRVEH AFALVPAQAY VAKHLQTYVK YPPRQAPGSF SMDHVLEKLQ RGGGQ
|
| |