Gene Information |
Locus tag | Rleg_5143 |
Symbol | |
ID | 8007003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 544747 |
End bp | 545982 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644822056 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_002973316 |
Protein GI | 241113481 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
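The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; the record does not state either convention. The function names are illustrative only and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 544747
END_BP = 545982
GENE_LENGTH_BP = 1236
PROTEIN_LENGTH_AA = 411


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP))            # 1236
print(expected_protein_length(GENE_LENGTH_BP))  # 411
```

Under these assumptions, 545982 - 544747 + 1 gives the reported 1236 bp, and 1236 / 3 - 1 gives the reported 411 aa.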
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGGG CGACATCGCT TCGACTGATC GCGCTGCTGG TTCTCCTGGC GCCGCTTTGC GCCTGCGGAC ATCCACGCGG CGTCATGCAG CCTGTGGCGC TGACTGCGGC CACGCCCGGA ACCGCGCAGG TCGACATGCT CGTCGCAACG ACCCGACAGC CGTCGGGCGA TCCGGCGACG TTGTTCAACG GCGAGCGCAG CCCAAAGCCC TCCATGACCG ACGTTGCGGT TTCCATTCCG CCGAAGCGCG AGGCGGGCAC CGTCCAGTGG CCGCAGCGAC TGCCGCCCAA TCCTGCCACG GACTTTGCCG TGACACGGGT GAAGCAGATC GACACCATTC CGGAAGGCCG GGCATGGTTC CGCCAGCATA TTCAGGGCGG GCATGCGTTG GTTTTCATCC ACGGGTTTAA CAACACATAC GAGGACTCGG TCTTCCGCCT CGCCCAGATC GTCCACGACA GCGGCATGCA GGCGACGCCG ATCCTCTTCA CCTGGCCGTC ACGCGCGCAG CTCACCGGAT ACGAATACGA CAAGGAAAGC ACGAACTATT CGCGCACGGC GCTGGAGCAG GCGCTGCGGG TCCTCGCCGC CGATCCTGAT GTGAAGGACA TCACCATCCT CGCGCATTCC ATGGGAACGT GGCTGGCGAT GGAATCGCTG CGGCAGATGG GCATCCGCGA CGGTCACGTC AACGCCAAGA TACACAACGT CATCCTCGCC TCGCCCGACA TCGACATCCA GGTGTTCGCC AAGCAGTTCG TCGAGATGGG AGACCCGAAA CCGAAGTTCA CCATCTTCGT GTCCCAGGAC GACCGGGCGC TCGCGGCATC GAGCTTCATC ACCGGCAACG TGTCGCGGCT CGGTGCCATA GACCCGTCGA AGGAGCCCTA TCGATCCAGG CTGGAAAAGG CGGGCATCAC CGCGATCGAC CTCACGAAGG TGAAGGCCGG CGACAGCCTC CATCATGGCA AGTTCGCCGA AAGTCCCGAC ATCGTCCAGC TCATCGGCCA GCGTCTGATG ACCGGGCAAA CGCTGACGGA TTCCAACATT TCTCTCGGAC AGGGCGTCGC CGCCGTCGTG GGCGGGACAG CGCGCACCGT CGGCACAGTC GCAGGCGCTG CAGTTGCAGC ACCACTTGTG ATCATCGAGC AGCCGGCAAG AAAGCGGCAG CCGACAGGAA CGGAGCTGGA AGACGGCCTG CACAACGACC GCCAGTCGAA GCCCCTGACG CAATAG
|
Protein sequence | MPRATSLRLI ALLVLLAPLC ACGHPRGVMQ PVALTAATPG TAQVDMLVAT TRQPSGDPAT LFNGERSPKP SMTDVAVSIP PKREAGTVQW PQRLPPNPAT DFAVTRVKQI DTIPEGRAWF RQHIQGGHAL VFIHGFNNTY EDSVFRLAQI VHDSGMQATP ILFTWPSRAQ LTGYEYDKES TNYSRTALEQ ALRVLAADPD VKDITILAHS MGTWLAMESL RQMGIRDGHV NAKIHNVILA SPDIDIQVFA KQFVEMGDPK PKFTIFVSQD DRALAASSFI TGNVSRLGAI DPSKEPYRSR LEKAGITAID LTKVKAGDSL HHGKFAESPD IVQLIGQRLM TGQTLTDSNI SLGQGVAAVV GGTARTVGTV AGAAVAAPLV IIEQPARKRQ PTGTELEDGL HNDRQSKPLT Q
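As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content field, the following sketch translates codons and computes a GC fraction in plain Python. It is a minimal example, not the pipeline that produced this record. The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names. The codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in which start codons it permits). Only the first and last few codons of the record are used here; passing the full gene sequence to `gc_fraction` would be the way to compare against the reported 64%.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; translation table 11 uses the same ones.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-base blocks in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence codon by codon; '*' marks a stop."""
    dna = clean(dna)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 6 bases of the gene sequence in this record.
first_codons = "ATGCCCCGGG CGACATCGCT TCGACTGATC"
last_codons = "CAATAG"

print(translate(first_codons))  # MPRATSLRLI
print(translate(last_codons))   # Q*
```

The translated prefix matches the first block of the protein sequence, and the final two codons give the terminal Q followed by a stop.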