Gene Information |
Locus tag | Rleg_6957 |
Symbol | |
ID | 8022985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 402217 |
End bp | 403494 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644833814 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_002984948 |
Protein GI | 241666864 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
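
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only a sanity check, not part of the record. It assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(402217, 403494)
print(length)                     # 1278
print(protein_length_aa(length))  # 425
```

Under these assumptions the computed values agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields.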
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCTG GTCCTATCCG GCGACTTCTT CCATGCCTCG CGCTGGTCGT GCTCGCGGGC TGCGGCGGAC ATCCGAAGAA CGTGCTGTTC CCCGTTGCGG ACACCGTGCC GGATACCAGT CGCGTCGACA TGCTTGTCGC GACCACGCGT GCCCGATCGA CTGTTTCCGG CGAGATGTTC ACGGGCGAGC GCGCCCGCAC GCCGTCCTTC GCACAGATGA GAGTTTCGCT GCCCAAGGTC CGCAACGAGG GCGACGTCGC CTGGCCCAAG AGGCTGCCTT CAAATCCGAA GACCGACTTC GCCACGCTGA AGGCCGACGC GCTTAGTCTC GAAGCGGCGA AAGGCTGGCT GAACGCCTCG GTCAGAAGGA ACCGCGACCA CAGCGTCCTG GTCTTCATCC ACGGCTTCAA CAACCGCTTC GAGGACAGTG TCTACCGCTT CGCTCAGATC GTCCACGATT CCAATGTCCG CAGCACGCCG GTCCTTGTGA CCTGGCCATC GCGCGGCAGC CTGCTCGCCT ATGGTTATGA CCGCGAGAGC ACCAACTACA CGCGCAACGC GCTCGAGACC CTGTTTCAGT ATCTCGCCAA AGACGGCACC GTGAAGGAAG TCAACGTGCT GGCCCACTCG ATGGGCAACT GGCTGGCGCT GGAAGCGCTC CGGCAGATGG CGATCCGCAA CGGCGGTCTC CCGGCAAAGT TCAAGAACGT CATGCTGGCC GCTCCGGACG TCGACGTGGA CGTCTTCCGG TCGCAGATCG AGGATATGGG CGACCCCCAT CCGCAATTCA CGCTTTTCGT CTCGCGCGAC GACAAGGCGC TTGCCTTCTC GCGGCGGGTC TGGGGCAACA TCCCGCGCCT CGGTTCGATC GATCCGGAAA CCGCGCCGTA CAAGACCGAA CTCGCCGACT ACAAGGTATC GGTGATCGAT CTGACCAAGA TCAAGGTCAG CGACGACCTC AATCACAGCA AGTTCGCGGA ATCACCGCAA GTCGTCCAGC TCATCGGCCA GCGGTTGTCC GAAGGGCAGA CCCTGACCGA CAGCCGGGTC GGCCTTGGCG ACACGATCCT TGCGGGGACA ACGAACGTGG CGGCGGCGGC CGGAAGCGCC GCGGGCCTCG TACTGACCGC CCCGGTCGCG GTGCTCGACG CCGATACTCG CAGCAACTAC GCCAATCATG TCGGCGGTCT GACCGGACAA GACGGCGGGA CGCAGAAGAT CGCGGTCAAG AACTGCGCGG CGACGCCGGC CGACCCGGCA TGCCGGAAGC AGCGATAG
|
Protein sequence | MTAGPIRRLL PCLALVVLAG CGGHPKNVLF PVADTVPDTS RVDMLVATTR ARSTVSGEMF TGERARTPSF AQMRVSLPKV RNEGDVAWPK RLPSNPKTDF ATLKADALSL EAAKGWLNAS VRRNRDHSVL VFIHGFNNRF EDSVYRFAQI VHDSNVRSTP VLVTWPSRGS LLAYGYDRES TNYTRNALET LFQYLAKDGT VKEVNVLAHS MGNWLALEAL RQMAIRNGGL PAKFKNVMLA APDVDVDVFR SQIEDMGDPH PQFTLFVSRD DKALAFSRRV WGNIPRLGSI DPETAPYKTE LADYKVSVID LTKIKVSDDL NHSKFAESPQ VVQLIGQRLS EGQTLTDSRV GLGDTILAGT TNVAAAAGSA AGLVLTAPVA VLDADTRSNY ANHVGGLTGQ DGGTQKIAVK NCAATPADPA CRKQR
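
The Gene sequence and Protein sequence fields can be compared with a short script. The following is a minimal sketch, not the tool that produced this record. It assumes the space-separated 10-base blocks are display formatting only, and it uses the standard codon assignments, which translation table 11 shares with table 1 for internal codons. Alternative start codons permitted by table 11 (such as GTG or TTG) are not handled, which is acceptable here because the gene starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the gene sequence listed above.
print(translate("ATGACTGCTG GTCCTATC"))   # MTAGPI
print(translate("TGCCGGAAGC AGCGATAG"))   # CRKQR
```

Passing the full Gene sequence value to `translate` and `gc_content` allows a comparison against the Protein sequence and GC content fields. The two fragments shown match the first six and last five residues of the listed protein.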