Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4050 |
Symbol | |
ID | 6982821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4226334 |
End bp | 4227560 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643398780 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_002283538 |
Protein GI | 209551621 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAAGC GTAGCGTGGC TCTTTTTCTT CTACTTCTTC TGGCAAGTTG CGGGCATCCG GGCGGCGTGA TGACGCCAGT TTCAGCCTCA TTGTCTTCAT CGTCGATAAC GCCGACCTCC ACGGTCGACA TGCTGGTTGC GACCACGCGC GAGCCTTCCG GGAACCCGGC GACGCTCTTC AACGGCGAGC GCAGTTCCAA GCCGCATCTG ACGCAGATCT CGATTTCGAT CCCGGCAAAG CGCGAAGCCG GCACCGTGCA GTGGCCGAAG CGGCTTCCGC CCGATCCGGC CACCGACTTT GCCGTCACCC GTGTTCAGCA GATCGACACC GTGGCACAGG GCCGCGTCTG GTTCCGGCAG CATGTGCACG GCGGCCATGC GCTCGTCTTC ATCCACGGCT TCAACAACAC CTATGAGGAT TCCGTCTTCC GGCTGGCGCA ATTGGTGCAC GACAGCAAGA TGCAGGCGAC GCCTGTCCTG TTCACCTGGC CGTCGAGAGC GGAGATCACG GCCTACCAAT ACGACAAGGA GAGTACGAAT TATTCGCGGA CGGCGCTGGA GCAGGCGCTC CGCACGCTGG CCGCCGACCC CGATGTCAAG GACATCACCG TCATGGCCCA TTCGATGGGA ACGTGGCTTG CCATGGAATC GCTGCGGCAG ATGGGAATAC GCGACGGTCA CGTCATTTCG AAGATCCACA ATGTCATCCT GGCCTCGCCC GATATCGACA TCCAGGTCTT CGCCAAACAA TATGTGGAGA TGGGCGAGCC GCGTCCGAAA TTCACGATCT TCGTTTCCCA GGACGACAAG GCTCTTGCGG TTTCGAGCTT CATCACCGGA CGCGTTTCCC GTCTCGGCGC GATCAATCCG GCCGAGGAGC CCTACCGCTC GAAGCTGGAA AATGCCGGCA TCACTGCCAT CGACCTGACG AAGGTGAAGA CCCACGACAG GCTCAATCAC GGCAAATTCG CCGAAAGTCC CGAGATCGTC CAACTCATCG GCCAGCGCCT CATGACTGGC CAGACCTTGA CCGATTCCAA GGTCACGCTG GGTCAGGGCA TTACGGCCGT CGTCGGCGGA ACGGTATCGA CGATCGGCAC CGTTGCCGCG ACCGCCGCCG CGGCGCCGGT TGCGATCATC GAACAGCCCG TCACGCGGAA GAAGCCGCCG CGAGCTGCCA ACGAGACGCT CGACGGCGAC CTGAAGCAAC AGACGCTGAC ACAGTGA
|
Protein sequence | MAKRSVALFL LLLLASCGHP GGVMTPVSAS LSSSSITPTS TVDMLVATTR EPSGNPATLF NGERSSKPHL TQISISIPAK REAGTVQWPK RLPPDPATDF AVTRVQQIDT VAQGRVWFRQ HVHGGHALVF IHGFNNTYED SVFRLAQLVH DSKMQATPVL FTWPSRAEIT AYQYDKESTN YSRTALEQAL RTLAADPDVK DITVMAHSMG TWLAMESLRQ MGIRDGHVIS KIHNVILASP DIDIQVFAKQ YVEMGEPRPK FTIFVSQDDK ALAVSSFITG RVSRLGAINP AEEPYRSKLE NAGITAIDLT KVKTHDRLNH GKFAESPEIV QLIGQRLMTG QTLTDSKVTL GQGITAVVGG TVSTIGTVAA TAAAAPVAII EQPVTRKKPP RAANETLDGD LKQQTLTQ
|
| |