Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_5090 |
Symbol | |
ID | 8007683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 480015 |
End bp | 481196 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644822005 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_002973265 |
Protein GI | 241113430 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0102585 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAAC AGCGTTTGAA AGAGTCTCCG ATGGGAGCCA AGGCGATGCT GGTCGCGATC GGCCTGGCGA TGCTCGCCGG TTGCGCCACG AGACCATCGC CTGATGTGTT GAACCCGGTC CGTCTGCCGG TTCATCTCCC AGGTCATCTG AATGCATCGC TTCATTCCGA TGAAGCCGCT GCGAACCACG TAAACGTGCT CGCTGCGACA AACAGGAGCC CCGACACTGC GCGCGGTGGG TTCGGGAGCG CGTGGGCTGA CAATCTCACC TACGAGCAAT ATGCGTTTTC GGTCCCTCCC AACCGGAAGG ACACCGCGAT CACATATCCG ACCGCGAGAC CGGATCCCGA ACGGCAATTT GCCGTCATCG GGCGCAAGCA GCTTGCAAAG GCGGCCTTCG TGCAGGCAGC GCTGGGCTCC GTTCAGTCCG ACGGCACTGT CGGCATCTTC GTCCATGGCT ATAACTATAG CTATCAGGAG GCGCTGTTTC GCACCGCGCA GATCGCTGCG GACGCCAATA TTCCGGGCTC TCCGATTCTG TTTTCGTGGC CTTCGGCCGC TGCCGTCGCC GGCTATGTCG CCGACCGCGA TGCGGCGCTG TCCTCGCGTA GCGACCTCGA TTCGCTTATC ACCTCGCTCT CGGCTTCAGG AAAGGTGAAA CGCGTCATCC TTTTCGGACA CAGCATGGGC GGATTCCTGG TCATGGAGAC AGTGCGTGAG CTCAAACTGC AGCATCGCGA CGACGTCATC GGTAAACTGG CGGTGATCCT CGCCGCCCCT GACATCGACG TCGATGTTTT CCGGTCGCAG TTGAAGGATA TCGGGCGGAT GCCGATCCCA ATATCCCTTC TCGTTTCGAA GGACGACAGG GCGCTGGTGG CCTCGAGCTT CATAGCCGGA GAGCGGGCGC GGGTCGGACG CCTCGATATC GACGATCCCG TCATCAGGGA GGCTGCCTTG AAGGAAAGGC TTCGGGTCAT CGACATCACG TCGATCCAGG CGTCCGACGG GATGGGGCAC GACCGCTACG CATCGCTCGC CAAGTTCGGC GCGCAGCTTG CCTCCTTCGA AAGTGGGAAG CGTTCGACCG CCGGCGAGGT TGGCGCCTAT GTCTTCGATG CCGCCGGCGC CGCGGTCGCA AGTCCATTTC GTCTGGCCGG ACGTGTCGTC GGCTCGCAAT GA
|
Protein sequence | MTEQRLKESP MGAKAMLVAI GLAMLAGCAT RPSPDVLNPV RLPVHLPGHL NASLHSDEAA ANHVNVLAAT NRSPDTARGG FGSAWADNLT YEQYAFSVPP NRKDTAITYP TARPDPERQF AVIGRKQLAK AAFVQAALGS VQSDGTVGIF VHGYNYSYQE ALFRTAQIAA DANIPGSPIL FSWPSAAAVA GYVADRDAAL SSRSDLDSLI TSLSASGKVK RVILFGHSMG GFLVMETVRE LKLQHRDDVI GKLAVILAAP DIDVDVFRSQ LKDIGRMPIP ISLLVSKDDR ALVASSFIAG ERARVGRLDI DDPVIREAAL KERLRVIDIT SIQASDGMGH DRYASLAKFG AQLASFESGK RSTAGEVGAY VFDAAGAAVA SPFRLAGRVV GSQ
|
| |