Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0115 |
Symbol | |
ID | 8011353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 107377 |
End bp | 108654 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644822706 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_002973965 |
Protein GI | 241202869 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.958101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGCA AAAGAGGCGG CTTGCCCGCG GCTCTGACAG GTCTTCTGAT TGCAACGATG GCGCTCGCCG GCTGCGGCGG CAGGCCGGTC GGTGTCATGC AGGCGGCCGG CACCGCGGCC CCCGGCACCT CCAAGGTCGA CCTGCTCGTC GCGACGACGC GCGCTGCCGA CGACAATCCC GCCGTGCTTT TCTCCGGCGA ACGCGGCACC GGGCTTGCCG TCAATGCCGT CGACGTCTCC ATTCCGCCGG AAGCCAATCG CAAGGTCGGC CAGGTGCAAT GGCCAAGCCG CCTGCCGGCC GATCCGCTGC GCGATTTCGT CACAGTTTCT GTCGATCCGC TGGAAGGCGA GCGGGCCGGC GAGACGTGGC TGAAGTCCCA TATGCCGAAG AGCCGCCGCG TACTGGTCTT CGTCCACGGC TTCAACAATC GTTATGAGGA TGCCGTCTAC CGCTTCGCGC AGATCGTCCA CGATTCGCAT GCCGACGTTG CGCCCGTCGT CTTCACCTGG CCTTCGCGCG GCAGCATCTT CGATTATAAT TACGACAAGG AAAGCACCAA CTATTCCCGC GACGCGCTGG AGGAATTGTT GACCCGCACC GCCGCCAATC CCGCCGTTAG CGACGTCACC ATCATGGCCC ATTCGATGGG CACCTGGCTC ACCGTCGAAG CGCTGCGGCA GATGGCGATC CGCAACGGTC ATGTCGCCTC GAAGATCAAC AATGTCATCC TCGCTTCGCC GGATCTCGAT GTCGACGTTT TCGGCCGCCA GTTCGCCAGC CTCGGCAAGG AAAGGCCGCA CTTCACCATC TTCGTCTCGC AGGACGATCG CGCTTTGGCG CTGTCGCGGC GCATCTCCGG CAATGTCGAC CGGCTCGGCC AGATCGATCC TTCCGTCGAA CCCTATCGCA GCAAGCTCGA AGCGGCCGGC ATCACCGTGC TCGACCTCAC CAAGCTCAAG GGCGGCGACC GGCTGAACCA CGGCAAATTC GCCGAAAGCC CCGAAGTGGT GAAGCTGATC GGCGACCGGC TGATTGCCGG CCAGACGATC ACCGATTCCA ATGTCGGCCT CGGCGAGGCC GTCGGCGCCG TGGCGATGGG CGCTGCCCAG ACCGCCGGAA GTGCCGTCAG CGTCGCCGTC AGCACGCCGA TTGCGATCTT CGATCCGCGC ACCCGGCGCA ACTACGATGC CCAGCTGAAA CGTCTCGGCC AGTCGATGAA CAATACCGTC GGTTCGGTCG GCGACAGCGT CGGCGCCGGC CTGCCGGCAA GCCAGTAA
|
Protein sequence | MIGKRGGLPA ALTGLLIATM ALAGCGGRPV GVMQAAGTAA PGTSKVDLLV ATTRAADDNP AVLFSGERGT GLAVNAVDVS IPPEANRKVG QVQWPSRLPA DPLRDFVTVS VDPLEGERAG ETWLKSHMPK SRRVLVFVHG FNNRYEDAVY RFAQIVHDSH ADVAPVVFTW PSRGSIFDYN YDKESTNYSR DALEELLTRT AANPAVSDVT IMAHSMGTWL TVEALRQMAI RNGHVASKIN NVILASPDLD VDVFGRQFAS LGKERPHFTI FVSQDDRALA LSRRISGNVD RLGQIDPSVE PYRSKLEAAG ITVLDLTKLK GGDRLNHGKF AESPEVVKLI GDRLIAGQTI TDSNVGLGEA VGAVAMGAAQ TAGSAVSVAV STPIAIFDPR TRRNYDAQLK RLGQSMNNTV GSVGDSVGAG LPASQ
|
| |