Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5370 |
Symbol | |
ID | 6978464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1002236 |
End bp | 1003378 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643394472 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_002279290 |
Protein GI | 209547372 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.494551 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCAAC GACGTTTGAA CGGGCTTGCG AAGATAGCTC CGGCCGTGCT GGTCGCGCTC GGTCTGGTGA CACTCACGGG CTGCGTCTCG AGGCCGTCGC CCGATGTCTT GAAGCCGGTC CATCTGCAGG CTCGTGTCGA TGAGGCCGAA GTGAGCGTGC TGACTGCGAC GAACAGGACC GCCGATACCG TGCGGGGCGG GTTCGGGAGC GCATGGGCCG ACAACCTGAC GTATGAGCAA TATGCATTTT CTGTCCCTCC GGACAGAAAG GGTGTCACGA TCGCATACCC GACCTCGACA CTGGATCCCG AACGGCAATT TGCCGTCATC GACCGCAAGC AGCTCGCAAA GGGCGCTTTC GTTGAGGAGG CGTTGCGGTC GGTGCAACCG GACGGCACTA TTGGTATCTT CGTGCATGGT TACAACTACA GCTACCAGGA GGCGCTTTAT CGCACGGCGC AGATCGCCGC AGATGCGAAA ATGCCCGGCG CCCCGATCCT GTTTTCCTGG CCTTCGGCTG CAGCCGTTGC CGGCTATGTC GCAGACCGGG ATGCCGCGCT CGCCTCGCGC AGCGCGCTCG ATTCCCTCGT CACCTCGCTT TCGGCCTCCG GCAAGGTGAA ACGCATCGTC CTGTTCGGAC ACAGCATGGG CGGGTTCCTG GTCATGGAAA CGGTCCGCCA ACTCAAGCTG CAGCATCGCG ACGATGTCGT CGGCAAACTG GCGGTGATCC TCGCCGCCCC CGACATTGAC GTCGATGTCT TCCGGTCTCA GCTGAAGGAT ATCGGGCGGA TGCCTATCCC CATCTCTCTC CTGGTCTCGA AGGACGACCG GGCGCTTGTG GCCTCGAGCT TCATCGCAGG GGAGCGGCCG CGCGTCGGGC GTCTTGATAT CAACGATCCC GTCATCGAGG AGGCGGCCGT GAAGGAAAGG CTCCGGGTCA TCGACATCAC CTCGATCCAG GCCTCCGACG GGCTCGGGCA TGATCGTTAT GCATCACTCG CCAAGTTCGG TGCGCAGCTT GCCTCCTTCG AAACCGGCAG GCGTTCGACC GCCGGCGACG TCGGCGCCTA TGTCTTCGAT GCGGCCGGCG CTGCGGTCGC AAGCCCGTTT CGGCTGGCTG GACGGGTCGT CGGCGCGCAG TGA
|
Protein sequence | MEQRRLNGLA KIAPAVLVAL GLVTLTGCVS RPSPDVLKPV HLQARVDEAE VSVLTATNRT ADTVRGGFGS AWADNLTYEQ YAFSVPPDRK GVTIAYPTST LDPERQFAVI DRKQLAKGAF VEEALRSVQP DGTIGIFVHG YNYSYQEALY RTAQIAADAK MPGAPILFSW PSAAAVAGYV ADRDAALASR SALDSLVTSL SASGKVKRIV LFGHSMGGFL VMETVRQLKL QHRDDVVGKL AVILAAPDID VDVFRSQLKD IGRMPIPISL LVSKDDRALV ASSFIAGERP RVGRLDINDP VIEEAAVKER LRVIDITSIQ ASDGLGHDRY ASLAKFGAQL ASFETGRRST AGDVGAYVFD AAGAAVASPF RLAGRVVGAQ
|
| |