Gene Rleg_0115 details


Gene Information

Locus tag: Rleg_0115
Symbol:
ID: 8011353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 107377
End bp: 108654
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 65%
IMG OID: 644822706
Product: protein of unknown function DUF900 hydrolase family protein
Protein accession: YP_002973965
Protein GI: 241202869
COG category: [S] Function unknown
COG ID: [COG4782] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
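
The coordinate and length fields in this record are related by simple arithmetic, which the sketch below checks. It assumes that the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is an untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Rleg_0115.
length_bp = gene_length(107377, 108654)
print(length_bp)                  # 1278
print(protein_length(length_bp))  # 425
```

Both results agree with the Gene Length and Protein Length fields listed in the record.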


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.958101
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGGCA AAAGAGGCGG CTTGCCCGCG GCTCTGACAG GTCTTCTGAT TGCAACGATG 
GCGCTCGCCG GCTGCGGCGG CAGGCCGGTC GGTGTCATGC AGGCGGCCGG CACCGCGGCC
CCCGGCACCT CCAAGGTCGA CCTGCTCGTC GCGACGACGC GCGCTGCCGA CGACAATCCC
GCCGTGCTTT TCTCCGGCGA ACGCGGCACC GGGCTTGCCG TCAATGCCGT CGACGTCTCC
ATTCCGCCGG AAGCCAATCG CAAGGTCGGC CAGGTGCAAT GGCCAAGCCG CCTGCCGGCC
GATCCGCTGC GCGATTTCGT CACAGTTTCT GTCGATCCGC TGGAAGGCGA GCGGGCCGGC
GAGACGTGGC TGAAGTCCCA TATGCCGAAG AGCCGCCGCG TACTGGTCTT CGTCCACGGC
TTCAACAATC GTTATGAGGA TGCCGTCTAC CGCTTCGCGC AGATCGTCCA CGATTCGCAT
GCCGACGTTG CGCCCGTCGT CTTCACCTGG CCTTCGCGCG GCAGCATCTT CGATTATAAT
TACGACAAGG AAAGCACCAA CTATTCCCGC GACGCGCTGG AGGAATTGTT GACCCGCACC
GCCGCCAATC CCGCCGTTAG CGACGTCACC ATCATGGCCC ATTCGATGGG CACCTGGCTC
ACCGTCGAAG CGCTGCGGCA GATGGCGATC CGCAACGGTC ATGTCGCCTC GAAGATCAAC
AATGTCATCC TCGCTTCGCC GGATCTCGAT GTCGACGTTT TCGGCCGCCA GTTCGCCAGC
CTCGGCAAGG AAAGGCCGCA CTTCACCATC TTCGTCTCGC AGGACGATCG CGCTTTGGCG
CTGTCGCGGC GCATCTCCGG CAATGTCGAC CGGCTCGGCC AGATCGATCC TTCCGTCGAA
CCCTATCGCA GCAAGCTCGA AGCGGCCGGC ATCACCGTGC TCGACCTCAC CAAGCTCAAG
GGCGGCGACC GGCTGAACCA CGGCAAATTC GCCGAAAGCC CCGAAGTGGT GAAGCTGATC
GGCGACCGGC TGATTGCCGG CCAGACGATC ACCGATTCCA ATGTCGGCCT CGGCGAGGCC
GTCGGCGCCG TGGCGATGGG CGCTGCCCAG ACCGCCGGAA GTGCCGTCAG CGTCGCCGTC
AGCACGCCGA TTGCGATCTT CGATCCGCGC ACCCGGCGCA ACTACGATGC CCAGCTGAAA
CGTCTCGGCC AGTCGATGAA CAATACCGTC GGTTCGGTCG GCGACAGCGT CGGCGCCGGC
CTGCCGGCAA GCCAGTAA
 
Protein sequence
MIGKRGGLPA ALTGLLIATM ALAGCGGRPV GVMQAAGTAA PGTSKVDLLV ATTRAADDNP 
AVLFSGERGT GLAVNAVDVS IPPEANRKVG QVQWPSRLPA DPLRDFVTVS VDPLEGERAG
ETWLKSHMPK SRRVLVFVHG FNNRYEDAVY RFAQIVHDSH ADVAPVVFTW PSRGSIFDYN
YDKESTNYSR DALEELLTRT AANPAVSDVT IMAHSMGTWL TVEALRQMAI RNGHVASKIN
NVILASPDLD VDVFGRQFAS LGKERPHFTI FVSQDDRALA LSRRISGNVD RLGQIDPSVE
PYRSKLEAAG ITVLDLTKLK GGDRLNHGKF AESPEVVKLI GDRLIAGQTI TDSNVGLGEA
VGAVAMGAAQ TAGSAVSVAV STPIAIFDPR TRRNYDAQLK RLGQSMNNTV GSVGDSVGAG
LPASQ
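
The gene and protein sequences above can be cross-checked against each other with a short script. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which start codons are permitted), and it strips the spaces and line breaks used in the listing before processing. The names `CODON_TABLE`, `clean`, `translate` and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed here to match table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 21 bases of the gene sequence listed above.
print(translate("ATGATCGGCA AAAGAGGCGG C"))  # MIGKRGG
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CTGCCGGCAA GCCAGTAA"))      # LPASQ
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow comparison with the protein sequence and the GC content field of the record.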