Gene Rleg_5143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5143 
Symbol 
ID8007003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp544747 
End bp545982 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content64% 
IMG OID644822056 
Productprotein of unknown function DUF900 hydrolase family protein 
Protein accessionYP_002973316 
Protein GI241113481 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGGG CGACATCGCT TCGACTGATC GCGCTGCTGG TTCTCCTGGC GCCGCTTTGC 
GCCTGCGGAC ATCCACGCGG CGTCATGCAG CCTGTGGCGC TGACTGCGGC CACGCCCGGA
ACCGCGCAGG TCGACATGCT CGTCGCAACG ACCCGACAGC CGTCGGGCGA TCCGGCGACG
TTGTTCAACG GCGAGCGCAG CCCAAAGCCC TCCATGACCG ACGTTGCGGT TTCCATTCCG
CCGAAGCGCG AGGCGGGCAC CGTCCAGTGG CCGCAGCGAC TGCCGCCCAA TCCTGCCACG
GACTTTGCCG TGACACGGGT GAAGCAGATC GACACCATTC CGGAAGGCCG GGCATGGTTC
CGCCAGCATA TTCAGGGCGG GCATGCGTTG GTTTTCATCC ACGGGTTTAA CAACACATAC
GAGGACTCGG TCTTCCGCCT CGCCCAGATC GTCCACGACA GCGGCATGCA GGCGACGCCG
ATCCTCTTCA CCTGGCCGTC ACGCGCGCAG CTCACCGGAT ACGAATACGA CAAGGAAAGC
ACGAACTATT CGCGCACGGC GCTGGAGCAG GCGCTGCGGG TCCTCGCCGC CGATCCTGAT
GTGAAGGACA TCACCATCCT CGCGCATTCC ATGGGAACGT GGCTGGCGAT GGAATCGCTG
CGGCAGATGG GCATCCGCGA CGGTCACGTC AACGCCAAGA TACACAACGT CATCCTCGCC
TCGCCCGACA TCGACATCCA GGTGTTCGCC AAGCAGTTCG TCGAGATGGG AGACCCGAAA
CCGAAGTTCA CCATCTTCGT GTCCCAGGAC GACCGGGCGC TCGCGGCATC GAGCTTCATC
ACCGGCAACG TGTCGCGGCT CGGTGCCATA GACCCGTCGA AGGAGCCCTA TCGATCCAGG
CTGGAAAAGG CGGGCATCAC CGCGATCGAC CTCACGAAGG TGAAGGCCGG CGACAGCCTC
CATCATGGCA AGTTCGCCGA AAGTCCCGAC ATCGTCCAGC TCATCGGCCA GCGTCTGATG
ACCGGGCAAA CGCTGACGGA TTCCAACATT TCTCTCGGAC AGGGCGTCGC CGCCGTCGTG
GGCGGGACAG CGCGCACCGT CGGCACAGTC GCAGGCGCTG CAGTTGCAGC ACCACTTGTG
ATCATCGAGC AGCCGGCAAG AAAGCGGCAG CCGACAGGAA CGGAGCTGGA AGACGGCCTG
CACAACGACC GCCAGTCGAA GCCCCTGACG CAATAG
 
Protein sequence
MPRATSLRLI ALLVLLAPLC ACGHPRGVMQ PVALTAATPG TAQVDMLVAT TRQPSGDPAT 
LFNGERSPKP SMTDVAVSIP PKREAGTVQW PQRLPPNPAT DFAVTRVKQI DTIPEGRAWF
RQHIQGGHAL VFIHGFNNTY EDSVFRLAQI VHDSGMQATP ILFTWPSRAQ LTGYEYDKES
TNYSRTALEQ ALRVLAADPD VKDITILAHS MGTWLAMESL RQMGIRDGHV NAKIHNVILA
SPDIDIQVFA KQFVEMGDPK PKFTIFVSQD DRALAASSFI TGNVSRLGAI DPSKEPYRSR
LEKAGITAID LTKVKAGDSL HHGKFAESPD IVQLIGQRLM TGQTLTDSNI SLGQGVAAVV
GGTARTVGTV AGAAVAAPLV IIEQPARKRQ PTGTELEDGL HNDRQSKPLT Q