Gene Rleg_5144 details


Gene Information

Locus tag: Rleg_5144
Symbol:
ID: 8006971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 545992
End bp: 546987
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 62%
IMG OID: 644822057
Product: hypothetical protein
Protein accession: YP_002973317
Protein GI: 241113482
COG category: [C] Energy production and conversion
COG ID: [COG4313] Protein involved in meta-pathway of phenol degradation
TIGRFAM ID:
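
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed 996 bp) and that the final codon is a stop codon that contributes no residue.

```python
# Values copied from the Gene Information fields above.
start_bp = 545992
end_bp = 546987

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Three bases per codon; the last codon is a stop and adds no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 996 0 331
```

A zero remainder confirms the gene length is a whole number of codons, and 332 codons minus the stop gives the listed 331 aa.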


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.62916
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATACGCA AGCTGCTTGT CTTCTCAGTC GCCTCCACCT TTCTCGCAGG CGCAGCTGTG 
ACGCTGCTGC CTCCGCTCGC CTTCGCCGCC GAGGGCGGAG CGGGTTTTTA TCTTCTCGGA
TCGAAGGGCC CTGCCGCGGC CATCACGCCG CCGCCGGGCG TTTTCTTCTC GAACGATATC
TACCTCTATA CGGGAGATCT CGGCGGCGGA AAAGTCCTAC CGACCGGCGG ACGTCTGGCC
GTCGGCGTGG AGGGCAAGGC AGTCATCGAG GTGCCTACCG TGCTCTGGAT TCTTCCGGAA
GATGTTGCGG GAGGTCACCT TGGGCTGTCG CTGACCGTCC CTGTCGGCTG GAAGAACACG
GACGCCGATG TGACATTGGC GGGACCGCGA GGTGGGACAG CGTCCGGATC GATCTCCGAT
CCGATCTTCA CCGTTGGCGA TCCTGTGCTT GGCGCCCTGC TGGGCTGGGA GGCCGGCAAC
TTCCATTGGC AGACCGCCCT TCTGGTCAAC GTTCCGATCG GTGACTATCA AGACGGCGAG
ATATCGAACA TAGCCTTCCA CCATTGGGGC GCGGACATTT CGGCCGGCGT GACCTGGCTC
GATCCGGCAA TCGGGCTTGA CCTGTCGGCG GTGGTCGGCA TGACGTTCAA TGCCGAGAAC
CCTGCCACCG ACTACCGCAC GGGCAACGAG TTCCATGTCG AATGGTCAGC CGTCCAGCAC
TTCAACGAGC AGTTCGACGC CGGTCTCGTT GGGTACTATT ACGATCAGGT GAGCGGCGAC
AGCGGCGCCG GCGCGTCCAG CGATTTCAAG GGGCGCGTCG CGGCCATCGG CGCGACCATC
GGCTGGACGT TCAAAGCGGG CGAAGTGCCG ATTTCGACCC GCATCAAGTA TTTCCACGAG
TTCGCCGCCG AAAACCGTGC CGAGGGAGAC GCCGTCTATC TGACGGTCTC GATGCCCTTG
TCGATCACCA AGCCGATGAA CATAGCGGCG CAATAG
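
The GC content reported in the Gene Information fields can in principle be recomputed from the gene sequence. The sketch below shows one way to do that; the function name `gc_content` is a hypothetical helper, not part of any tool behind this record, and it simply strips the spaces and line breaks of the 10-base block layout before counting.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop spaces/newlines between blocks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, kept in its 10-base blocks.
first_line = "ATGATACGCA AGCTGCTTGT CTTCTCAGTC GCCTCCACCT TTCTCGCAGG CGCAGCTGTG"
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence instead of a single line gives a figure to compare against the listed value, which is presumably rounded to a whole percent.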
 
Protein sequence
MIRKLLVFSV ASTFLAGAAV TLLPPLAFAA EGGAGFYLLG SKGPAAAITP PPGVFFSNDI 
YLYTGDLGGG KVLPTGGRLA VGVEGKAVIE VPTVLWILPE DVAGGHLGLS LTVPVGWKNT
DADVTLAGPR GGTASGSISD PIFTVGDPVL GALLGWEAGN FHWQTALLVN VPIGDYQDGE
ISNIAFHHWG ADISAGVTWL DPAIGLDLSA VVGMTFNAEN PATDYRTGNE FHVEWSAVQH
FNEQFDAGLV GYYYDQVSGD SGAGASSDFK GRVAAIGATI GWTFKAGEVP ISTRIKYFHE
FAAENRAEGD AVYLTVSMPL SITKPMNIAA Q
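
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the pipeline that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which start codons are permitted; because this gene starts with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATACGCA AGCTGCTTGT CTTCTCAGTC"))  # MIRKLLVFSV
# Last three codons of the gene sequence above.
print(translate("GCG CAA TAG"))  # AQ
```

Both outputs match the corresponding ends of the protein sequence listed above: it begins with MIRKLLVFSV and ends with AQ.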