Gene Rleg_1561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1561 
Symbol 
ID8012639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1540852 
End bp1541820 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content60% 
IMG OID644824147 
Productalpha/beta hydrolase fold protein 
Protein accessionYP_002975389 
Protein GI241204293 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.757336 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0190305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCCG GGTCAACTTT CCCACTCACA AGACGAGACG CATTGCTCGC AGGATCGGCC 
GCCGCCGTGG CGCTGGCAGT CGGTCCGTCG GCATCAGCCG CCACCAAACA GCAACAATCA
ACAATGGAGA ACAAGTTCAT GGGTACTGTG AAGGTGAAAG ACGGAACCGA AATCTTCTAC
AAGGACTGGG GTCCGAAGGA CGGGCAGCCG ATCGTATTCC ACCACGGCTG GCCGCTCAGC
GGCGACGACT GGGACAACCA GATGCTCTTC TTTCTCGGAG AAGGTTTCCG GGTCATCGCA
CATGATCGCC GAGGGCACGG CCGCTCGTCG CAGAGCCGGA CGGGTCACGA GATGGACACC
TACGCGGCTG ACGTTGCAGA GGTGGCGGAA GCGCTCGATC TCAAGAATGC GGTCCACATC
GGCCACTCGA CAGGAGGCGG CGAAGTAGTC CATTACGTCG CGCGGTCAAA ACCGGGCAGG
GTTGCCAAGG CGGTCATCGC CGGCGCAATT CCGCCGGTCA TGTTGAAGTC TGACAAGAAC
CCCGGCGGCC TGCCGATCGA CGTGTTCGAC GGCTTGCGCA AGGCGCTGGC CGCCAACCGC
GCACAGTTCT ACATCGACGT TCCGACCGGC CCCTTCTACG GCTTCAATCG TCCGGACGCC
AAGATCTCCC AGGGCCTGAT CGACAACTGG TGGCGGCAGG GCATGATGGG CGCTGCAAAT
GCCCACTACG AGTGCATCAA GGCGTTCTCG GAGACGGACT TTACCGAAGA TCTGAAGAAG
ATCGAGGTTC CGGTCTACGT CATTCACGGA ACCGACGATC AGATCGTGCC CTACAAGGAC
GCGGCGGAGC TTTCAGTCAA ACTCCTGAAG CACGGCACCT TGAAGCTTTA CGATGGATAT
CCGCATGGGA TGCTGTCGAC GCACCCCGAA GTGCTGAATT CGGACATCTT GGCGTTCATC
AAAGCCTGA
 
Protein sequence
MKSGSTFPLT RRDALLAGSA AAVALAVGPS ASAATKQQQS TMENKFMGTV KVKDGTEIFY 
KDWGPKDGQP IVFHHGWPLS GDDWDNQMLF FLGEGFRVIA HDRRGHGRSS QSRTGHEMDT
YAADVAEVAE ALDLKNAVHI GHSTGGGEVV HYVARSKPGR VAKAVIAGAI PPVMLKSDKN
PGGLPIDVFD GLRKALAANR AQFYIDVPTG PFYGFNRPDA KISQGLIDNW WRQGMMGAAN
AHYECIKAFS ETDFTEDLKK IEVPVYVIHG TDDQIVPYKD AAELSVKLLK HGTLKLYDGY
PHGMLSTHPE VLNSDILAFI KA