Gene Rleg_5449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5449 
Symbol 
ID8016758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp26614 
End bp27714 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content58% 
IMG OID644827622 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_002978822 
Protein GI241518194 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0114113 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0272252 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCAC CTGGGATTGC CAGCTGCCGT CTTGCCTTCA CCTTGGCTTT TCTGGCGATT 
TTCGGAGTGG CGTCTGCACA GGACGACAGC GCCGGAGTGC ACGTCCGTTA CGGAGAAGCA
GCGCTCTTCG ATCCCGAAGC TGGCGACGCA ATGGTTGCCG ACGACACGAA CGCGGTGGCT
GACCTCAAAG CGGAGTCTCA GGAAATGCCG TCGGCGAGGG TCGCCAATGG CGTCGACGTG
GTTCTGTCGA ATTTCTCCGA AGTGGTGAAG ATCGAATTCA GTGACGCTAG GGGTCGCCAC
GCATGCACCG GGGTGATGTT ATCACCTGAT GCGGTCCTCA CAGCCGGCCA TTGCGGTTGC
GGTCGCGCAT ACGAAGTTAC GATGCAGACC GCCCCTGTTG AAAGGGCCGG CGACACGGCA
TTTTCGATCC TGAGGATCGA AGGCGGCCCC TTCCTTTTCC CAGGCTATAG TTGCTCATAT
CCCGAAACCA CCGGCGTTGG ACACGACCTG GCTCTGATGC GGATCGTCCC ACCCGCGGCA
AAGGAGGGAA ATGTTTTCGA GCTGGATGAT GGCGTAGCGG TAGAGCTGAG CTTTCCAGTC
ATTCGATCCG GCGTACAGGT TCTCTCGCAA CAACTGCTGA ATAGCATATT TATTCTGGGG
TTCGGGCGAA CCGAAACTGG TGCAGTCGCA AAGAACCTAC AGGGTGCGAA CGTTGGTGTG
CTCTCACGCC ACTGTATCGC TGGCCATGTT TTTATGAGCT ACTGCGCGCC CTTCAGGGAG
TTTTCGTTGG GACGAAACTC CAATACCCCA GGCATCGCTC CCGATAGTTG TGGCGGAGAC
AGCGGAGGTC CGGCTTATCG TATGGACAGC GACCTCATCA TGGACCCGTC CGGCTTGTTC
CCGCTGCATT TGAGCAGGCG AACGCTGGTT GGCATCGTCT CCCGCGCAGT TGCAGGAGTG
GTTCATCCTT ACCGCGGATA TTGTGGAGGC GGCGGAATCT ACACGACGGT CGGAACGCGG
CCAGTTCTCG ACTGGCTGCG CTCTCAGAAG GTCTCGTTCC TCTACGATCC CAACCCCACG
TATCGTGCTG CAGGAGGTTG A
 
Protein sequence
MNSPGIASCR LAFTLAFLAI FGVASAQDDS AGVHVRYGEA ALFDPEAGDA MVADDTNAVA 
DLKAESQEMP SARVANGVDV VLSNFSEVVK IEFSDARGRH ACTGVMLSPD AVLTAGHCGC
GRAYEVTMQT APVERAGDTA FSILRIEGGP FLFPGYSCSY PETTGVGHDL ALMRIVPPAA
KEGNVFELDD GVAVELSFPV IRSGVQVLSQ QLLNSIFILG FGRTETGAVA KNLQGANVGV
LSRHCIAGHV FMSYCAPFRE FSLGRNSNTP GIAPDSCGGD SGGPAYRMDS DLIMDPSGLF
PLHLSRRTLV GIVSRAVAGV VHPYRGYCGG GGIYTTVGTR PVLDWLRSQK VSFLYDPNPT
YRAAGG