Gene Information |
Locus tag | Rleg_5449 |
Symbol | |
ID | 8016758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | + |
Start bp | 26614 |
End bp | 27714 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644827622 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_002978822 |
Protein GI | 241518194 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
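The Start bp, End bp, Gene Length and Protein Length fields above are linked by simple arithmetic. The following is a minimal sketch of that check. It assumes 1-based inclusive coordinates and that the gene length counts the stop codon; both are inferred from the numbers in this record rather than stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(26614, 27714)
print(length)                  # 1101
print(protein_length(length))  # 366
```

Both values agree with the Gene Length (1101 bp) and Protein Length (366 aa) fields of this record.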
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0114113 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0272252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTCAC CTGGGATTGC CAGCTGCCGT CTTGCCTTCA CCTTGGCTTT TCTGGCGATT TTCGGAGTGG CGTCTGCACA GGACGACAGC GCCGGAGTGC ACGTCCGTTA CGGAGAAGCA GCGCTCTTCG ATCCCGAAGC TGGCGACGCA ATGGTTGCCG ACGACACGAA CGCGGTGGCT GACCTCAAAG CGGAGTCTCA GGAAATGCCG TCGGCGAGGG TCGCCAATGG CGTCGACGTG GTTCTGTCGA ATTTCTCCGA AGTGGTGAAG ATCGAATTCA GTGACGCTAG GGGTCGCCAC GCATGCACCG GGGTGATGTT ATCACCTGAT GCGGTCCTCA CAGCCGGCCA TTGCGGTTGC GGTCGCGCAT ACGAAGTTAC GATGCAGACC GCCCCTGTTG AAAGGGCCGG CGACACGGCA TTTTCGATCC TGAGGATCGA AGGCGGCCCC TTCCTTTTCC CAGGCTATAG TTGCTCATAT CCCGAAACCA CCGGCGTTGG ACACGACCTG GCTCTGATGC GGATCGTCCC ACCCGCGGCA AAGGAGGGAA ATGTTTTCGA GCTGGATGAT GGCGTAGCGG TAGAGCTGAG CTTTCCAGTC ATTCGATCCG GCGTACAGGT TCTCTCGCAA CAACTGCTGA ATAGCATATT TATTCTGGGG TTCGGGCGAA CCGAAACTGG TGCAGTCGCA AAGAACCTAC AGGGTGCGAA CGTTGGTGTG CTCTCACGCC ACTGTATCGC TGGCCATGTT TTTATGAGCT ACTGCGCGCC CTTCAGGGAG TTTTCGTTGG GACGAAACTC CAATACCCCA GGCATCGCTC CCGATAGTTG TGGCGGAGAC AGCGGAGGTC CGGCTTATCG TATGGACAGC GACCTCATCA TGGACCCGTC CGGCTTGTTC CCGCTGCATT TGAGCAGGCG AACGCTGGTT GGCATCGTCT CCCGCGCAGT TGCAGGAGTG GTTCATCCTT ACCGCGGATA TTGTGGAGGC GGCGGAATCT ACACGACGGT CGGAACGCGG CCAGTTCTCG ACTGGCTGCG CTCTCAGAAG GTCTCGTTCC TCTACGATCC CAACCCCACG TATCGTGCTG CAGGAGGTTG A
|
Protein sequence | MNSPGIASCR LAFTLAFLAI FGVASAQDDS AGVHVRYGEA ALFDPEAGDA MVADDTNAVA DLKAESQEMP SARVANGVDV VLSNFSEVVK IEFSDARGRH ACTGVMLSPD AVLTAGHCGC GRAYEVTMQT APVERAGDTA FSILRIEGGP FLFPGYSCSY PETTGVGHDL ALMRIVPPAA KEGNVFELDD GVAVELSFPV IRSGVQVLSQ QLLNSIFILG FGRTETGAVA KNLQGANVGV LSRHCIAGHV FMSYCAPFRE FSLGRNSNTP GIAPDSCGGD SGGPAYRMDS DLIMDPSGLF PLHLSRRTLV GIVSRAVAGV VHPYRGYCGG GGIYTTVGTR PVLDWLRSQK VSFLYDPNPT YRAAGG
|
| |
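The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, translated and summarised by GC fraction. It assumes the amino-acid assignments of NCBI translation table 11 (which match the standard code and differ only in the permitted start codons), and it does not handle alternative start codons such as GTG or TTG. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ...
# (bases cycle through T, C, A, G with the third position varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons until the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
fragment = "ATGAATTCAC CTGGGATTGC CAGCTGCCGT"
print(translate(fragment))               # MNSPGIASCR
print(round(gc_fraction(fragment), 3))   # 0.533
```

The translated fragment matches the first block of the protein sequence listed above. The GC fraction shown applies to this 30-base fragment only, not to the full gene.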