Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4726 |
Symbol | |
ID | 8007201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 95922 |
End bp | 98111 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644821659 |
Product | catalase/peroxidase HPI |
Protein accession | YP_002972919 |
Protein GI | 241113084 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0376] Catalase (peroxidase I) |
TIGRFAM ID | [TIGR00198] catalase/peroxidase HPI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAACC CCACTGACAG CGCAGGCAAA TGTCCTGTGC CACACGGCAA TACGCCTCGC AGCAGCCGAT CCAACCGCGA CTGGTGGCCA GACCAGTTGA ACGTGCAGAT TCTTCACCAC AATTCCGGCC GCGCCGACCC GCTCGGCCAA GCCTTCGACT ATGCGGAGGA GTTCAAGAAG CTCGATCTCG ACGGCCTGAA GAAGGATCTT CACGCGTTGA TGACGGATTC GCAGGATTGG TGGCCGGCCG ATTTCGGTCA CTATGGCGGC CTGTTCATCC GCATGGCCTG GCACAGCGCC GGCACATACC GCATCACCGA TGGCCGCGGC GGCGCCGGGG CTGGCCAGCA GCGTTTTGCG CCACTCAACA GCTGGCCGGA CAACGTGAAC CTCGACAAGG CTCGCCGCCT GCTGTGGCCT ATCAAGCAGA AATACGGCAA CCGCATCTCC TGGGCTGACC TGTTGATCCT CACCGGCAAC GTCGCGCTCG AATCCATGGG TTTCAAGACC TTCGGTTTCG CCGGCGGCCG CGCCGACGTC TGGGAGCCGG AAGAACTTTA CTGGGGTCCT GAGGGAACCT GGCTGGGTGA CGAACGCTAC AGTGGCGAAC GCGAACTGGC AGAGCCGCTT GGCGCCGTGC AGATGGGCCT TATCTATGTC AACCCGGAAG GCCCGAACGG CACCCCGGAT CCGCTGGCAT CCGCCCGCGA CATCCGCGAA ACCTTCGCCC GTATGGCGAT GAACGACGAA GAAACCGTGG CGCTGATCGC CGGCGGGCAT ACCTTCGGCA AGACCCATGG CGCTGGCGAT CCGTCGTTTG TCGGTGTCGA CCCGGAAGGC GGCGAGCTCG AAGCTCAGGG TCTGGGCTGG ACCAGCAAGT TTAACACCGG CGTCGGTCGC GATGCCATCG GCAGCGGCCT CGAAGTGACC TGGACCCAGA CGCCGACCCA GTGGAGCAAC TACTTCTTCG AAAACCTGTT CGCTTTCGAA TGGGAGCTGA CCAAGAGCCC GGGCGGCGCG CATCAGTGGC AGGCCAAGAA CGCTGAAGCC TCCATTCCGG ATGCCTACGA CGCCTCGAAG AGGCACCTGC CGACCATGCT GACCAGCGAC CTCGCGCTGC GTTTCGATCC TGTTTACGAG AAGATTTCGC GCCGCTTCCT CGAAAATCCC GACCAGTTCG CCGACGCTTT CGCCCGCGCC TGGTTCAAGC TGACCCACCG CGACATGGGA CCGAAGGTGC GCTACCTCGG CCCGGAAGTT CCGGCCGAAG ACCTGATCTG GCAGGATGTG ATCCCGGCCG TCGACCACCC GCTTGTCGAC GACAAGGACA TTGCCGACCT GAAGGAAAAG GTTCTCGCCA CCGGCCTCAC GGTGCAGGAA CTCGTCTCGA CCGCCTGGGC GTCGGCCTCG ACCTTCCGCG GCTCCGACAA GCGTGGCGGC GCCAACGGCG CGCGCATCCG CCTTGCCCCG CAGAAAGACT GGGACGCCAA CCAGCCAGCC CAGCTCGCCA AAGTCATCGG CGTTCTCGAA GGCATCCAGA GGGACTTCAA CGCAGTTCAG ACCGGGGCCA AGAAGATCTC GCTCGCCGAC CTGATCGTTC TCGCCGGTGC AGCCGGCGTC GAGAAGGCGG CGGCAGCGGG CGGCAACGCT GTCAGCGTGC CCTTCACGCC GGGCCGCACG GATGCTTCCG AAGCCCAGAC CGACGCGCAT TCCTTCGCAG CGCTCGAGCC GCGCATCGAC GGCTTCCGCA ACTATGTGAA CGGCAAGCGC CATCAGTTCA TGAAGCCGGA AGAAGCGCTC GTCGACCGCG CCCAGTTGCT GACGCTGACC GGACCGGAAA TGACCGTCCT CGTCGGCGGC CTGCGTGTGC TGAAGGCCGG CGCGCCCGAG CATGGCGTCT TCACTTCGCG TCCCGAGACG TTGACGAACG ACTTCTTCGT CAACCTGCTC GACATGGGCA CGCAGTGGGT TCCCCTCGCC GGCAAGGAAG GCGTCTATGA AGGCCGCGAC CGCAAGACTG GCGCCGCCAA GTGGACCGGC ACCCGCGTCG ACCTGATCTT CGGCTCGCAC TCGCAGCTTC GCGCCTTTGC CGAAGTCTAC GGCCAGGCCG ATACCAAGGA GAAGTTCGTG AGGGACTTCG TCGCGGCCTG GACCAAGGTT ATGAGCGCCG ATCGTTTCGA TCTCGTCTGA
|
Protein sequence | MDNPTDSAGK CPVPHGNTPR SSRSNRDWWP DQLNVQILHH NSGRADPLGQ AFDYAEEFKK LDLDGLKKDL HALMTDSQDW WPADFGHYGG LFIRMAWHSA GTYRITDGRG GAGAGQQRFA PLNSWPDNVN LDKARRLLWP IKQKYGNRIS WADLLILTGN VALESMGFKT FGFAGGRADV WEPEELYWGP EGTWLGDERY SGERELAEPL GAVQMGLIYV NPEGPNGTPD PLASARDIRE TFARMAMNDE ETVALIAGGH TFGKTHGAGD PSFVGVDPEG GELEAQGLGW TSKFNTGVGR DAIGSGLEVT WTQTPTQWSN YFFENLFAFE WELTKSPGGA HQWQAKNAEA SIPDAYDASK RHLPTMLTSD LALRFDPVYE KISRRFLENP DQFADAFARA WFKLTHRDMG PKVRYLGPEV PAEDLIWQDV IPAVDHPLVD DKDIADLKEK VLATGLTVQE LVSTAWASAS TFRGSDKRGG ANGARIRLAP QKDWDANQPA QLAKVIGVLE GIQRDFNAVQ TGAKKISLAD LIVLAGAAGV EKAAAAGGNA VSVPFTPGRT DASEAQTDAH SFAALEPRID GFRNYVNGKR HQFMKPEEAL VDRAQLLTLT GPEMTVLVGG LRVLKAGAPE HGVFTSRPET LTNDFFVNLL DMGTQWVPLA GKEGVYEGRD RKTGAAKWTG TRVDLIFGSH SQLRAFAEVY GQADTKEKFV RDFVAAWTKV MSADRFDLV
|
| |