Gene Rleg_4726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4726 
Symbol 
ID8007201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp95922 
End bp98111 
Gene Length2190 bp 
Protein Length729 aa 
Translation table11 
GC content64% 
IMG OID644821659 
Productcatalase/peroxidase HPI 
Protein accessionYP_002972919 
Protein GI241113084 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0376] Catalase (peroxidase I) 
TIGRFAM ID[TIGR00198] catalase/peroxidase HPI 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAACC CCACTGACAG CGCAGGCAAA TGTCCTGTGC CACACGGCAA TACGCCTCGC 
AGCAGCCGAT CCAACCGCGA CTGGTGGCCA GACCAGTTGA ACGTGCAGAT TCTTCACCAC
AATTCCGGCC GCGCCGACCC GCTCGGCCAA GCCTTCGACT ATGCGGAGGA GTTCAAGAAG
CTCGATCTCG ACGGCCTGAA GAAGGATCTT CACGCGTTGA TGACGGATTC GCAGGATTGG
TGGCCGGCCG ATTTCGGTCA CTATGGCGGC CTGTTCATCC GCATGGCCTG GCACAGCGCC
GGCACATACC GCATCACCGA TGGCCGCGGC GGCGCCGGGG CTGGCCAGCA GCGTTTTGCG
CCACTCAACA GCTGGCCGGA CAACGTGAAC CTCGACAAGG CTCGCCGCCT GCTGTGGCCT
ATCAAGCAGA AATACGGCAA CCGCATCTCC TGGGCTGACC TGTTGATCCT CACCGGCAAC
GTCGCGCTCG AATCCATGGG TTTCAAGACC TTCGGTTTCG CCGGCGGCCG CGCCGACGTC
TGGGAGCCGG AAGAACTTTA CTGGGGTCCT GAGGGAACCT GGCTGGGTGA CGAACGCTAC
AGTGGCGAAC GCGAACTGGC AGAGCCGCTT GGCGCCGTGC AGATGGGCCT TATCTATGTC
AACCCGGAAG GCCCGAACGG CACCCCGGAT CCGCTGGCAT CCGCCCGCGA CATCCGCGAA
ACCTTCGCCC GTATGGCGAT GAACGACGAA GAAACCGTGG CGCTGATCGC CGGCGGGCAT
ACCTTCGGCA AGACCCATGG CGCTGGCGAT CCGTCGTTTG TCGGTGTCGA CCCGGAAGGC
GGCGAGCTCG AAGCTCAGGG TCTGGGCTGG ACCAGCAAGT TTAACACCGG CGTCGGTCGC
GATGCCATCG GCAGCGGCCT CGAAGTGACC TGGACCCAGA CGCCGACCCA GTGGAGCAAC
TACTTCTTCG AAAACCTGTT CGCTTTCGAA TGGGAGCTGA CCAAGAGCCC GGGCGGCGCG
CATCAGTGGC AGGCCAAGAA CGCTGAAGCC TCCATTCCGG ATGCCTACGA CGCCTCGAAG
AGGCACCTGC CGACCATGCT GACCAGCGAC CTCGCGCTGC GTTTCGATCC TGTTTACGAG
AAGATTTCGC GCCGCTTCCT CGAAAATCCC GACCAGTTCG CCGACGCTTT CGCCCGCGCC
TGGTTCAAGC TGACCCACCG CGACATGGGA CCGAAGGTGC GCTACCTCGG CCCGGAAGTT
CCGGCCGAAG ACCTGATCTG GCAGGATGTG ATCCCGGCCG TCGACCACCC GCTTGTCGAC
GACAAGGACA TTGCCGACCT GAAGGAAAAG GTTCTCGCCA CCGGCCTCAC GGTGCAGGAA
CTCGTCTCGA CCGCCTGGGC GTCGGCCTCG ACCTTCCGCG GCTCCGACAA GCGTGGCGGC
GCCAACGGCG CGCGCATCCG CCTTGCCCCG CAGAAAGACT GGGACGCCAA CCAGCCAGCC
CAGCTCGCCA AAGTCATCGG CGTTCTCGAA GGCATCCAGA GGGACTTCAA CGCAGTTCAG
ACCGGGGCCA AGAAGATCTC GCTCGCCGAC CTGATCGTTC TCGCCGGTGC AGCCGGCGTC
GAGAAGGCGG CGGCAGCGGG CGGCAACGCT GTCAGCGTGC CCTTCACGCC GGGCCGCACG
GATGCTTCCG AAGCCCAGAC CGACGCGCAT TCCTTCGCAG CGCTCGAGCC GCGCATCGAC
GGCTTCCGCA ACTATGTGAA CGGCAAGCGC CATCAGTTCA TGAAGCCGGA AGAAGCGCTC
GTCGACCGCG CCCAGTTGCT GACGCTGACC GGACCGGAAA TGACCGTCCT CGTCGGCGGC
CTGCGTGTGC TGAAGGCCGG CGCGCCCGAG CATGGCGTCT TCACTTCGCG TCCCGAGACG
TTGACGAACG ACTTCTTCGT CAACCTGCTC GACATGGGCA CGCAGTGGGT TCCCCTCGCC
GGCAAGGAAG GCGTCTATGA AGGCCGCGAC CGCAAGACTG GCGCCGCCAA GTGGACCGGC
ACCCGCGTCG ACCTGATCTT CGGCTCGCAC TCGCAGCTTC GCGCCTTTGC CGAAGTCTAC
GGCCAGGCCG ATACCAAGGA GAAGTTCGTG AGGGACTTCG TCGCGGCCTG GACCAAGGTT
ATGAGCGCCG ATCGTTTCGA TCTCGTCTGA
 
Protein sequence
MDNPTDSAGK CPVPHGNTPR SSRSNRDWWP DQLNVQILHH NSGRADPLGQ AFDYAEEFKK 
LDLDGLKKDL HALMTDSQDW WPADFGHYGG LFIRMAWHSA GTYRITDGRG GAGAGQQRFA
PLNSWPDNVN LDKARRLLWP IKQKYGNRIS WADLLILTGN VALESMGFKT FGFAGGRADV
WEPEELYWGP EGTWLGDERY SGERELAEPL GAVQMGLIYV NPEGPNGTPD PLASARDIRE
TFARMAMNDE ETVALIAGGH TFGKTHGAGD PSFVGVDPEG GELEAQGLGW TSKFNTGVGR
DAIGSGLEVT WTQTPTQWSN YFFENLFAFE WELTKSPGGA HQWQAKNAEA SIPDAYDASK
RHLPTMLTSD LALRFDPVYE KISRRFLENP DQFADAFARA WFKLTHRDMG PKVRYLGPEV
PAEDLIWQDV IPAVDHPLVD DKDIADLKEK VLATGLTVQE LVSTAWASAS TFRGSDKRGG
ANGARIRLAP QKDWDANQPA QLAKVIGVLE GIQRDFNAVQ TGAKKISLAD LIVLAGAAGV
EKAAAAGGNA VSVPFTPGRT DASEAQTDAH SFAALEPRID GFRNYVNGKR HQFMKPEEAL
VDRAQLLTLT GPEMTVLVGG LRVLKAGAPE HGVFTSRPET LTNDFFVNLL DMGTQWVPLA
GKEGVYEGRD RKTGAAKWTG TRVDLIFGSH SQLRAFAEVY GQADTKEKFV RDFVAAWTKV
MSADRFDLV