Gene Rleg2_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1044 
Symbol 
ID6979763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1062629 
End bp1064029 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content67% 
IMG OID643395756 
ProductDNA repair protein RadA 
Protein accessionYP_002280564 
Protein GI209548647 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1066] Predicted ATP-dependent serine protease 
TIGRFAM ID[TIGR00416] DNA repair protein RadA 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGG CCAGGACACA ATTCATCTGC CAGAATTGCG GCACGGTTCA TAACCGCTGG 
GCGGGCAAAT GCGAGAACTG CGGCGAGTGG AACACCATCG TCGAAGAAGA TCCGATGGGC
GGGATCGGTT CCGGTCCCGG CAAGACGCCG AAGAAGGGTC GGCCGGTGGC GCTGACGGCG
CTGTCGGGCG AGATCGAGGA GGCGCCGCGC ATCCATACCG CCATGTCGGA GCTTGACCGG
GCGCTCGGCG GCGGCTTCGT GCGTGGGTCG GCGGTGCTGA TCGGCGGCGA TCCCGGTATC
GGCAAATCGA CGCTGCTGAT GCAAGCGGCG GCCGCCCTTG CGCGGCGCGG CCACAAGATC
ATCTACGTCT CCGGCGAAGA AGCCGTCGCC CAGGTCCGGC TGCGGGCGCA GCGCCTTGCG
GCGGCCGATA CCGACGTGAT GCTGGCGGCG GAAACCAATG TCGAGGATAT TCTGGCGACG
CTTGCCGAGG GCAAGCGGCC GGACCTCGTC ATCATCGATT CCATCCAGAC GCTGTGGAGC
GAACTTGCCG AATCCGCCCC GGGAACGGTG ACGCAGGTGC GCACCGGCGT GCAGGCGATG
ATCCGTTTCG CCAAGCAGAC GGGGGCCGCC ATGGTGCTCG TCGGGCATGT GACCAAGGAC
GGGCAGATCG CCGGCCCGCG CGTCGTCGAG CACATGGTCG ATGCCGTGCT CTATTTCGAA
GGCGATCGCG GCCATCACTA CCGCATCCTG CGCACGGTCA AGAACCGCTT CGGCCCGACC
GACGAGATCG GCGTCTTCGA AATGTCGGAC AAGGGACTCC GCGAGGTCGC CAACCCCTCG
GAGCTCTTCC TCGGCGAGCG CAACGAAAAA TCGCCGGGTG CAGCCGTCTT CGCCGGCATG
GAGGGCACGC GCCCGGTGCT GGTCGAAGTC CAGGCGCTGG TGGCGCCGAC CTCGCTCGGC
ACGCCCAGGC GCGCCGTGGT CGGCTGGGAT TCGGCCCGGC TGTCGATGAT CCTGGCGGTG
CTGGAGGCCC ATTGCGGCGT CAGGCTCGGC CAGCACGACG TCTATCTCAA TATCGCCGGC
GGCTACCGCA TCACCGAACC GGCCGCCGAT CTCGCCGTCG CCTCGGCGCT CGTTTCCTCG
CTTGCCGGTA TTGCCCTTCC CGCCGATTGC GTCTATTTCG GCGAAGTCAG CCTGTCGGGC
GCCATCCGGC CGGTTGCGCA CACCGCCCAG CGCCTCAAGG AAGCCGAGAA GCTGGGCTTT
TCCGCGGCAT TGCTTCCGTC CGCCTCTGCC GAACTGCCGA AGGGTTCCGG CGGGCGCTGG
AGCGAGGTCG AAAGCCTGCC GGATCTGGTT GCGCGCATCG CCGGGTCGAA GGGGGCGCTG
CGTGTGGAAG ACGAGGTTTG A
 
Protein sequence
MAKARTQFIC QNCGTVHNRW AGKCENCGEW NTIVEEDPMG GIGSGPGKTP KKGRPVALTA 
LSGEIEEAPR IHTAMSELDR ALGGGFVRGS AVLIGGDPGI GKSTLLMQAA AALARRGHKI
IYVSGEEAVA QVRLRAQRLA AADTDVMLAA ETNVEDILAT LAEGKRPDLV IIDSIQTLWS
ELAESAPGTV TQVRTGVQAM IRFAKQTGAA MVLVGHVTKD GQIAGPRVVE HMVDAVLYFE
GDRGHHYRIL RTVKNRFGPT DEIGVFEMSD KGLREVANPS ELFLGERNEK SPGAAVFAGM
EGTRPVLVEV QALVAPTSLG TPRRAVVGWD SARLSMILAV LEAHCGVRLG QHDVYLNIAG
GYRITEPAAD LAVASALVSS LAGIALPADC VYFGEVSLSG AIRPVAHTAQ RLKEAEKLGF
SAALLPSASA ELPKGSGGRW SEVESLPDLV ARIAGSKGAL RVEDEV