Gene Rleg_3163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3163 
Symbol 
ID8015769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3162534 
End bp3163649 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content63% 
IMG OID644825729 
ProductHEAT domain containing protein 
Protein accessionYP_002976957 
Protein GI241205861 
COG category[L] Replication, recombination and repair 
COG ID[COG4335] DNA alkylation repair enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.729871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.393425 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAAC CGCTCAAGAA CCTGCTGCAT GAGGCATTGG TCGGCGATAT GGCCGATCGC 
ATTGCCGGCA ACGCGCCAAC TTTCGATAAA AAACGCTTCG TGATGCTGGC GACCGATGGC
CTCGGAGCGC TGGAGCTGAT GGAGCGCTCG GCGCTTATCC GCGACGCGCT GTTTGCCACG
CTTCCCGGTG ATTTCCGGGA GGCCGCAGCC ATTTTCAAGG CCAGCCTGCC CACTGCCGGG
AGTCCGGGGC TCTCCGGCTG GATGCTGCTG CCGATCAATC AGTTCATCGC CGCACGCGGC
CTTGATCATT TCGATCTCGG GCTCGAGCTC CTGAAGGCGC TGACGCCGCA TTTCACCGCC
GAATTCGGTA TCCGCCCCTT CATCCACCGC GACCAGCAGC GCGCGCTGGC CATCATTTCC
GGCTGGGTCG ACGATCCCGA CCAGCATGTA CGCCGGCTGG CGAGCGAGGG AACGCGGCCG
CGCCTGCCCT GGGCGATGCG CCTGCCGCAG CTCGTCAAGG ATCCGGCTCC GATCCTGCCC
ATCCTGACCG CGCTGATGGA TGATCCGGAG GATTATGTGC GCCGCTCCGT GGCCAACAGC
CTGAACGACG TCGCCAAGGA CCACCCGGAT CTGGTCGCCG CGTTCATCGC CAGTCATATC
GAAGGCGCTT CGCCCGAACG CCGCTGGCTG CTGAAACATG CCTCACGCAC GCTGATGAAG
AACGGCCACG CGCAGGCGCT CGCCAATTTC GGCTTCGCGG CCAGCGATTC GCTCGAATGC
GAACTGCGGC TCGTAAACGG CGAGGTGATG TTCGGCGAAG GGCTGGATTT CGAAATCCGG
GTGACGAATG CAGGCGAGCG AGCGCAGTCG CTGATGATCG ACTACGCCGT TCACCATGTG
AAGAGCGACG GTTCGCTCTC ACCCAAGGTG TTCAAGTGCA AGGCGATCTT GCTCGCTCCG
GGGCAAAGCC ATACAATTGA GCGTCGTCAC GCCATGCGGC CAATCACGAC GCGGCGCTAT
TATCCAGGCG AACATCGCAT CGCCATCCTC GTCAACGGCG CAGAGACAGC ATCGCAAAGC
TTCGTCCTCG TCATGCCCTC ACCTGACCAA GGCTGA
 
Protein sequence
MPEPLKNLLH EALVGDMADR IAGNAPTFDK KRFVMLATDG LGALELMERS ALIRDALFAT 
LPGDFREAAA IFKASLPTAG SPGLSGWMLL PINQFIAARG LDHFDLGLEL LKALTPHFTA
EFGIRPFIHR DQQRALAIIS GWVDDPDQHV RRLASEGTRP RLPWAMRLPQ LVKDPAPILP
ILTALMDDPE DYVRRSVANS LNDVAKDHPD LVAAFIASHI EGASPERRWL LKHASRTLMK
NGHAQALANF GFAASDSLEC ELRLVNGEVM FGEGLDFEIR VTNAGERAQS LMIDYAVHHV
KSDGSLSPKV FKCKAILLAP GQSHTIERRH AMRPITTRRY YPGEHRIAIL VNGAETASQS
FVLVMPSPDQ G