Gene Rleg_4991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4991 
Symbol 
ID8007582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp374808 
End bp376004 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content64% 
IMG OID644821906 
Producthypothetical protein 
Protein accessionYP_002973166 
Protein GI241113331 
COG category[S] Function unknown 
COG ID[COG5441] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.148886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGA TCTACGTGGT CGGCACGGCC GACACGAAGG GCGAGGAGCT TGCCTATCTT 
GCCGCTTGCA TCGAAGCGGC GGGCGGCGGT GTTGTCCGCG TTGACGTCGG CATAGGCGAG
CCTGCGACCG CCGTCGATGT GAAGGCCGAC GCAGTGGCGG CGTGCCATCC GGACGGGGCT
GGAGCCGTTC TTGCCAGCGG GGACCGCGGA AGTGCGGTCG CGGCGATGGG CATTGCTTTC
GCGCGCTTCC TTGTGGAGCG CCAGGACATC GCCGGCGTCA TCGGTATAGG TGGAGGCGGA
GGCACTTCGA TCATTACCGC AGGCATGCGC CAATTGCCGC TCGGTCTGCC AAAGATCATG
GTATCGACGC TCGCATCCGG CGATGTGGCT CCCTTTGTCG ATGTTTCAGA CATCGTGATG
ATGCCTTCGG TCACGGACAT GGCGGGCCTG AACCGGCTGA GCCGCGTCAT CCTTCACAAC
GCCGCTCAGG CGATCACCGC CATGACCCAC CGCCCGGCTG AGGTGACTGC ATCCAAGCCG
GCCCTCGGGC TTACCATGTT CGGCGTTACC ACACCTGCCG TATCGGCCAT GGTCGAGCGC
CTCCGAGCAG ATTATGATTG CCTGGTCTTC CACGCCACAG GCACGGGCGG GCGGGCGATG
GAGAAGCTTG CCGACAGCGA GCTCATCTCT GGCGTGCTCG ACATCACGAC GACCGAGGTC
TGCGACCTGC TTTTCGGCGG CGTCCTGCCG GCCACCTCGG ACCGTTTCGG CGCCATTGCT
CGCAAAGGCT TGCCCTATAT CGGTTCGGTT GGTGCGCTCG ACATGGTGAA CTTCTGGGCG
CCGGAGACCG TTCCGGAGCG TTATTCCGGT CGGCTGTTTT ACCAGCACAA CCCGAACGTC
ACCTTGATGC GCACGACGCT GGCCGAATGC GCGCAGATTG GTCGCTGGAT CGGCGACAAG
CTCAATCTCT GCCACGGCCC CCTACGCTTC CTCATTCCCG AAAAGGGTGT TTCGGCCCTC
GACATCGAAG GCGGTGCGTT CTTCGATCCG CAAGCCGACG CCGCGCTTTT CGCCGCGCTC
GAGGCGACGG TGAAGCCGAC GGCGTCGCGA CGTATTATTC GCCTGCCGCT CCATATCAAC
GACCCAGATT TCGCCGAGGC CGCCGTCGCG GCCTATCGTG ACATCGCCAA CCCCTGA
 
Protein sequence
MKQIYVVGTA DTKGEELAYL AACIEAAGGG VVRVDVGIGE PATAVDVKAD AVAACHPDGA 
GAVLASGDRG SAVAAMGIAF ARFLVERQDI AGVIGIGGGG GTSIITAGMR QLPLGLPKIM
VSTLASGDVA PFVDVSDIVM MPSVTDMAGL NRLSRVILHN AAQAITAMTH RPAEVTASKP
ALGLTMFGVT TPAVSAMVER LRADYDCLVF HATGTGGRAM EKLADSELIS GVLDITTTEV
CDLLFGGVLP ATSDRFGAIA RKGLPYIGSV GALDMVNFWA PETVPERYSG RLFYQHNPNV
TLMRTTLAEC AQIGRWIGDK LNLCHGPLRF LIPEKGVSAL DIEGGAFFDP QADAALFAAL
EATVKPTASR RIIRLPLHIN DPDFAEAAVA AYRDIANP