Gene Rleg2_5104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5104 
Symbol 
ID6978198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp744231 
End bp745532 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content61% 
IMG OID643394237 
Productprotein of unknown function DUF900 hydrolase family protein 
Protein accessionYP_002279055 
Protein GI209547137 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCAGCG TTGGTGGTGG GGTGGCGGCT CCTAGGAGCT GGTCTCGTTG GGTTGTTGTC 
TGTGTAGTCG TAGTCCTGCT TCCCGGATGT GGTGGCCATC CGAAAGATGT GCTGACCCCG
GTGGCCGACA CGTCTCCGCA GTCGCCGAAA GTGGACATGC TGATCGCGAC GACCCGCACG
CGGTCGACGG TTCCTGGCGA GATGTTCAAC GGCGAGCGGG CGCGGGCTCC CGCCTTTGCT
GACATGACGA TTTCCATTCC GCCCAGCAAG GTTCGCAAGG AGGGACAGGT CGCCTGGCCG
AAGAGGCTGC CCTCCAACCC GGCGACCGAC TTCGCCACCA TCAGAGCCGA CGATCTCAAC
GTTGAGAATG CCAAGAAATG GTTGAGCGCA AGCGTCAGGA AAACCCCGGA CGGCAGCGTC
CTTGTGTTCA TCCACGGCTT CAACAATCGG TTCGAGGATG CGGTCTATCG CTTCGCGCAG
ATCGTCCACG ACTCCGGAGT TCATAGCGCA CCCGTCCTGG TGACATGGCC CTCGCGCGGC
AGCCTGCTCG CCTACGGCTA TGATCGTGAG AGTACAAATT ACACACGCAA CGCGTTGGAG
ACGCTCTTCC AATATCTCGC GAAGGATCCC GAGGTAAAGG AAGTCTCGAT CCTCGCGCAT
TCGATGGGGA ATTGGCTCGC GCTGGAATCC TTACGCCAGA TGGCCATTCG CAACGGACGT
CTGCCCGCGA AGTTCAAGAA CGTCATGCTC GCAGCGCCCG ACGTCGACGT CGACGTCTTC
AGCCAACAGA TCGTCGACAT GGGCAAGCAG CACCCGCAGT TCACGCTGTT CGTATCCCGC
GACGACAAAG CACTGGCATT TTCGCGTCGC GTCTGGGGCG ATGTCTCCAG GCTCGGCGCC
ATCGACCCGG AGCAAGAGCC CTACAAGAAG GAACTGGAAG ACAACAAGAT CGTGGTCATC
GATCTCACCA AGATCAAATC CGGCGATAGC ATGAACCACG GCAAGTTCGC CGAGTCTCCC
CAGATCGTTC AGCTCATCGG CCAGCGCATC TCCGAAGGGC AGACGCTGAC CGACAGCCAC
GTCGGACTTG GCGACCAAAT CCTCGTCGCG ACAACCGGGG CTGCGGCAGC GGCCGGGAAC
GTCGCCGGTC TGGTTCTCGC CGCACCCGTC GCCGTGGTCG ATCAGGACAC CCGAGACAAC
TACGCGACCC ATGTCGGAAG CCTGTCGGGT CCAGGCCGTG CCCAGCCGGT CGCGTTCAAA
AAATGCAATC CAGCCCGACC GACGCCGAGC TGCCAACGTT GA
 
Protein sequence
MGSVGGGVAA PRSWSRWVVV CVVVVLLPGC GGHPKDVLTP VADTSPQSPK VDMLIATTRT 
RSTVPGEMFN GERARAPAFA DMTISIPPSK VRKEGQVAWP KRLPSNPATD FATIRADDLN
VENAKKWLSA SVRKTPDGSV LVFIHGFNNR FEDAVYRFAQ IVHDSGVHSA PVLVTWPSRG
SLLAYGYDRE STNYTRNALE TLFQYLAKDP EVKEVSILAH SMGNWLALES LRQMAIRNGR
LPAKFKNVML AAPDVDVDVF SQQIVDMGKQ HPQFTLFVSR DDKALAFSRR VWGDVSRLGA
IDPEQEPYKK ELEDNKIVVI DLTKIKSGDS MNHGKFAESP QIVQLIGQRI SEGQTLTDSH
VGLGDQILVA TTGAAAAAGN VAGLVLAAPV AVVDQDTRDN YATHVGSLSG PGRAQPVAFK
KCNPARPTPS CQR