Gene Rleg2_3479 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3479 
Symbol 
ID6982233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3594742 
End bp3595890 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content62% 
IMG OID643398197 
Productgalactonate dehydratase 
Protein accessionYP_002282972 
Protein GI209551055 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.412363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.814852 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA CCAAACTCAC CACCTATATC GTTCCGCCGC GCTGGCTGTT TTTGAAGATC 
GAGACCGACG AGGGCATCAT CGGCTGGGGC GAACCGGTCG TCGAAGGCCG CGCGCTGACC
GTCCAAGCCG CCGTCCACGA ACTGGAAGAC TATCTGATCG GCAAGGATCC CTTCCTGATC
GAGGACCACT GGACGGTCAT GTATCGCGGC GGCTTCTATC GCGGCGGCGC TGTACACATG
AGCGCCATCT CCGGCATCGA CCAGGCGCTG TGGGACATCA AGGGCAAGGC CCTCGGCCAG
CCGATCCATT CCCTGCTGGG CGGCCAGCTG CGCGACCGCA TCAAGGTCTA TTCCTGGATC
GGCGGCGATC GTCCGGCGGA TGTCGCCAAC AATGCCAGAG AGGTGGTCGC CCGCGGCTTC
AAGGCGATCA AGCTCAACGG CTGCGAGGAA ATGCAGATCG TCGACACCAA CGAGAAGGTG
GAAAAGGCGG TCGAAACCAT CGCCGTCATC CGCGAGGCGA TCGGCCCGCA TATCGGCATC
GGCGTCGATT TCCACGGCCG CGTGCACAAG CCGATGGCCA AGGTTCTCGC CAAAGAACTC
GAACCCTACA AGCTGATGTT CATCGAGGAG CCGGTTCTTT CGGAAAACAA GGAGGCGCTG
CGCGACATCG TCAACCATAC CTCGACGCCG ATCGCGCTTG GCGAACGGCT CTTTTCGCGC
TGGGACTTCA AGCAGGTCCT GTCGGACGGT TATGTCGACA TCATCCAGCC GGATCTGTCG
CATGCCGGCG GCATCACCGA GTGCCGCAAG ATCGCGGCGA TGGCCGAAGC CTATGATGTG
GCGCTGGCGC CGCATTGCCC GCTAGGCCCG ATCGCACTCG CCGCCTGCCT GCAGGTCGAT
GCCGTCAGCT ATAATGCCTT CATCCAGGAA CAGAGCCTCG GCATCCATTA CAACAAGGGC
AACGACATCC TCGACTACAT CTCCAACAAG GAGGTGTTCC AATATGCCGA TGGTTTCGTC
TCGATCCCCC AGGGGCCCGG TCTCGGCATC GAAGTCGACG AGGCCTATGT CATCGAGCGC
GCGAGGGAAG GCCACCGCTG GCGCAACCCG ATCTGGCGCC ACGCCGACGG CAGTTTCGCC
GAATGGTAA
 
Protein sequence
MKITKLTTYI VPPRWLFLKI ETDEGIIGWG EPVVEGRALT VQAAVHELED YLIGKDPFLI 
EDHWTVMYRG GFYRGGAVHM SAISGIDQAL WDIKGKALGQ PIHSLLGGQL RDRIKVYSWI
GGDRPADVAN NAREVVARGF KAIKLNGCEE MQIVDTNEKV EKAVETIAVI REAIGPHIGI
GVDFHGRVHK PMAKVLAKEL EPYKLMFIEE PVLSENKEAL RDIVNHTSTP IALGERLFSR
WDFKQVLSDG YVDIIQPDLS HAGGITECRK IAAMAEAYDV ALAPHCPLGP IALAACLQVD
AVSYNAFIQE QSLGIHYNKG NDILDYISNK EVFQYADGFV SIPQGPGLGI EVDEAYVIER
AREGHRWRNP IWRHADGSFA EW