Gene Rleg_3767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3767 
Symbol 
ID8014597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3821770 
End bp3822918 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content61% 
IMG OID644826330 
Productgalactonate dehydratase 
Protein accessionYP_002977549 
Protein GI241206453 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.854343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.799766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA CCAAACTCAC CACCTATATC GTTCCCCCGC GCTGGCTGTT TTTGAAGGTC 
GAGACCGATG AAGGCATCGT CGGCTGGGGC GAGCCGGTCG TCGAAGGCCG TGCGCTCACC
GTTCAGGCCG CCGTCCATGA GCTGGAAGAC TACCTGATCG GCAAGGATCC TTTCCTGATC
GAAGACCACT GGACCGTGAT GTATCGCGGC GGCTTCTATC GCGGCGGCGC CGTCCACATG
AGCGCAATCT CGGGCATCGA CCAGGCGCTG TGGGACATCA AGGGCAAGGC GCTCGGCCAG
CCGATCCATT CCCTGCTCGG CGGCCAGCTC CGTGATCGCA TCAAAGTCTA TTCCTGGATC
GGCGGCGACC GTCCCTCGGA TGTCGCCAAC AATGCCAAGG AAGTGGTGGC CCGCGGTTTC
AAAGCGATCA AGCTCAATGG CTGCGAGGAA ATGCAGATCG TCGACACCAA CGAAAAGGTG
GAGAAGGCGG TCGAGACCAT CGCCGCTATC CGCGAGGCGA TCGGCCCGCA TATCGGCATC
GGCGTCGATT TCCACGGCCG CGTCCACAAG CCGATGGCGA AGGTTCTCGC CAAGGAGCTC
GATCCCTACA AGCTGATGTT CATCGAAGAG CCGGTGCTTT CCGAAAACAA GGAAGCGCTG
CGCGATATCG TCAACCACAC CTCGACGCCG ATTGCGCTGG GTGAACGCCT CTTTTCGCGT
TGGGACTTCA AGCAGGTTCT CTCCGACGGT TATGTCGACA TCATCCAGCC GGATCTCTCC
CATGCCGGCG GCATCACCGA ATGCCGCAAG ATCGCGGCGA TGGCCGAAGC CTATGACGTG
GCGCTGGCGC CGCATTGCCC ACTGGGTCCG ATCGCGCTTG CCGCCTGCCT GCAGGTCGAT
GCCGTCAGCT ACAATGCCTT CATCCAGGAA CAGAGCCTCG GCATCCACTA CAACAAGGGC
AACGACATCC TCGACTACAT CTCCAACAAG GAGGTGTTCC AGTATGCCGA TGGTTTCGTC
TCGATCCCGC AGGGTCCGGG TCTCGGCATC GAGGTCGACG AGGCCTATGT CATCGAACGC
GCCAAGGAGG GCCACCGCTG GCGCAACCCG ATCTGGCGGC ATGCCGACGG CAGCTTCGCC
GAGTGGTGA
 
Protein sequence
MKITKLTTYI VPPRWLFLKV ETDEGIVGWG EPVVEGRALT VQAAVHELED YLIGKDPFLI 
EDHWTVMYRG GFYRGGAVHM SAISGIDQAL WDIKGKALGQ PIHSLLGGQL RDRIKVYSWI
GGDRPSDVAN NAKEVVARGF KAIKLNGCEE MQIVDTNEKV EKAVETIAAI REAIGPHIGI
GVDFHGRVHK PMAKVLAKEL DPYKLMFIEE PVLSENKEAL RDIVNHTSTP IALGERLFSR
WDFKQVLSDG YVDIIQPDLS HAGGITECRK IAAMAEAYDV ALAPHCPLGP IALAACLQVD
AVSYNAFIQE QSLGIHYNKG NDILDYISNK EVFQYADGFV SIPQGPGLGI EVDEAYVIER
AKEGHRWRNP IWRHADGSFA EW