Gene Rleg_5378 details

Gene Information

Locus tag: Rleg_5378
Symbol:
ID: 8007336
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 788638
End bp: 789768
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 62%
IMG OID: 644822282
Product: Xylose isomerase domain protein TIM barrel
Protein accession: YP_002973542
Protein GI: 241113707
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1082] Sugar phosphate isomerases/epimerases
TIGRFAM ID:
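
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(788638, 789768)
print(length, protein_length(length))
```

Under those assumptions the coordinates give 1131 bp and 376 aa, matching the Gene Length and Protein Length fields.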


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTGG GAATCGACAG CATAAAGCTG CCTGAGGCGA AGAAGCGGGG GCCGCTGGCA 
AGCCTTGATC ACGTCAAGGA ACTTGGGCTC GCAGGCATCT TCTTCAGTAC GGCGCTGGAC
ATGAGCCCCG ACCTCGACAG CGGCCTGCTG CGCGACATCA GGGCGAAGGC CGACGACCTT
GGCCTCTATC TCGAAAGCGG CATCGGCAAG ATCAATCCCT ATTGCAGCGC CGAGGAACCG
GTACTCCGGG CCGCCGGCGG CGGCGATATC ATTGCCGGTT TCACGCGCAT GATCGAGGCA
AGTGCCGCGA TCGGCTGCCA TGAGCTTTGG GTTGCACCGG GCAATTTCAA GGGCGAATAT
CGCGGCCGGC TGGCCAATGA CCGCTTCCGC ACCGACGTGA CCTGGGAAGA GCAGTTGCTC
GGAATTGAAA ACGTCCTCCG CAAGCTGGCG CCCGTCGCAC GTGCCAATGG CGCGCACATG
AACATCGAAA CCCATGACGA GATCACGTCC TTCGAGATCC TGCGATTGAT CGAGAAGGTC
GGCGCCGATT GCGTCGGCGT CGTCTTCGAC ACGGCAAACG GACTGCAGCG GGGCGAGCAT
CCGGTCTTCG CCGCCAAGCG CCTGGCTCCT CATATCCGAC AGACCCATAT CAAGGATGCC
TATGTCGGCC GCGCTCCGGG TGGTCTCGAT TTCCAGACCA GACCCGTTGG CGGCGGCATT
GTCGATTTCG CCGCGATCCT TCCCATTCTC AGCGACGCCA GCGCCGCGCT GAACCTGTCG
CTGGAGGTTG CCCAGTCTGT CGCCGACAAG CCTCGCAAGG CCAATCCACG CCAGTGCATC
GAGATCGACG ATCCGGTCTG GCGAGCTGGC CACCCGGACC TGACGGCGGA TGAGCTTGCG
GCCTACATGG CGATGGTGGA TGCCTATGAA AAGCGGGTCG CCTCCGGAGC GGTTCTCGAC
TGGGAAGCCT ACGAGAGCAG CCGCTACGGC TACCCGACCT ATGAGGTGCA ATCCTACGGT
TTCGACGAGG CGATTGGTTT CATCAAGCAG TCGGCCCGCC ACATCGAGGC TATTTGCGCC
GAAAAGGGTA TTACCTTGTC CCCGCCGGCA AAAGAACAAA AGGCAGCCTA G
 
Protein sequence
MKLGIDSIKL PEAKKRGPLA SLDHVKELGL AGIFFSTALD MSPDLDSGLL RDIRAKADDL 
GLYLESGIGK INPYCSAEEP VLRAAGGGDI IAGFTRMIEA SAAIGCHELW VAPGNFKGEY
RGRLANDRFR TDVTWEEQLL GIENVLRKLA PVARANGAHM NIETHDEITS FEILRLIEKV
GADCVGVVFD TANGLQRGEH PVFAAKRLAP HIRQTHIKDA YVGRAPGGLD FQTRPVGGGI
VDFAAILPIL SDASAALNLS LEVAQSVADK PRKANPRQCI EIDDPVWRAG HPDLTADELA
AYMAMVDAYE KRVASGAVLD WEAYESSRYG YPTYEVQSYG FDEAIGFIKQ SARHIEAICA
EKGITLSPPA KEQKAA
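
The sequences above are printed in blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch of how one might strip the spacing, compute GC content, and translate the gene to compare it with the listed protein. All function names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for elongation codons; the sketch does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an unambiguous DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence, followed by its stop codon.
fragment = clean_sequence("ATGAAATTGG GAATCGACAG C TAG")
print(translate(fragment))
```

To check the whole record, paste the full gene and protein blocks into `clean_sequence` and compare `translate(gene)` with the cleaned protein string.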