Gene Rleg_3000 details


Gene Information

Locus tag: Rleg_3000
Symbol:
ID: 8013917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2997103
End bp: 2998245
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 64%
IMG OID: 644825570
Product: glucose sorbosone dehydrogenase
Protein accession: YP_002976798
Protein GI: 241205702
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2133] Glucose/sorbosone dehydrogenases
TIGRFAM ID:
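
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2997103
end_bp = 2998245
gene_length_bp = 1143
protein_length_aa = 380


def span_length(start, end):
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1143
print(expected_protein_length(gene_length_bp))  # 380
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.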


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0518762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.592053
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAGAA TATCCGCCTT CCATTTCGCC TCAGCGCTGG TCCTGTTCGG CGCCATGTCC 
GCGGATGCGG CCGACATCGT CAACACGCAG GATCTCGCGG TCCGTGTCGA CAAGCTCGCC
GACGGCCTCC AACATCCCTG GGCGGTCGAA GTGTTGCCCG ACGGGGCCTA TCTCGTCACC
GAGCGGCCGG GCCGCATGCG CATCGTCCGC GACGGCAAGG TTTCCGAGCC GATCGGCGGC
GTACCCAAGG TCAGCGCTCG TGGTCAGGGC GGCCTGATGG ACGTGGCGCT CGCGCCGGAC
TTTGCGAAAT CTCGCAAGCT CTATTTCACC GCCGCCATCG CCAACAGCCA GGGCTCCGGC
ACCGAAGCCT TCAGCGCCGC GCTTTCCACT GACGAGAAGA CACTCGACGC CGTGAGGCCT
ATCTTCAGCA TGCGGCGCTT CACGTCGGGC AATATCCAGT ACGGCTCGCG CATCGCGATT
GCCTCAGACG GTACGCTGTT CATCAGCGTC GGTGATCGCG GCAACCGCGA CCGCTCGCAA
GACTGGCAGG ACGATGCCGG CTCGATCATC CACATCAACG CCGATGGCAG CATTCCTGCC
GACAATCCAT TCAAGGAAGG CGGCAAGGCG CTGCCGGAAA TCTGGTCGAA AGGTCACCGC
AACCCGCAGG GCATCACCTT CGACGCCAAA GATGGCAAGC TCTATACCGT CGAACACGGT
GCGCGCGGCG GCGACGAGAT CAACCAGCCC GAGGCCGGCA AGAATTACGG CTGGCCGATC
ATCACCTATG GCCGCGACTA TTCGGGTGCC GAGATCGGTG AAGGCACCGC CAAGGACGGG
CTGGAACAGC CGCTCCATTA CTGGGATCCT TCGATCGCAC CAGGCGCCCT CGTCGTCTAT
CGTGGCGCCA TGTTCCCGGA ATGGGACGGC AATTTCCTCG TCGCGGCGCT GAAGTTCCAA
CTGCTCTCGC GCATGCAGCG CGACGACGGC GGCGCCTTCG TCACCGAAGA GCGCCTGTTC
GAGGGCGAAT ACGGCCGCAT CCGCGACGTC GTCGTCGCCC CCGACGGCGC CCTGCTGATG
GTGACGGATG AGGACAACGG CGCGCTGCTC AGGATATCCC GAGCGCAAGC CCGCAACGGC
TGA
 
Protein sequence
MKRISAFHFA SALVLFGAMS ADAADIVNTQ DLAVRVDKLA DGLQHPWAVE VLPDGAYLVT 
ERPGRMRIVR DGKVSEPIGG VPKVSARGQG GLMDVALAPD FAKSRKLYFT AAIANSQGSG
TEAFSAALST DEKTLDAVRP IFSMRRFTSG NIQYGSRIAI ASDGTLFISV GDRGNRDRSQ
DWQDDAGSII HINADGSIPA DNPFKEGGKA LPEIWSKGHR NPQGITFDAK DGKLYTVEHG
ARGGDEINQP EAGKNYGWPI ITYGRDYSGA EIGEGTAKDG LEQPLHYWDP SIAPGALVVY
RGAMFPEWDG NFLVAALKFQ LLSRMQRDDG GAFVTEERLF EGEYGRIRDV VVAPDGALLM
VTDEDNGALL RISRAQARNG
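
The sequences above are laid out in space-separated blocks of 10 characters. As a rough sketch of how the GC content and protein sequence could be re-derived from the gene sequence, the Python below strips the block spacing, computes a GC percentage, and translates codon by codon. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one: translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments; only start codons differ.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = clean("ATGAAGAGAA TATCCGCCTT CCATTTCGCC")
print(translate(first_blocks))  # MKRISAFHFA
```

Passing the full gene sequence through `clean` and then `translate` or `gc_percent` would allow a comparison against the Protein sequence and GC content fields of this record.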