Gene Rleg_5608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5608 
Symbol 
ID8016834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp190091 
End bp191668 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content62% 
IMG OID644827773 
Productputative L-sorbosone dehydrogenase protein 
Protein accessionYP_002978973 
Protein GI241518345 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.759477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00328711 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAAGT CCCAGATTCT CGGCGCTTCG GTTCTGTCGA TATCTGTCGG CGTCGGTCTT 
GCGGCTTACG CCCAGAGCGG AGATTTTGAC ATCTCTCAGC AGATCGGACC GAACCCCGTG
CTCCCAGATC CAGCCCCTTC CCTGCTGCCT GATCTGAAGG TAGCGGAGGT CGTTGGCTGG
AAGGATGGTG AGACGCCGGC CGCACCGAAC GGTTTGACGG TCACTGCTTA TGCCAAGGAC
CTCGCCAATC CAAGGACAGT CCACACCTTG CCGAACGGCG ACGTACTGGT AGTTCAGGCG
CGTGGCCCGT CAGGCGAACC GGCCTCCCGG CCGAAGGATT TGATCAGAGG CTGGATCATG
TCCATCGCTC ATGGCGACGG CGGCGAGCAG AAGGAAAGCA ACATCATCAC ACTGCTGCGT
GACGCCAACC GCGACGGCAA GGTGGATGAG AGGCACGATC TGCTGAAGAA ACTCGATTCG
CCGTTCGGCG TCGCCTGGGT CGACAACACG CTCTATGTCG CCTCGACGTC CGCCATTCTC
GCCTACCCCT ATGAACTTGG GCAGAATGAA ATCACCGCCC AACCGAAAAC CATCACGCCC
CTGCCCGGTG GTCCGATCAA TCATCATTGG ACCAAGGATC TGGCGCTCAG CCCCGATGGG
CAGATGCTCT ACGTTTCGGT CGGCTCGAAT TCCAATATCG TCGAGAATGG GCTTGAAGCA
GAAAAGGGCC GTGCGGCGAT CTGGCAGGTC GACCGGCGCA CCGGCGCGGC GCGCGTCTTC
GCCTCAGGTC TGCGCAATCC GAACGGCCTC GCCTTCAACC CTGAGACAGG TTCGCTCTGG
ACGGTCGTCA ATGAGCGCGA CGAACTCGGT CCGAACCTCG TTCCCGATTA CATGACCTCG
GTTAAGGAAG GCGGCTTCTA CGGCTGGCCC TGGAGCTATT ACGGCAACCA TGTCGATGCG
CGCGTGCATC CGCCGCGTCC GGACATGGTC GAAAGGGCGA CGCCACCGGA TTATGCCCTG
TCGAGCCATG TCGCGGCCCT TGGATTGGCC TTCTCGATGA ATTCAGCGCT GCCGGCCGCC
TACGCCAATG GCGCCTTCAT CGGAGAGCAC GGCAGCTGGA ACCGGGACAG CTTCAATGGC
TACAAGGTGG TGTACGTACC ATTCGAGGCC GGGAAGCCAT CCGGCAAGGC GCAGGACGTC
GTCACGGGCT TTATCCAGGA CGACCAAGCG AAGGGACGGC CGGTCGGAGT CGGGATCGAC
GGGACGGGAG CTCTGCTCGT CGCAGATGAC GCCGGCAACA CCGTCTGGCG CGTTGCTTCG
TCCGACGGCA AGATTACGCC GCAGCCCATC GGCACGGACC AGGTTTCGGC AAATCGGCAA
GTCTCGACTG ATGCGACGGC GGGCGGGACC GCCGATATGA ATCCTGGCAT CGGAACCGAA
AGGACGGGTT CGACCCCTCA ATCGCAGATG CCGGCAGCCC CGGCAGATGA ACGCCCGACC
GACCAGAAAC CCCTTCCCGG ACAAGCGGAT AAATCCCAGC CTGCGCAAAT GCAGATCGCC
CCTGCAGGTG GTCCATGA
 
Protein sequence
MKKSQILGAS VLSISVGVGL AAYAQSGDFD ISQQIGPNPV LPDPAPSLLP DLKVAEVVGW 
KDGETPAAPN GLTVTAYAKD LANPRTVHTL PNGDVLVVQA RGPSGEPASR PKDLIRGWIM
SIAHGDGGEQ KESNIITLLR DANRDGKVDE RHDLLKKLDS PFGVAWVDNT LYVASTSAIL
AYPYELGQNE ITAQPKTITP LPGGPINHHW TKDLALSPDG QMLYVSVGSN SNIVENGLEA
EKGRAAIWQV DRRTGAARVF ASGLRNPNGL AFNPETGSLW TVVNERDELG PNLVPDYMTS
VKEGGFYGWP WSYYGNHVDA RVHPPRPDMV ERATPPDYAL SSHVAALGLA FSMNSALPAA
YANGAFIGEH GSWNRDSFNG YKVVYVPFEA GKPSGKAQDV VTGFIQDDQA KGRPVGVGID
GTGALLVADD AGNTVWRVAS SDGKITPQPI GTDQVSANRQ VSTDATAGGT ADMNPGIGTE
RTGSTPQSQM PAAPADERPT DQKPLPGQAD KSQPAQMQIA PAGGP