Gene Information |
Locus tag | Rleg_5608 |
Symbol | |
ID | 8016834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | + |
Start bp | 190091 |
End bp | 191668 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644827773 |
Product | putative L-sorbosone dehydrogenase protein |
Protein accession | YP_002978973 |
Protein GI | 241518345 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
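The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon. The record does not state either convention, but both reproduce the values it lists. The function names are hypothetical, chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(190091, 191668)
print(length, protein_length_aa(length))
```

Under these assumptions the result matches the Gene Length (1578 bp) and Protein Length (525 aa) rows.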
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.759477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.00328711 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGAAGT CCCAGATTCT CGGCGCTTCG GTTCTGTCGA TATCTGTCGG CGTCGGTCTT GCGGCTTACG CCCAGAGCGG AGATTTTGAC ATCTCTCAGC AGATCGGACC GAACCCCGTG CTCCCAGATC CAGCCCCTTC CCTGCTGCCT GATCTGAAGG TAGCGGAGGT CGTTGGCTGG AAGGATGGTG AGACGCCGGC CGCACCGAAC GGTTTGACGG TCACTGCTTA TGCCAAGGAC CTCGCCAATC CAAGGACAGT CCACACCTTG CCGAACGGCG ACGTACTGGT AGTTCAGGCG CGTGGCCCGT CAGGCGAACC GGCCTCCCGG CCGAAGGATT TGATCAGAGG CTGGATCATG TCCATCGCTC ATGGCGACGG CGGCGAGCAG AAGGAAAGCA ACATCATCAC ACTGCTGCGT GACGCCAACC GCGACGGCAA GGTGGATGAG AGGCACGATC TGCTGAAGAA ACTCGATTCG CCGTTCGGCG TCGCCTGGGT CGACAACACG CTCTATGTCG CCTCGACGTC CGCCATTCTC GCCTACCCCT ATGAACTTGG GCAGAATGAA ATCACCGCCC AACCGAAAAC CATCACGCCC CTGCCCGGTG GTCCGATCAA TCATCATTGG ACCAAGGATC TGGCGCTCAG CCCCGATGGG CAGATGCTCT ACGTTTCGGT CGGCTCGAAT TCCAATATCG TCGAGAATGG GCTTGAAGCA GAAAAGGGCC GTGCGGCGAT CTGGCAGGTC GACCGGCGCA CCGGCGCGGC GCGCGTCTTC GCCTCAGGTC TGCGCAATCC GAACGGCCTC GCCTTCAACC CTGAGACAGG TTCGCTCTGG ACGGTCGTCA ATGAGCGCGA CGAACTCGGT CCGAACCTCG TTCCCGATTA CATGACCTCG GTTAAGGAAG GCGGCTTCTA CGGCTGGCCC TGGAGCTATT ACGGCAACCA TGTCGATGCG CGCGTGCATC CGCCGCGTCC GGACATGGTC GAAAGGGCGA CGCCACCGGA TTATGCCCTG TCGAGCCATG TCGCGGCCCT TGGATTGGCC TTCTCGATGA ATTCAGCGCT GCCGGCCGCC TACGCCAATG GCGCCTTCAT CGGAGAGCAC GGCAGCTGGA ACCGGGACAG CTTCAATGGC TACAAGGTGG TGTACGTACC ATTCGAGGCC GGGAAGCCAT CCGGCAAGGC GCAGGACGTC GTCACGGGCT TTATCCAGGA CGACCAAGCG AAGGGACGGC CGGTCGGAGT CGGGATCGAC GGGACGGGAG CTCTGCTCGT CGCAGATGAC GCCGGCAACA CCGTCTGGCG CGTTGCTTCG TCCGACGGCA AGATTACGCC GCAGCCCATC GGCACGGACC AGGTTTCGGC AAATCGGCAA GTCTCGACTG ATGCGACGGC GGGCGGGACC GCCGATATGA ATCCTGGCAT CGGAACCGAA AGGACGGGTT CGACCCCTCA ATCGCAGATG CCGGCAGCCC CGGCAGATGA ACGCCCGACC GACCAGAAAC CCCTTCCCGG ACAAGCGGAT AAATCCCAGC CTGCGCAAAT GCAGATCGCC CCTGCAGGTG GTCCATGA
|
Protein sequence | MKKSQILGAS VLSISVGVGL AAYAQSGDFD ISQQIGPNPV LPDPAPSLLP DLKVAEVVGW KDGETPAAPN GLTVTAYAKD LANPRTVHTL PNGDVLVVQA RGPSGEPASR PKDLIRGWIM SIAHGDGGEQ KESNIITLLR DANRDGKVDE RHDLLKKLDS PFGVAWVDNT LYVASTSAIL AYPYELGQNE ITAQPKTITP LPGGPINHHW TKDLALSPDG QMLYVSVGSN SNIVENGLEA EKGRAAIWQV DRRTGAARVF ASGLRNPNGL AFNPETGSLW TVVNERDELG PNLVPDYMTS VKEGGFYGWP WSYYGNHVDA RVHPPRPDMV ERATPPDYAL SSHVAALGLA FSMNSALPAA YANGAFIGEH GSWNRDSFNG YKVVYVPFEA GKPSGKAQDV VTGFIQDDQA KGRPVGVGID GTGALLVADD AGNTVWRVAS SDGKITPQPI GTDQVSANRQ VSTDATAGGT ADMNPGIGTE RTGSTPQSQM PAAPADERPT DQKPLPGQAD KSQPAQMQIA PAGGP
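Both sequences are printed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. As a minimal sketch, the helpers below strip the block formatting and compute a GC fraction. The function names are hypothetical, and the example string is only the first three blocks of the gene sequence above, not the full gene, so its GC fraction is not expected to equal the 62% reported for the whole gene.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in cleaned) / len(cleaned)


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGAAGAAGT CCCAGATTCT CGGCGCTTCG"
cleaned = strip_blocks(first_blocks)
print(len(cleaned), round(gc_fraction(first_blocks), 3))
```

Pasting the full gene sequence in place of `first_blocks` would give the whole-gene figure for comparison with the GC content row.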