Gene Information |
Locus tag | Rleg2_5690 |
Symbol | |
ID | 6977081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 88789 |
End bp | 89946 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643393147 |
Product | oxidoreductase domain protein |
Protein accession | YP_002277965 |
Protein GI | 209546075 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
|
|
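The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length counts the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_length = span_length(88789, 89946)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

With the values in this record the sketch gives 1158 bp and 385 aa, which agrees with the Gene Length and Protein Length fields.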
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.324305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTGG GAATCATCGG GCTGGGATTC CGGCTCGGCT ATCTCGGCTA CGTCTTCAAA GCGATCGACA GCAGCTTCGA CATTGTCGGC TATGTGGATC CGGAGCCCGC CGGACTTCCA GGATTGACGG AAAAGGGAGT CTCGGTCGGC AAGGCCTATG GGTCGCCGGA AGAGCTGCTC GCTTCCGAAA AGCTCGATCT GCTGATGATC GGCTCCCCCA ATCACCTGCA TCTCGATCAT ATCAGGCTCG GGCTGCAAGC CGGGCTCAAG GTCTTCTGCG AAAAGCCGAT CGTTACCACG ATTGCCGAAA GCATCGAGCT TGCCCATCTG ATGGCGAAAT TCGGCCATGA GCGGCTGATG GTCGGCCTCG TGCTGCGTTA TTCGCCCCTC TACAAGGATC TGCGCGCCAT CCAGGCCGAG GGCAAACTCG GCCAGATCGT CTCGATCGAG GCTTCCGAAC ATATCGAGCC CTATCACGGC GCCTTCTTCA TGCGCGATTG GCGCCGCTAC GAGCGTTATT CCGGCAGCTT CATGCTGGAG AAATGCTGCC ACGACCTCGA CCTTTACAAT GGTGTCGTCG GCGCGCGGCC GGAGCGGGTC GCCAGTTTCG GTGGCCGCAA GAGCTTCATT CCGGCCAACG ACCCGGCCCG CGAAGGCATC AACGACCTCG AGCTTTTCCA CCGCAAGCCG AGCGGCTGGA TGGGATCGGA CAAGGTCTTC GACAGCGATG CCGACATCAT CGATTATCAG GTGGCGATCG TCGAATATGA AAATGGCGTC GGCATGAACT TCCACACCAA TCTGAACGTG CCCGATCAGT TCCGCCGTTT CGCCATCATG GGTTCGCGCG GCATGGCCGA GGGCGATTTC GTGCGCGGCT ATCTCGATGT GCATGAGCAG CTGACCGGCA ACAAGGTGAT CGAAAATAAA TATGCCGCCA CCGAGCTCTC CCAGCATTAT GGCGCCGACG AACAGATGGC GAGCGATCTG CTGGAAAGCG TGCGCACCGG GCTCGAACTT CCGGTTTCAA CGCTGAATGC GCTCGAAGCC GGCATCCTCG CCCTGGCGAT GGATGAGGCG AGGATGAAGA AAACCGTCGT CGACCTGCGT CCCGTCTGGG ACCGTTTCGA CGAGGCGCTC CACGCAAGAG CGGCTTGA
|
Protein sequence | MKVGIIGLGF RLGYLGYVFK AIDSSFDIVG YVDPEPAGLP GLTEKGVSVG KAYGSPEELL ASEKLDLLMI GSPNHLHLDH IRLGLQAGLK VFCEKPIVTT IAESIELAHL MAKFGHERLM VGLVLRYSPL YKDLRAIQAE GKLGQIVSIE ASEHIEPYHG AFFMRDWRRY ERYSGSFMLE KCCHDLDLYN GVVGARPERV ASFGGRKSFI PANDPAREGI NDLELFHRKP SGWMGSDKVF DSDADIIDYQ VAIVEYENGV GMNFHTNLNV PDQFRRFAIM GSRGMAEGDF VRGYLDVHEQ LTGNKVIENK YAATELSQHY GADEQMASDL LESVRTGLEL PVSTLNALEA GILALAMDEA RMKKTVVDLR PVWDRFDEAL HARAA
|
| |
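The Protein sequence row can be related to the Gene sequence row by codon translation, and the GC content field can be recomputed from the same row. What follows is a minimal sketch, not the database's own method. It assumes the gene sequence is listed 5' to 3' in coding orientation (it begins with ATG and ends with TGA even though the Strand field is "-"), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (GTG, TTG) are not special-cased here;
    this record's sequence starts with ATG. Ambiguous bases such as N
    are not handled and raise KeyError.
    """
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the Gene sequence row.
first_blocks = "ATGAAAGTGG GAATCATCGG GCTGGGATTC"
print(translate(first_blocks))  # MKVGIIGLGF
```

The ten residues printed match the first block of the Protein sequence row. Passing the full Gene sequence row to `translate` and `gc_fraction` would extend the same check to the whole protein and to the GC content field.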