Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6455 |
Symbol | |
ID | 6983526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | - |
Start bp | 119442 |
End bp | 120527 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643399452 |
Product | polysaccharide pyruvyl transferase |
Protein accession | YP_002284208 |
Protein GI | 209552293 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5039] Exopolysaccharide biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.854254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.145413 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCATC TCGGTAAAGT TGTAATACAG CGTCAACAAG TGAAGCTGAT TAAATCGGTA CAAAAATTTG TAAAACGAGG CCAACCTTAC ATCATAATGA ACTTCCCTTC GAATAAAGAT CCAGGCCATA ACGCGAGTTG GCTAGGATTA GCTCATATAT TACGGGAAGT AACAGGGCGT CTTCCAGTGT TTACCGGTAG CAATGCCGAT AACATCAACC AGATCAAATC CACGCCAGGC GATGCTCCAA TATTCGTTTG CGGCTGGGGT AACCTTGGAG ATGCACGGAC GGGTCGGGAC GATATTCTCT ATCGCTTGCT GTGGAAATAT CCCGATCGAA CGATAGTTCA AATGCCGCAA ACCATCGATT TCGGCAGCGA AGCGCTTGCA GCGTATGCAA AACGTACAAT CGCAAGCCAT CGCAGATTCT TCCTAATGGC GCGCGATAAT CAGAGCTTTG AGGTAGCAAA GGCCAGCTTC GATTGTCATG TTGAGGCAGT CCCGGATACG GCATTCGGTA TTGAAAGACT TAAACCATTC GAAGCTGACC CACTCAGCCT GCTGTATGTC ATGCAGCCCT TTGGAGAGGA CGGCGTTGAT ATAGGCAGAG CGCGAGCGAT TGCTGACGGC CCTCTTACCG ATTGGAGCAA CGGCCCAAAG GGCTTGTCCC GCATGCGCCA GTCGAGTCTC GTCAAGGCAG CGCGGCGACG TGGCTTTAGG CGCGTCGAGA TGGTGGCCCA ACATCATGAG GACGTTGCAG CACGTTATGT CAATTACGGC GCGAAGATCC TGTCCAGCGC GCAGCGGATA ATCACCAACA GTTTGCACGG GCACATCCTC TGTCTCTTGC TCAATAAGCC CCACATCGCG GTTGCGAGCA ACGGAAGCAG GCTCCACGAC TTCATCGGCA GTTGGACTGG TGACAGCCCT CTGGTTGAGA AAGCAACGAG CACTAGTGAG ATTTTAGCGG CCATGTCTCG TCTTCCATAT GAGTTGAACG GCACCTGGTA CAAGGCGAGC AGACCGATGG ATCTCAACCC CAGCCTTCCC GCCCCAGATC TGGTGCCTCA TCCCTCCATC GCCTGA
|
Protein sequence | MYHLGKVVIQ RQQVKLIKSV QKFVKRGQPY IIMNFPSNKD PGHNASWLGL AHILREVTGR LPVFTGSNAD NINQIKSTPG DAPIFVCGWG NLGDARTGRD DILYRLLWKY PDRTIVQMPQ TIDFGSEALA AYAKRTIASH RRFFLMARDN QSFEVAKASF DCHVEAVPDT AFGIERLKPF EADPLSLLYV MQPFGEDGVD IGRARAIADG PLTDWSNGPK GLSRMRQSSL VKAARRRGFR RVEMVAQHHE DVAARYVNYG AKILSSAQRI ITNSLHGHIL CLLLNKPHIA VASNGSRLHD FIGSWTGDSP LVEKATSTSE ILAAMSRLPY ELNGTWYKAS RPMDLNPSLP APDLVPHPSI A
|
| |