Gene Information |
Locus tag | Rleg_6005 |
Symbol | |
ID | 8016271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | - |
Start bp | 32344 |
End bp | 34311 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644827317 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_002978517 |
Protein GI | 241258633 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.104997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCTGC AAGCGCTTGT CGCCCCTTTG CTGGCGATGC CGCGTGTTGC CAAACGCGCC CTGGCTTTGC TGGTGGATTC CAGCTTTTGT GTTCTGACGA TATGGCTGGC CTATTGCTTC CGTTTGAACG AATGGACGGT GCTCACTGGT GTCCAGTGGT TGCCGGTCTT CGTTTCACTG TGCATGGCCC TTCCCATCTT CATCGTCATG GGCATGTACC GGGCGATCTT CCGTTATGCC AATATGGCTG CTTTCATTAC TGTTCTGAAG GCCATTGCGA TCTACGGCTT CGCCTTCATG ACGATATTTA CAGCCCTCAG CGTACCGGGC GTTCCGAGAA CAGTCGGTAT TCTCCAGCCC TTCCTGCTGT TGATTGCGAT CGGACTGTCG AGGTTGAGCA TCCGCTACTG GCTCGGGGAT GCCTACCAGC GCATCCTTCA CAAGAATACG CTCGCCAAGG TGCTGATCTA TGGAGCAGGG AAGGCCGGGC GGCAGCTGGC CGGTGCCTTG ATCAACAGTG CCGAACTCAA TGTCGTCGGC TATCTGGATG ATGATCCGCG TCTCAAGGGC GGCGTCATGG GTGGTTTGCC GATCTACGAC CCCTCGGATC TTCCGGTGCT TGCCGAATCT CTTGGCGTGC ACAACGTCCT TCTTGCTCTT CCATCTGCAT CGCGGCAGCG TCGCAACGAA ATCCTGGAGC ACATCCGTAA AGCCAGGGTC AATGTTCGCA CGTTGCCGGA TCTCACGGCC CTGGCTCAGG GACGCATCGC CGTCTCCGAC ATCAGAGAGC TGGAGATCGA AGATCTGCTG GGGAGGGAAG CGGTCGCACC ACGGCAGGAG TTGCTCGACA AGGCGATGCG CAAAAAGGTG GTTATGGTCA CGGGTGCTGG TGGCTCGATC GGCGGCGAGT TATGCCGCCA GATTCTGCGC AACGAGCCTT CGAGCCTGAT CCTCATCGAT CAGAACGAGT TTGCGCTTTA TAATATTCAT GCCGAATTGC AGAAGCTGGC CGAACTGTAC AAACACGAAA ATACGCAGAT CGTCCCGATT CTCTGTTCTG TCCGCGATCA GGATCGCATG GAACATGTCA TGCAGAGCTG GCGTCCTCAG ACGCTCTATC ATGCAGCCGC TTACAAGCAT GTTCCCCTTG TCGAACACAA TGCCGTGGAA GGCATCAAGA ACAATGTGAT GGGTACGCTG GTCGCGGCAC GCGCGGCGAA TAAATGCGGC GTCTCGAATT TCGTGCTGAT CAGTACAGAC AAGGCCGTGC GTCCGACAAA TGTGATGGGC GCCAGCAAGA GGTTGGCAGA GATGGTTCTG CAGGCGCTCG CAGCAGAATC GGCGACTGAC AGAATGCGAA CGAATTTCTC CATGGTCCGC TTCGGAAACG TCCTCGGCTC CTCCGGATCT GTCGTGCCGC TTTTCAGGCA GCAGATCAAG GAAGGCGGCC CGGTGACGCT GACGCATCCT GACATAACCC GCTATTTCAT GACCATTTCG GAAGCCTCGC AGCTCGTCAT ACAGGCCGGC GCGATGGCCG ACGGCGGCGA TGTTTTCTTG CTCGACATGG GGGAGCCCGT CCGCATCGCC GATCTCGCCC GCAAGATGGT CGAGCTTTCC GGGCTGGCCG TCCGCGATGA GAACAATCCC GAAGGTGATA TCGAGCTTTC CGTTACCGGT CTTCGGCCCG GCGAGAAGCT CTACGAAGAA CTTTTGATCG GCGACAACCC AGAAAGAACC GAACATCCGC GCATTATGAA GGCGCGCGAG GATTTCCTCT TCTGGTCGGA GCTTTCGAAA 
AAGCTCAACT CGCTCAATGC GGTATTGGAT CGGAACGATA TGGTCGCGGC ACGTGCGATG TTGGCGGACC TCGTCTCCGG CTATTCCTCA ACAGGTGAGG TCTCGGATCT GGCTTTCAGC GGCGCCGAGC CACTACGGCT GCCGCAGCCA ATTCAAAGCA CGTTGTAG
|
Protein sequence | MPLQALVAPL LAMPRVAKRA LALLVDSSFC VLTIWLAYCF RLNEWTVLTG VQWLPVFVSL CMALPIFIVM GMYRAIFRYA NMAAFITVLK AIAIYGFAFM TIFTALSVPG VPRTVGILQP FLLLIAIGLS RLSIRYWLGD AYQRILHKNT LAKVLIYGAG KAGRQLAGAL INSAELNVVG YLDDDPRLKG GVMGGLPIYD PSDLPVLAES LGVHNVLLAL PSASRQRRNE ILEHIRKARV NVRTLPDLTA LAQGRIAVSD IRELEIEDLL GREAVAPRQE LLDKAMRKKV VMVTGAGGSI GGELCRQILR NEPSSLILID QNEFALYNIH AELQKLAELY KHENTQIVPI LCSVRDQDRM EHVMQSWRPQ TLYHAAAYKH VPLVEHNAVE GIKNNVMGTL VAARAANKCG VSNFVLISTD KAVRPTNVMG ASKRLAEMVL QALAAESATD RMRTNFSMVR FGNVLGSSGS VVPLFRQQIK EGGPVTLTHP DITRYFMTIS EASQLVIQAG AMADGGDVFL LDMGEPVRIA DLARKMVELS GLAVRDENNP EGDIELSVTG LRPGEKLYEE LLIGDNPERT EHPRIMKARE DFLFWSELSK KLNSLNAVLD RNDMVAARAM LADLVSGYSS TGEVSDLAFS GAEPLRLPQP IQSTL
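The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function names `gene_length` and `expected_protein_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop_codon is
    True, that the final codon is a stop codon that adds no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above (Start bp, End bp).
length_bp = gene_length(32344, 34311)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the computed values agree with the Gene Length (1968 bp) and Protein Length (655 aa) fields listed in the record.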