Gene Information |
Locus tag | Rleg2_6495 |
Symbol | |
ID | 6983566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | - |
Start bp | 160405 |
End bp | 162345 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643399492 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_002284248 |
Protein GI | 209552333 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
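The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive, 1-based coordinates and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(160405, 162345)
print(length)                  # 1941
print(protein_length(length))  # 646
```

Both results agree with the Gene Length (1941 bp) and Protein Length (646 aa) fields listed in the record.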
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCGC TCGTCGCCCC TTTGCTGGCG ATGCCGCGGG TTGCCAAACG CGCTCTGGCA TTGCTGGTGG ATTCCAGCTT TTGTATTCTG ACGATCTGGC TGGCCTATTG CTTCCGGCTG AATGAATGGA CGGTGCTGAC CGGCGTGCAG TGGTTGCCGG TTTTCGTTTC GCTGTGCATG GCGCTTCCCA TCTTCATCGT CATGGGCATG TATCGGGCGA TCTTCCGTTA TGCCAATCTG GCCGCTTTCA TTACCGTCTT AAAGGCGATT GCGATCTACG GCTTCGCCTT CATGACGATA TTTACGGCTC TCAGCGTACC TGGTGTTCCG AGAACCGTCG GCATTCTCCA GCCTTTCCTG CTGCTGATCG GGATCGGGCT GTCGAGGCTG GGGATCCGTT ATTGGCTCGG CGATGCCTAC CAGCGTATCC TTCACAAGAA TATGCTTGCG AAGGTCCTCA TCTATGGGGC GGGCAAGGCC GGACGTCAGC TGGCGGCCGC TTTGACGAAC AGCGCCGAAC TCAATGTCGT CGGCTATCTG GATGATGATC CGCGCCTCAA GGGCGGCGTC ATGGGCGGCT TGCCGATCTA TGACCCCTCG GATCTTCCGG TGCTTGCCGA AACCCTTGGC GTGCACAATG TGCTTCTTGC TTTGCCATCC GCCTCCCGGC AGCGACGCAA CGAAATCCTG GAGCATATCC GCAAAGCTAG GGTGAATGTT CGTACATTGC CGGATCTCAC GGCGCTCGCT CAGGGACGTG TCGCCGTCTC CGATATTCGT GAGCTGGAGA TCGAAGATCT GCTGGGGAGA GAAGCGGTCG CGCCGCGGCA GGAATTGCTC GACAAGGCGA TGCGCAACAA GGTGGTGATG GTGACGGGCG CCGGCGGCTC GATCGGCGGC GAGTTATGCC GCCAGATTCT GCGCAATGCG CCTTCCAGCC TGATCCTCCT CGATCAGAAC GAGTTTGCGC TTTATAATAT CGATGCCGAA TTGCGGAAGC TCGCCGAACT CTACGAGCAT GAAAATCTGC AGATCGTTCC GATCCTCTGT TCCGTCCGCG ATCAGGACCG CGTGGAGCAT ATCATCCAGA GCTGGCGGCC GCAGACGCTC TATCATGCCG CCGCCTACAA GCATGTGCCG CTTGTCGAAC ATAATGCCGT GGAAGGCATC AAGAACAACG TCATGGGTAC GCTTGTTGCG GCGCGCGCGG CGCGTAAATA CGGCGTCTCG AATTTCGTGC TGATCAGTAC GGATAAGGCC GTGCGTCCGA CAAATGTCAT GGGCGCCAGC AAGAGGCTGG CGGAGATGGT TCTGCAGGCG CTCGCCGCAG AATCGGCAAC CGACAGACTG CGAACGAATT TTTCCATGGT CCGCTTCGGA AACGTCCTCG GCTCCTCCGG ATCCGTCGTG CCGCTTTTCA GGCAACAGAT CAAGGAAGGC GGCCCCGTCA CGCTGACGCA TCGTGAGATA ACCCGCTATT TCATGACTAT TTCGGAAGCC TCGCAGCTCG TCATCCAGGC AGGCGCGATG GGCGAGGGCG GCGATGTTTT TCTGCTCGAT ATGGGCGAAC CCGTTCGCAT CGCCGATCTG GCCCGCAAGA TGGTGGAGCT GTCCGGGCTG AGCGTCCGCG ACGACATCAG CCCCGAAGGG GATATCGAGC TTTCCGTGAC CGGTCTCAGG CCCGGCGAGA AGCTCTATGA AGAACTTCTG ATCGGGGATA ATCCGGAAAC AACCGAACAT CCCCGGATCA TGAAGGCGCG TGAGGATTTC CTGTCCTGGC CGGAGCTTTT GAAAAGGCTC 
AACTCGCTCA ACGCGGCATT GGATCGGAAC GATATGGCCG CTGCACGTGC GATATTGGCC GAGCTTGTTT CGGGCTATTC GTCGACGGGT GAGGTCTCGG ATCTGGCATT CACCGGCGCC GAAACCAATA CGGCCGCCTG A
|
Protein sequence | MQALVAPLLA MPRVAKRALA LLVDSSFCIL TIWLAYCFRL NEWTVLTGVQ WLPVFVSLCM ALPIFIVMGM YRAIFRYANL AAFITVLKAI AIYGFAFMTI FTALSVPGVP RTVGILQPFL LLIGIGLSRL GIRYWLGDAY QRILHKNMLA KVLIYGAGKA GRQLAAALTN SAELNVVGYL DDDPRLKGGV MGGLPIYDPS DLPVLAETLG VHNVLLALPS ASRQRRNEIL EHIRKARVNV RTLPDLTALA QGRVAVSDIR ELEIEDLLGR EAVAPRQELL DKAMRNKVVM VTGAGGSIGG ELCRQILRNA PSSLILLDQN EFALYNIDAE LRKLAELYEH ENLQIVPILC SVRDQDRVEH IIQSWRPQTL YHAAAYKHVP LVEHNAVEGI KNNVMGTLVA ARAARKYGVS NFVLISTDKA VRPTNVMGAS KRLAEMVLQA LAAESATDRL RTNFSMVRFG NVLGSSGSVV PLFRQQIKEG GPVTLTHREI TRYFMTISEA SQLVIQAGAM GEGGDVFLLD MGEPVRIADL ARKMVELSGL SVRDDISPEG DIELSVTGLR PGEKLYEELL IGDNPETTEH PRIMKAREDF LSWPELLKRL NSLNAALDRN DMAAARAILA ELVSGYSSTG EVSDLAFTGA ETNTAA
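The sequences above are displayed in blocks of ten characters separated by spaces, so they need to be cleaned before their length or GC content can be compared with the Gene Information fields. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example uses only the first two blocks of the gene sequence, so its GC value describes that fragment and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence shown in the record.
fragment = clean_sequence("ATGCAAGCGC TCGTCGCCCC")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 70.0
```

Applied to the full gene sequence, the same two functions would give values to compare against the Gene Length and GC content fields.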