Gene Rleg2_6495 details

Gene Information

Locus tag: Rleg2_6495
Symbol:
ID: 6983566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011371
Strand:
Start bp: 160405
End bp: 162345
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 59%
IMG OID: 643399492
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_002284248
Protein GI: 209552333
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
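
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the stop codon is counted in the gene length.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 160405
end_bp = 162345
gene_length_bp = 1941
protein_length_aa = 646

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codons = gene_length_bp // 3
residues = codons - 1

print(span, residues)  # 1941 646
```

Both derived values agree with the Gene Length and Protein Length fields listed in the record.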


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCGC TCGTCGCCCC TTTGCTGGCG ATGCCGCGGG TTGCCAAACG CGCTCTGGCA 
TTGCTGGTGG ATTCCAGCTT TTGTATTCTG ACGATCTGGC TGGCCTATTG CTTCCGGCTG
AATGAATGGA CGGTGCTGAC CGGCGTGCAG TGGTTGCCGG TTTTCGTTTC GCTGTGCATG
GCGCTTCCCA TCTTCATCGT CATGGGCATG TATCGGGCGA TCTTCCGTTA TGCCAATCTG
GCCGCTTTCA TTACCGTCTT AAAGGCGATT GCGATCTACG GCTTCGCCTT CATGACGATA
TTTACGGCTC TCAGCGTACC TGGTGTTCCG AGAACCGTCG GCATTCTCCA GCCTTTCCTG
CTGCTGATCG GGATCGGGCT GTCGAGGCTG GGGATCCGTT ATTGGCTCGG CGATGCCTAC
CAGCGTATCC TTCACAAGAA TATGCTTGCG AAGGTCCTCA TCTATGGGGC GGGCAAGGCC
GGACGTCAGC TGGCGGCCGC TTTGACGAAC AGCGCCGAAC TCAATGTCGT CGGCTATCTG
GATGATGATC CGCGCCTCAA GGGCGGCGTC ATGGGCGGCT TGCCGATCTA TGACCCCTCG
GATCTTCCGG TGCTTGCCGA AACCCTTGGC GTGCACAATG TGCTTCTTGC TTTGCCATCC
GCCTCCCGGC AGCGACGCAA CGAAATCCTG GAGCATATCC GCAAAGCTAG GGTGAATGTT
CGTACATTGC CGGATCTCAC GGCGCTCGCT CAGGGACGTG TCGCCGTCTC CGATATTCGT
GAGCTGGAGA TCGAAGATCT GCTGGGGAGA GAAGCGGTCG CGCCGCGGCA GGAATTGCTC
GACAAGGCGA TGCGCAACAA GGTGGTGATG GTGACGGGCG CCGGCGGCTC GATCGGCGGC
GAGTTATGCC GCCAGATTCT GCGCAATGCG CCTTCCAGCC TGATCCTCCT CGATCAGAAC
GAGTTTGCGC TTTATAATAT CGATGCCGAA TTGCGGAAGC TCGCCGAACT CTACGAGCAT
GAAAATCTGC AGATCGTTCC GATCCTCTGT TCCGTCCGCG ATCAGGACCG CGTGGAGCAT
ATCATCCAGA GCTGGCGGCC GCAGACGCTC TATCATGCCG CCGCCTACAA GCATGTGCCG
CTTGTCGAAC ATAATGCCGT GGAAGGCATC AAGAACAACG TCATGGGTAC GCTTGTTGCG
GCGCGCGCGG CGCGTAAATA CGGCGTCTCG AATTTCGTGC TGATCAGTAC GGATAAGGCC
GTGCGTCCGA CAAATGTCAT GGGCGCCAGC AAGAGGCTGG CGGAGATGGT TCTGCAGGCG
CTCGCCGCAG AATCGGCAAC CGACAGACTG CGAACGAATT TTTCCATGGT CCGCTTCGGA
AACGTCCTCG GCTCCTCCGG ATCCGTCGTG CCGCTTTTCA GGCAACAGAT CAAGGAAGGC
GGCCCCGTCA CGCTGACGCA TCGTGAGATA ACCCGCTATT TCATGACTAT TTCGGAAGCC
TCGCAGCTCG TCATCCAGGC AGGCGCGATG GGCGAGGGCG GCGATGTTTT TCTGCTCGAT
ATGGGCGAAC CCGTTCGCAT CGCCGATCTG GCCCGCAAGA TGGTGGAGCT GTCCGGGCTG
AGCGTCCGCG ACGACATCAG CCCCGAAGGG GATATCGAGC TTTCCGTGAC CGGTCTCAGG
CCCGGCGAGA AGCTCTATGA AGAACTTCTG ATCGGGGATA ATCCGGAAAC AACCGAACAT
CCCCGGATCA TGAAGGCGCG TGAGGATTTC CTGTCCTGGC CGGAGCTTTT GAAAAGGCTC
AACTCGCTCA ACGCGGCATT GGATCGGAAC GATATGGCCG CTGCACGTGC GATATTGGCC
GAGCTTGTTT CGGGCTATTC GTCGACGGGT GAGGTCTCGG ATCTGGCATT CACCGGCGCC
GAAACCAATA CGGCCGCCTG A
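
The GC content field reported in this record can be recomputed from a sequence written in the space-separated 10-base blocks used here. This is a minimal sketch: the function name `gc_percent` is hypothetical, and only the first row of the gene sequence is used as sample input, so the printed value describes that row and not the whole gene.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First row of the gene sequence, copied from this record.
first_row = "ATGCAAGCGC TCGTCGCCCC TTTGCTGGCG ATGCCGCGGG TTGCCAAACG CGCTCTGGCA"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence to the same function would give a figure comparable to the GC content field.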
 
Protein sequence
MQALVAPLLA MPRVAKRALA LLVDSSFCIL TIWLAYCFRL NEWTVLTGVQ WLPVFVSLCM 
ALPIFIVMGM YRAIFRYANL AAFITVLKAI AIYGFAFMTI FTALSVPGVP RTVGILQPFL
LLIGIGLSRL GIRYWLGDAY QRILHKNMLA KVLIYGAGKA GRQLAAALTN SAELNVVGYL
DDDPRLKGGV MGGLPIYDPS DLPVLAETLG VHNVLLALPS ASRQRRNEIL EHIRKARVNV
RTLPDLTALA QGRVAVSDIR ELEIEDLLGR EAVAPRQELL DKAMRNKVVM VTGAGGSIGG
ELCRQILRNA PSSLILLDQN EFALYNIDAE LRKLAELYEH ENLQIVPILC SVRDQDRVEH
IIQSWRPQTL YHAAAYKHVP LVEHNAVEGI KNNVMGTLVA ARAARKYGVS NFVLISTDKA
VRPTNVMGAS KRLAEMVLQA LAAESATDRL RTNFSMVRFG NVLGSSGSVV PLFRQQIKEG
GPVTLTHREI TRYFMTISEA SQLVIQAGAM GEGGDVFLLD MGEPVRIADL ARKMVELSGL
SVRDDISPEG DIELSVTGLR PGEKLYEELL IGDNPETTEH PRIMKAREDF LSWPELLKRL
NSLNAALDRN DMAAARAILA ELVSGYSSTG EVSDLAFTGA ETNTAA
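
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. The sketch below is not the database's own pipeline. It builds the standard codon table, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the start codons it permits), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks and the final row of the gene sequence in this record.
first_blocks = "ATGCAAGCGC TCGTCGCCCC TTTGCTGGCG"
last_row = "GAAACCAATA CGGCCGCCTG A"

print(translate(first_blocks))  # MQALVAPLLA
print(translate(last_row))      # ETNTAA
```

The two outputs match the first ten and last six residues of the protein sequence listed above.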