Gene Rleg2_2949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_2949 
Symbol 
ID6981694 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3006507 
End bp3007685 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content60% 
IMG OID643397660 
Productglycosyl transferase family 2 
Protein accessionYP_002282443 
Protein GI209550526 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.80757 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGAAC CAATACCGTC CGATCTGAAC GATATAAATT TTCCCGTTCC GATTCAATAT 
CTCGATCTCG CATCCTGCCA TCTCGATGGC GTCTCGTCCC CGCTCTCCGA CGGGCTGGTG
GTCTTTCTGT CGAATGGGCT TCCGGTCGGA CAAGCCTATC TCGAAATGCC CGAGACGGCG
GCGGCACTTG CCGAGCGCGT CGTGCGGCCG GAGACGCTCG AGCACATCGC CCAAGTCGCG
CAAACGCCTC TTGGCAATCG CGACGTCAGC ATTCTGATCT GCACCAAGGA CCGGCCGGAG
GAATTGCGCC GGTGCCTGGC CTCGATCCCC GAACAATCGC TGAAGCCCGT CGAGATCATC
GTTGTCGACA ATGCCTCATC AGGCGATGCG ACGCGTCGCA TGGTCGAAGA AGCCGGCGTC
ACCTATGTCC GTGAAGATCG GGTCGGGCTG GACCATGCCC GCAATGCGGC CATTCGGGCG
GCGAAAACCG AATTCGTCGC CTTTACCGAC GATGACGTGG TTCTGCACGG GCGATGGCTG
GAAAACCTGA TGAAGGCGTT CGACCGGGCG GAGATTGCCT GCGTGACCGG GTTGATCCTT
CCTGGAGAAC TGGCAACGCC TGCACAATTT ATCTTCGAAA CACATTGGAG TTTCGGCAGA
GGTTATCTCC GGCAGGATTT CGACCGGGAT TTCTACCGCT TGCACGACCG TTACGGCGCC
CCGGTCTGGA CGATCGGCGC CGGCGCCAGC CAGGCGTTCC GCCGCAAGGT CTTCGAGGAG
ATCGGCCTGT TCGATGTCCG CCTGGATATG GGCGCCGCCG GATGCTCAGG CGATTCCGAA
TACTGGAACC GGTTGCTCCA TCACGGCCAT GTCTGCCGCT ACGAGCCGAC GGCCGTCTCC
TGGCACTTTC ATCGCAAGGA TATGAAGGGG CTGGCCAAAC AGATCCACCA ATATATGAGC
GGCCATATCG CGGCACTGCT GGTGCAGTAT CAGAATACCG GACGTAGCGG CAACCTGCGG
CGCATTCTTC TCTCCTTTCC AAGATATTAT GCCGGAAGAC TACGCAGGCG CTTGCGCAAG
GGTGCGACCT CGCGCGACTT TTTCCTCAAG CAGGAGATGC TCGGCAGCGT CCATGGCGCT
TTCTACGTTC TGCGCCGCTG GAAGATGCCC GCATGGTGA
 
Protein sequence
MREPIPSDLN DINFPVPIQY LDLASCHLDG VSSPLSDGLV VFLSNGLPVG QAYLEMPETA 
AALAERVVRP ETLEHIAQVA QTPLGNRDVS ILICTKDRPE ELRRCLASIP EQSLKPVEII
VVDNASSGDA TRRMVEEAGV TYVREDRVGL DHARNAAIRA AKTEFVAFTD DDVVLHGRWL
ENLMKAFDRA EIACVTGLIL PGELATPAQF IFETHWSFGR GYLRQDFDRD FYRLHDRYGA
PVWTIGAGAS QAFRRKVFEE IGLFDVRLDM GAAGCSGDSE YWNRLLHHGH VCRYEPTAVS
WHFHRKDMKG LAKQIHQYMS GHIAALLVQY QNTGRSGNLR RILLSFPRYY AGRLRRRLRK
GATSRDFFLK QEMLGSVHGA FYVLRRWKMP AW