Gene Rleg_5612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5612 
Symbol 
ID8016838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp198711 
End bp201002 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content60% 
IMG OID644827777 
Productglycosyl transferase group 1 
Protein accessionYP_002978977 
Protein GI241518349 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.48049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.331243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTTG AACCATCTCC CCTGCGGATC CTTTTCGTTT TCGCCTGGCT GGTCGTCGGG 
GGCGAGGAGA CGGAAGTCCG GCTGCTTGCC AAGAATCTCG ACCGCCGTCG CTATCGCCTC
GACGTCGTGG CCTGCTTCCG GAAACCCGGC ATGCCAGAGC AAACACACCG GCAGCTTCGA
GGGCTGGGGA TCGATGTCGA TACGACGCCC TACGAATTGT CCTTCGAGGA TACGGTAAAA
TATCTCGCCG GTAAAATCTC GGGTTACGAC ATTGTCGTTT CCTGTCAGAA CGTAGCGGAC
ATCTATCCGG CATTGGAGCG ACTGCATCTT CGCCCGCCCC TCATTGAACA TGGCGGGCTG
GTCTCAGAGG CCTTGGCCGG CCCCAAACAT CTGACGGCGC GTTATGTGGG CGTGTGCCGC
ACGATCCGTG ACGCGGCCGC GTCACGAATG CCCGGGCGCG ACCGTCATGC CTTGGAAATC
CCCTCGATGA TCGATCTCTC GGCCTTCGAT CCGGCACATC GGGAGCGGGC ACGAGCGGGT
CTCGGGATCG CGACAGATGA GGTTCTCATC GGTTGGGTCG GCCGGCTCGA TCCCAAAAAG
AACGTCGAGG ATTTTATCGA GGCCGCCGCG CTGGTTCATG CCACCACGAA AAGTGCACGG
TTCGTTATTG TCGGAGGGCC GGATGCCTTC CAGCCGGAAT ATGCCGTGCA GCTCAAAGCC
CTGACCACCC GGCACGGGCT CGATGGAACG CTTCAGTTTC TCGGCGACCG GAGTGATATC
CCGCCCCTGC TTGCCGCATT CGATATATTC GTTTGGCTGT CATCCGGCGA AGGCATGCCG
CATGTTATCG CCGAGGCCGG GGCAGCCTCC CTCCCGGTGA TTGCGACGCC TGACAACGGT
GCCATGCAGC AGATCGATGA CGGGCTGTCC GGTGTCTTCG TGCCCCATCG CAGTCCAGGC
ATAGTCGCGA ACAATATCAT CGCGCTTATC GAGTCTCCCG CGCGCCGTCA CGCACTCGGC
ACCGCCCTGC GACGGAAGGT CGAGATGGAC TACTCCGTCG AGGCAGTGCT GCCGCGGTGG
GAGCGACTTC TAGCGGATGT CCATCGAGAG CGAAAGGCAG CGCGTCCTAC AGGTCTGTTC
CAATCATTCC TGCAAGGCGG TTTCGAATGT TCGAGCCATC GGCTCCGGCC AAGAAACGGT
CAGACGCAGG GCAATCGACT CGACTTAATC GCCGCGACTG GTCATGATCG CCATGCCGAG
ACGGACTATC GCCAACTTCA AGGCTTCGGA CTCACCACTG TTCGCGATGG CTTCCGATGG
CACCTGATTG AAAAAAATGG CCGATACGAC TGGTCGAGCA TCCGTCCGAT GCTCCAGGCG
GCTAAGCTAA CGAAGACCCA AGTTGTCTGG GATCTCCTGC ATTATGGCTG GCCCGACGAT
CTCGACATCT GGTCGCCGCG CTTCGTCGAT CGCTTCGCGC GGTTCGCACG TGCTTGCGCC
GAGTTGGTCC GCGAGGAGAG CGACGGCATT CCGTTTTATT GCCCTGTCAA CGAGATCTCG
TTTTTCTCAT GGGGCGGTGG TGACGTCGGC TATTTGAACC CATTTGCCAA TGGGCGCGGA
TTCGAACTCA AGGTGCAGAT GGCACGCGCC GCCATCGCGG CGATGGATGC CATCATCTCA
GTGGATGCCC GGGCCCGCTT CGTGCACTGC GAGCCGGTGA TCAATGTCGT CGCGGACCCG
TCGCGTCCCC ACGATGCGCA TACCGCCGAA GGACATCGCC AATCGCAGTT CCAGGCCTGG
GACTTGATTG GCGGAAGGAT GTGGCCACAG ATCGGAGGCG GCGAACGCTA TCTTGATATC
CTCGGTGTGA ATTACTATTC CAATAACCAG TGGATTCACG GCGGTCGGCC GATCGACGTT
GGGCATCCGC TTTATAAGCC CCTCAGCCGA ATTCTGGTCG AAACGTTCGC GCGCTACGGC
AAGCCGATGC TTATTGCCGA GACCGGCATC GAGGATGACC GTCGCGCATC CTGGTTGGAC
TACGTCGCCG ACCAAGCTCT GGACGCGATC CGCTCGGGCG TGCCGTTGGA AGGTTTGTGC
CTTTATCCGA TCGTCAATCA TCCTGGCTGG GATGATGATC GGCCTTGCGC GAACGGCCTC
CTCTCCGCCG ATGTCGCCCA GGGAGGACGG GCGCCATTTG CTCCCTTGGT TGCTGCGATA
CGCGAACGAG CAAAAGAGTT CGCAAGCTTT GGGCAGCATC CTATTGGCGG CGCGAAAGCT
AGTGACACCT GA
 
Protein sequence
MTVEPSPLRI LFVFAWLVVG GEETEVRLLA KNLDRRRYRL DVVACFRKPG MPEQTHRQLR 
GLGIDVDTTP YELSFEDTVK YLAGKISGYD IVVSCQNVAD IYPALERLHL RPPLIEHGGL
VSEALAGPKH LTARYVGVCR TIRDAAASRM PGRDRHALEI PSMIDLSAFD PAHRERARAG
LGIATDEVLI GWVGRLDPKK NVEDFIEAAA LVHATTKSAR FVIVGGPDAF QPEYAVQLKA
LTTRHGLDGT LQFLGDRSDI PPLLAAFDIF VWLSSGEGMP HVIAEAGAAS LPVIATPDNG
AMQQIDDGLS GVFVPHRSPG IVANNIIALI ESPARRHALG TALRRKVEMD YSVEAVLPRW
ERLLADVHRE RKAARPTGLF QSFLQGGFEC SSHRLRPRNG QTQGNRLDLI AATGHDRHAE
TDYRQLQGFG LTTVRDGFRW HLIEKNGRYD WSSIRPMLQA AKLTKTQVVW DLLHYGWPDD
LDIWSPRFVD RFARFARACA ELVREESDGI PFYCPVNEIS FFSWGGGDVG YLNPFANGRG
FELKVQMARA AIAAMDAIIS VDARARFVHC EPVINVVADP SRPHDAHTAE GHRQSQFQAW
DLIGGRMWPQ IGGGERYLDI LGVNYYSNNQ WIHGGRPIDV GHPLYKPLSR ILVETFARYG
KPMLIAETGI EDDRRASWLD YVADQALDAI RSGVPLEGLC LYPIVNHPGW DDDRPCANGL
LSADVAQGGR APFAPLVAAI RERAKEFASF GQHPIGGAKA SDT