Gene Rleg_0359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0359 
Symbol 
ID8011566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp365518 
End bp367491 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content61% 
IMG OID644822954 
Producthypothetical protein 
Protein accessionYP_002974209 
Protein GI241203113 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.224796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0716049 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCTT CGGCGATCGA CAGCCATGCC AGAACATTTC CTTCACGGAG TTGGGTTTTG 
AACAGCAAAG TTTCGTTCTC TGCGGCATCC AAGGATTACC AGATGGGCCG CTACACACAG
TCCTTGGCGA CGCTCAACCA GTTGATGGAT ATCCAGCAGG ACGCCAAGAC CTATGCGCTG
CTGGCCAAGA ACCTCGTGCA GCTCGGCTTC AAGGCCGATG CCGCGAAAGC CTATGGCCTG
GCCGGGAATT GCGAAGGACC GAATTCCTAC GAATACCAGA AACAGGCTGC CAAGCTGCAT
TACGAGACCG GCAACGAGGA CGATGCTCTG CTGATCGCCA TGCGAAACCT CACCAAGGCG
CAGGAAGACG CGGAACTCGC CTTCATCATC ACCGCGATCT ATCTGAAGCG CCAGCAGCGG
GATATCATCC GCCCGTTCAA GACGGTGCTG TCGCAGAGTG CCAATCCCGA TCATATGCGC
CTGGCGGCCC TGCTGCTGAG CGACGATCTG AACGACGCAA CCAACCAGAA CCTCGCGCGC
AACATCTTCA AGCGTTTTCC TGGCAACCTC GCATTTCGCT TCCTTCACCT TGTTTTCGCC
CGCGAATTCA ATGAGTTCGA AGAGGCGAGC AAGCATCAGG CGGTAATCGA CGCGGCCCTT
GCAAAGGGCG ATATCGAGAT CCTGCGCAAG GACAACCCGT TCTATCATCT GCACTGGTGC
GGCAACGAAG ACTTCAACCG GTATGCGACG ATCGGCACCA CCCCCCTCAA CCAGGAACGG
GTGGCCTTCC GCCGCAATCA ACCGCACACA TGGTCGGACA AGATCCGCAT CGGCTACATG
TCATCAGACT TCTGGGATCG CCATGCGACG ATGAAGCTCT TGCAGCGCAT CCTGGAACTG
CACGACACGG ACCGGTTCGA GGTGACGCTC TTCTGCCATA CGGGCCCGGA ATACCTCAAG
CACAACACGA TCGACCGCAG CCGCTGGGGC CGGATCGTCA CCGTTCACGG CTTCTCCGAT
CAAGCGGTGC TGGAAGCGGT GCGCGAGCAT AATATCGACA TCATGGTCGA CCTGAAGGGC
CACACTTCGG GCAGTCGCGC GACGGCCTTC AACCTGCCGC TCGCACCGGT GCATGTCGGC
TGGCTCGGTT TCCCCGGCAG CACGGTCAAT GTCGATCTCG ATTACGTCAT CGGCGACCAT
TTCGTCCTGC CCGAGGTGGC CAAGCCCTTC TACCACGAGA AGTTCTGCCG CCTGCCGGAG
AGCTACCAGC CGAACGACCC GATGCATCGT CCGAAGCCGC GTCCTGTCAC CCGCGAGCAA
CTCGGCCTGC CTGATGACGC CTTCATCTTC GCGTCCTTCA ACGGTAACCG CAAGATCACG
CCGGAGACGA TCGACAGCTG GTGCCGCATT CTCAAGCGCG CACCGAACAG CGTGCTCTGG
CTGATGGCGA ATACGCCGCG CAACCAGGCA AACCTCCTGA AGCAATTCCA GACGGCGGGC
ATTTCCGCCA AGCGGATCAT CTTCTGCCCC CGCGCGCCCT ATGAACAGCA CATCGACCGC
CAGCAGGCGG CCGATATCGG CATCGATACC TTCCCCGTCA ATGGCCACAC GACCACCTCG
GAGCAGCTCT GGGGCGGCCT GCCGGTCCTG ACCGTCAAGG GCACCAACTT CGCCTCACGC
GTCAGCGAGA GCCTGCTCAG GGCCATCGAC CTGCCTGAGC TCGTCGCGCC CGACCTGCGG
GCCTATGAGG ATATGGCTGT CGAACTGGCT GAGAATCCCG GACGGATCGC TGAGTACAAG
GCGCATCTCA AGGAGAAGCG CTATACCGCG CCGCTCTTCG ACGCAGAACG GTTCTGCGAC
CATCTGGAGC AGGCCTATCA AATCATGGCC GAGCGCGCCA AGCAGGGTCT TGCCCCCGAT
CACATGGACA TTCCGGCGTT GCCGCCGCGC ACGGCGCCGT TCGCGGCCGA ATGA
 
Protein sequence
MRASAIDSHA RTFPSRSWVL NSKVSFSAAS KDYQMGRYTQ SLATLNQLMD IQQDAKTYAL 
LAKNLVQLGF KADAAKAYGL AGNCEGPNSY EYQKQAAKLH YETGNEDDAL LIAMRNLTKA
QEDAELAFII TAIYLKRQQR DIIRPFKTVL SQSANPDHMR LAALLLSDDL NDATNQNLAR
NIFKRFPGNL AFRFLHLVFA REFNEFEEAS KHQAVIDAAL AKGDIEILRK DNPFYHLHWC
GNEDFNRYAT IGTTPLNQER VAFRRNQPHT WSDKIRIGYM SSDFWDRHAT MKLLQRILEL
HDTDRFEVTL FCHTGPEYLK HNTIDRSRWG RIVTVHGFSD QAVLEAVREH NIDIMVDLKG
HTSGSRATAF NLPLAPVHVG WLGFPGSTVN VDLDYVIGDH FVLPEVAKPF YHEKFCRLPE
SYQPNDPMHR PKPRPVTREQ LGLPDDAFIF ASFNGNRKIT PETIDSWCRI LKRAPNSVLW
LMANTPRNQA NLLKQFQTAG ISAKRIIFCP RAPYEQHIDR QQAADIGIDT FPVNGHTTTS
EQLWGGLPVL TVKGTNFASR VSESLLRAID LPELVAPDLR AYEDMAVELA ENPGRIAEYK
AHLKEKRYTA PLFDAERFCD HLEQAYQIMA ERAKQGLAPD HMDIPALPPR TAPFAAE