Gene Rleg_0384 details


Gene Information

Locus tag: Rleg_0384
Symbol:
ID: 8011589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 390778
End bp: 392550
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 68%
IMG OID: 644822979
Product: putative succinoglycan biosynthesis transport protein
Protein accession: YP_002974234
Protein GI: 241203138
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID:
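
The length fields in this record follow from the coordinates. The sketch below is an illustration, not part of the source record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Coordinates copied from the record above.
length = gene_length_bp(390778, 392550)
print(length, protein_length_aa(length))  # prints "1773 590"
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields.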


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.69459
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGATA TCAGGGCCAG AAACAACTTC CAGGATCGCG TAGCCGCAGG GCCGCGCCAC 
GATGACGATT CCTATGGGGC ACCCCGGTCT CGCCAGCCTG ATATCGCGCC GCCCGAGGCC
GAATTGCTGC GCGCCATCGG CCGGGTGCTG GAGGAGCAGC GCGCCAGGGC GGCGCGCACC
GCACCGCTGG TCGACCGCAT CGAAACCATC CTCGGCAACC GCCTCCGGGC CGCCAACGAC
GTCGGCCATC TGCTTCTTCA GCAGCCGGCG GGCGACGAAG AGCTGAGCGT TCCGGCCGAC
GAGCCGATGA TCTCACCCGA ACCGTTGGCA GCGGTCCGTG AAGAACCCGC GGCGGCCCGG
TCGGTCGCAC CGCGCCGCCG CGTGGGCGGC ATTGGTCTTG TGATGATGGT GGCCGCCGCG
ACGATCGTCG GCGCGGGCTT GCCGGCGCAG ATGCCGGCTT CGCCGGCCCT CTACCGTGCC
GAGGCGACGC TTGCGGTGAA GAGCGATGCC GCAAGCCGCG CTGCCTTCAC CCAGGCCGCG
GCGAAAGGGC TGCTGTCGGC GCGGGTGGTT GCTTCGACGG TGGCCGCCCT GAAACTCGAT
CACGACCCCG AATTTGCCGG CGCCAGTGCC AATGCGCTCG GCGTCGCGCT CGATCTCCTC
TCGGCGACCG GCGCTGCTGC CGATCCGGCC TCGCGCGCCG AGGCGACGCT GAAACATGCG
GTCGAGATAC TGCCCGATGC CGCCGCCGGC ACCATCCTCG TCAGGGCGAC GACCGGTAAT
AGCGGCAAAT CCATGCGCAT CGCCGCAAAG CTTGCCGAAG CGGTGTCCGC AGCAGACGGA
CCCGGCGGCA ACGTCGAGAC CGATACCGCC TTGCGCAAAA CCTATGACGA GGTGAAAGTG
GAGCTTGCCG CCTTCACGGC GAAGAGCGGC GAGGGCAACG TCAAGGTGGC GATCGATCTT
CGCCGCCAGA TCGACCAGCT CGATGCCGAT CTGAAAGCGG CCGACCAGAA CATCCTTGCC
GCCAAGGCGC AGACCGACCG GCTCAAAGCC GCAAAACTTG CCGAAGTGCT CGACGGTTCG
CTCCCCTCCG ATATGCTTTC GCCGGCGCTG CAGGACTGGC GTGACAAATA TGCCGTCGCC
AAAACGACGC TTGCGCAGCT TTCGGCCGAG CTCGGCCCGC GCCATCCGCG CCTCTTGCAG
CAGCAGGCCG AAACCGACGG TCTGAAGGAG AATATGGGCA AGGAGCTGAC CCGTCTTGCC
CAGACTGCCA ACCTCGCCGC CAAAGCCGCC GTCGATGCGC GCAAGGGCCT GAACGATCGC
CGCAACACGC TGATCGCCCA GAGCCGGGAC ACCGGCGTCG ATTTGTCACG CCTGACCGAG
CTCAGCGAAA AGGCGAATGC CGCCCGCTCG CGCCTCGAAG AGGCGACGTC CACAGCGGTG
GAAACGGCCG CCGACGGTCG TATCGTTCTC CTGAAGCCAG CATTGGCAAC CGCAGTATCG
GGAAGCGATG GCCTGACCGG CCGCTCGCTG GCCGGTGCCG CAGCGGGCCT TGCGATAGGC
CTTGCCGCGG CTTTCTTGCT CCGGCTGCGG AGGCCCGTCG CCGATCCCGC AAGGGAAAAA
ATGCCAGTAC TTCGGCCGCA GGCCGCACTG CCGTCGATGC CCGCGCAGGC CCCCGCACCG
GTCGACGAGA TGGAGCTGCT GCGCTCCGAG ATCTCCGGCC TGCGCGACCG GCTTCGTGTT
CATGCGCTCG AAGCGCGGCA GCCGCTACGC TGA
 
Protein sequence
MYDIRARNNF QDRVAAGPRH DDDSYGAPRS RQPDIAPPEA ELLRAIGRVL EEQRARAART 
APLVDRIETI LGNRLRAAND VGHLLLQQPA GDEELSVPAD EPMISPEPLA AVREEPAAAR
SVAPRRRVGG IGLVMMVAAA TIVGAGLPAQ MPASPALYRA EATLAVKSDA ASRAAFTQAA
AKGLLSARVV ASTVAALKLD HDPEFAGASA NALGVALDLL SATGAAADPA SRAEATLKHA
VEILPDAAAG TILVRATTGN SGKSMRIAAK LAEAVSAADG PGGNVETDTA LRKTYDEVKV
ELAAFTAKSG EGNVKVAIDL RRQIDQLDAD LKAADQNILA AKAQTDRLKA AKLAEVLDGS
LPSDMLSPAL QDWRDKYAVA KTTLAQLSAE LGPRHPRLLQ QQAETDGLKE NMGKELTRLA
QTANLAAKAA VDARKGLNDR RNTLIAQSRD TGVDLSRLTE LSEKANAARS RLEEATSTAV
ETAADGRIVL LKPALATAVS GSDGLTGRSL AGAAAGLAIG LAAAFLLRLR RPVADPAREK
MPVLRPQAAL PSMPAQAPAP VDEMELLRSE ISGLRDRLRV HALEARQPLR
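
The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the method used by the database: the helper names are hypothetical, the input is assumed to be the coding strand in frame, and the codon table is built from the amino acid assignments of NCBI translation table 11. Alternative initiation codons allowed by table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid assignments
# of translation table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGTACGATA TCAGGGCCAG AAACAACTTC CAGGATCGCG TAGCCGCAGG GCCGCGCCAC"
print(translate(first_line))  # prints "MYDIRARNNFQDRVAAGPRH"
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content field of the record.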