Gene Rleg_3230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3230 
Symbol 
ID8014122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3234112 
End bp3236388 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content58% 
IMG OID644825791 
Productexopolysaccharide transport protein family 
Protein accessionYP_002977018 
Protein GI241205922 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01005] exopolysaccharide transport protein family
[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTATCGC CAGATAAACT TACGCCGCGT ATCGACCCCT CCCGCGATAC CGGAAATGAT 
GCTGATTTCA TCGATTTCGA CAAGCTCATC GCAATCGCCC GCCGACAGTG GCGGATGGTT
GCGGCGTGCG GTTTCGCCTT TGCCATTCTC GGCATTGTGT ATGTCTTGAC TTCGGTGCCT
GTCTACACCG CCGACACCAG CGTGCTGATC GACCGCAGCG ATAGTCAGGT GATCAACCAG
CTGGCGGCGT TCGGCCAGAT GGACGATGAC GAAGGTACTG TCCTCAGTCA GGTCGAGCTG
TTGAAGTCCG ACACCATCGC CTATGCTGTT GTCGACAAGC TGAAACTCGT CGACAATCCG
GAGTTCATGG GACCGAAAAG CTCGCTTTTT TCTGTTTCAA CGTTGAAATC CTTTATGAAT
TTTCGGTCGT GGTTTGCCGA TGACGCAGCC GTCGCACCTG ATCCGGAAAT GAGGCGGCGA
GGCGCTGCCG AGACTGTCGC AGGCAATATC GATGTCGAAC GCGTCGGCCG CTCCTACGTC
CTCGATGTCA GTTATACAGC GCAGTCGCCG GACCTAGCGC GTGATATCGC GGCTGGGATC
GCCGACGTCT ATCTGGTCGA CAAGCTCAAT TCGAAATACG AGGCGACGCG CCGGGCCGGC
CAATGGCTGC AGGAGCGCAT CGAAGAACTC CGGCAGCAGG CGCTGGACAC CGACCTTGCG
GTTCAGAAAT TCCGCGGCGA GCATGGGCTT GTCGAGGCCG GATCCGGTAC GCTGATCAGC
GAGCAGCAGC TTTCCGAGAT CAATACGCAA TTGATCAATG CGCAGGCCGA GACGGCAAAG
GCCGAGGCAC GGTATGCGCG CGTCAAGTCG ATCATCGACG CCAAGCAGAC CGATGCGATC
GTCACGGATG TGCTCGACAG CTCCATTTCC AACGACCTGC GCAAGAAATA TCTGGAGGCT
TCCAAGCTTG AGACCGAAAT TGAGGCGCGG CTCGGTCCCG ATCACGTCCA GGCGGTTCGG
CTTCGTGCGG AAATGGAAGA ATATAAAAGG CTCATGTTCG ATGAGTTGAA CCGCATCGCC
GAGAGCTACC AAAGCGAGCT GCAGGTCGCA AAATCGCGGG AAAACTCTTT GCGCGACAGT
GTTACACAGG CGACGGGCGT AGCCGCAACG GCTGGCGAAA CGCAGGTACA GCTTCGCGAG
CTCGAGCGCA CGCGCGATAC CTACAAGAAT CTCTATCAGA GCTTCTTGAC GCGCTACCAG
GAAGCAATTC AGCAGCAGAG CTTTCCGATT ACCGCTGCGC GCATCATCAC GACGGCGGAG
ACGCCGACGA AGCCGAGCGC TCCGAAGAGA GCTCTCGTTG TTGCTTTCGC GATGTTCGTA
GGATGTGCAT TTGGCAGTGG TATAGCGGCC TTCCGCGAAT TCCGAGATCG GTTCTTCCGC
ACCGGCGATG ATGTCCGGGA CGTGCTTGAT GTCGAGTCTC TCGGGGTCAT GCCGCTGATT
GAAAATAACG TCGACGATCC GACGCTCGTT GATCCCAGCA ATCCGCGAAG CATCGCCAGG
GGCGGAAAGA CGACGACCTA TGTCGAGGAG CATCCACTCT CGGCATTTGC CGAGACGCTT
CGAAGCGCCA AGATTGCGAT CGACCTCAGT GCTGCCGATC AGCGCTGCAA GGTCATCGGC
GTCGTGTCGA GCTTGCCTGG CGAGGGGAAG TCGACGACAT CCATCAATTT CGCCAAGCTT
CTGGCAATGC AGGGTGCGCG TTGCCTGCTT ATCGATGGCG ATATGCGCAA TCCCGGTGCC
ACCCGGGCGA TCGGCCGCCA TGCCGAGGCC GGCTTGCTCG AGGCGATCGT CGACAGCCGT
CCGCTCAAGG ACCTGATCCT TCTCGATCCC AAGACAAAGC TGGCATTTTT GCCGACGGTG
GCAAGGTATC GTGTTCCGCA TTCGTCGGAA CTGCTTGCCT CACGCGGCAT GGATCAGCTT
CTTGAAACGG CGCGTCAAAG CTTCGACTAT ATTATCGTCG ACCTACCGCC CTTGGCGCCG
GTCGTGGATG CGCGCGCTAT CAATTCGAAA CTCGATGCGG TGGTTTTCGT TATCGAGTGG
GGAAAGACAT CGCGCAAGGT CGTGCAGTCG ACCCTGCTGT CCGAGCCGGA ACTCTATGCG
AAGTGCGTCG GTACTATCTT GACCAAGGTC GACCCGTCGC AGATGAAACT CTACCGGACA
TTCGGTTCGA GCGAATACTA TTATAAGCGT TATTCGCGAT ACTATACCGA AAGCTGA
 
Protein sequence
MLSPDKLTPR IDPSRDTGND ADFIDFDKLI AIARRQWRMV AACGFAFAIL GIVYVLTSVP 
VYTADTSVLI DRSDSQVINQ LAAFGQMDDD EGTVLSQVEL LKSDTIAYAV VDKLKLVDNP
EFMGPKSSLF SVSTLKSFMN FRSWFADDAA VAPDPEMRRR GAAETVAGNI DVERVGRSYV
LDVSYTAQSP DLARDIAAGI ADVYLVDKLN SKYEATRRAG QWLQERIEEL RQQALDTDLA
VQKFRGEHGL VEAGSGTLIS EQQLSEINTQ LINAQAETAK AEARYARVKS IIDAKQTDAI
VTDVLDSSIS NDLRKKYLEA SKLETEIEAR LGPDHVQAVR LRAEMEEYKR LMFDELNRIA
ESYQSELQVA KSRENSLRDS VTQATGVAAT AGETQVQLRE LERTRDTYKN LYQSFLTRYQ
EAIQQQSFPI TAARIITTAE TPTKPSAPKR ALVVAFAMFV GCAFGSGIAA FREFRDRFFR
TGDDVRDVLD VESLGVMPLI ENNVDDPTLV DPSNPRSIAR GGKTTTYVEE HPLSAFAETL
RSAKIAIDLS AADQRCKVIG VVSSLPGEGK STTSINFAKL LAMQGARCLL IDGDMRNPGA
TRAIGRHAEA GLLEAIVDSR PLKDLILLDP KTKLAFLPTV ARYRVPHSSE LLASRGMDQL
LETARQSFDY IIVDLPPLAP VVDARAINSK LDAVVFVIEW GKTSRKVVQS TLLSEPELYA
KCVGTILTKV DPSQMKLYRT FGSSEYYYKR YSRYYTES