Gene Rleg_3232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3232 
Symbol 
ID8014124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3237223 
End bp3238491 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content56% 
IMG OID644825793 
Productpolysaccharide export protein 
Protein accessionYP_002977020 
Protein GI241205924 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTATTTTG TCAGATCGCG GGGCAGTAGC TTCAAGGGCG TCAACACAGG GTATTCTATG 
GGTTGTTCTC GTATCGGTAG TACACGCGTC GCCATTGTTG TAGCGCTGAC GACTATATTG
GCAAGCTGCA CGTCGTTACC AAGATCCGGT CCCGATCACA AAGACGTTGA TCGAGATGCA
GCGGTGAAAG TGACGACGAA GGAACGTCGT GTCGGTATCG ACTACGCCCT GGTTGATCTC
AGCAAGAACG TTCTGTCGTA TTTTACCGCT CCCCAGCCGA CCTCGTTCAA AGGGTTTGGT
GGTGGCCGAG GCGGGGCGCC CGAAATTCCA CTCGGTTACG GCGATGTCGT TTCGGTTGCC
ATCTTCGAAG CTCAGTCCGG CGGTCTCTTC ATTCCGTCCG ATGCAGGTAG CCGACCCGGC
AATTACATCT CGCTGCCAGA GCAGACCATC GATAGAAACG GAACGATCAC AATTCCCTAT
GCGGGTCGGG TTCCGGCTGC CGGTCGCCTG AAGGAGACCG TAGAGCAGGA CGTCGAGGAT
CGCCTGGCGA GCCGCGCGAT CGAACCGCAG GTAGTTATTA CGACAACGAC AAGCCGCTCC
AGTCAGGTTG CCATCCTCGG CGATGTCAAC AATCCGCAGC GCGTTGAGAT CAGCCCGGCG
GGTGAGCGTG TTCTCGATGT CATTTCCGCC GCGGGCGGTT TGACGACCAA CAATATCGAA
ACGAATGTGA CGCTGCAGCG CCGCGGCAAG ACGGCAACCG TCGCCTACAC CACGCTGTTG
AAGAACCCGG CCGAAAATAT CTATGTCGCA CCGGATGATA CGATCTCGAT CGATCATGAG
CGCCGTACCT TTCTTATGCT CGGCGCCGCA GGCACCAGTG GCCGCTTCGA TTTCGAAGAG
TCTAACCTGA CCCTTGGAGA GGCAATCGCC AAGGCGGGCG GCCTGCGCGA CGACCGCGCC
GATCCGGCTC AGGTCTTGCT CTATCGTCTT GTCCCGAAGA AAACGGTTCA AGCGATGCAC
GTGGACACGA CGAGATTTGC GAGTGAAATG GTTCCAGTGA TCATCCGTGC GAACATGCGT
GACCCGGCAA CCTTGTTTGC TGTTCAGCAG TTCAAGATGG AAGACAAGGA TATTATCTAT
ATTTCCAATT CGGACTCTGT TGAACTGGTC AAGTTCCTTG ACATCGTGAA CTCGGTATCA
TCCACTGTTT CCGGAGTGAC CGACGATGCG AATGATACCC GTAACGCGGT ACAGGATCTT
GGAAATTGA
 
Protein sequence
MYFVRSRGSS FKGVNTGYSM GCSRIGSTRV AIVVALTTIL ASCTSLPRSG PDHKDVDRDA 
AVKVTTKERR VGIDYALVDL SKNVLSYFTA PQPTSFKGFG GGRGGAPEIP LGYGDVVSVA
IFEAQSGGLF IPSDAGSRPG NYISLPEQTI DRNGTITIPY AGRVPAAGRL KETVEQDVED
RLASRAIEPQ VVITTTTSRS SQVAILGDVN NPQRVEISPA GERVLDVISA AGGLTTNNIE
TNVTLQRRGK TATVAYTTLL KNPAENIYVA PDDTISIDHE RRTFLMLGAA GTSGRFDFEE
SNLTLGEAIA KAGGLRDDRA DPAQVLLYRL VPKKTVQAMH VDTTRFASEM VPVIIRANMR
DPATLFAVQQ FKMEDKDIIY ISNSDSVELV KFLDIVNSVS STVSGVTDDA NDTRNAVQDL
GN