Gene Rleg_3059 details

Gene Information

Locus tag: Rleg_3059
Symbol:
ID: 8013971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3056030
End bp: 3058099
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 64%
IMG OID: 644825627
Product: protein of unknown function DUF1355
Protein accession: YP_002976855
Protein GI: 241205759
COG category: [S] Function unknown
COG ID: [COG5426] Uncharacterized membrane protein
TIGRFAM ID:
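
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length counts the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3056030
END_BP = 3058099
GENE_LENGTH_BP = 2070
PROTEIN_LENGTH_AA = 689


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


print(span_length(START_BP, END_BP))            # 2070
print(expected_protein_length(GENE_LENGTH_BP))  # 689
```

Under those assumptions both results agree with the Gene Length and Protein Length fields.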


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.278325
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTCG ACTTTTCGCC CTTCCTGCCC TGGTCGGTTC TGGCTGCACT CGCGGTCGTC 
AGCGCTGTTA TCGCCGCTTT CGCGATCTGG CGCGGCATTC GCGGTGCCTG GATCAGAACG
CTGGCAGCAC TCGCCATGTT GACCGCCCTT GCCAATCCTG TCCTGCTGCA GGAGGATCGG
GATCAGTTGT CGACGATCGT CCCGGTTCTT GTCGATCGAA GCCAGAGCCA GCAGACGCCC
GATCGCGTCA AGATGACCGA CGATGCGCTT GCGGCCCTGA AGGGACAACT CGCCCGCTTC
CCCCAGATAG AGCCCCGTTT CGTCGATGTC GAAGGCGACG TCAATTCCGA CGTGCCGTCC
ACGCGTTTGT TCGACGCGCT GTCGGCCAAC ATTGCCGATG TCCCGCCCGC CCGTATCGGC
GGCGCCATCA TGCTGACCGA CGGTGAAGTC CATGATGCTC CGGCCGCCAA TCAGGCGCTT
GGTTTCGACG CGCCGATCCA TGGCCTCATC ACCGGCAAAG CCAACGAATT CGATCGCCGC
ATCGAAGTCA TCAAGGGGCC GCGCTTCGGC ATCGTCAACG AAGAGCAGCA GGTGATCCTG
CGCGTCTTCG ACGACGGCCC TGGCCCGGGC GGCACGGCCG ATGTCACGGT GAAGCTGAAT
GGCGACGAGA TCGCCACCCT GCAGGCGACA CCCGGGCAGG ATACGCCCTT CTCCTTCAAG
GTGCCGGGTG GCGGCAGCAA TGTGCTCGAA TTCTCGGTCG CGGCACTTCC CGGCGAAGTC
ACCACCGCCA ACAACCGCGC CGTCCACGTC ATCGACGGCA TCCGCCAGAA TCTCCGCGTC
CTGCTTGTCT CCGGCGAGCC GCATGCCGGC GAGCGCGCCT GGCGCAACCT GCTGAAATCC
GATGCCTCGG TCGATCTCGT CCACTTCACC ATCTTGCGTC CGCCGGAAAA ACAGGACGGC
ACGCCGATCA ACGAGCTGTC GCTGATCGCC TTTCCGACGC GTGAGCTCTT CGTCGACAAG
ATCAAGGATT TCGACCTGAT CATCTTCGAT CGCTACCAGC ACCGCGGCGT GCTGCCGTTG
CTCTATTACG ATTACATCGC GCAATATGTC GACAACGGCG GCGCGCTGCT GATCGCCGCC
GGTCCCGAGC ATGCCGGCCC CGATTCCATT GCGCTGACGC CGCTTTCCAC GGTCCTGCCG
GCAACGCCCA CCGGAGAGAT GATCGAAAAG GCCTTCTATC CCCGCCTCTC CGAGGAAGGC
CGCAAACATC CCGTCACGCG CGGCCTGGAT GGCTCGGGCG AGGACCCGCC GCATTGGGGA
CGCTGGTTCC GCAGCGTCGA TGTCGAGCGG CCGCAGGGCG AGACGATCAT GCTCGGCGCC
GACAACCACC CGCTGTTGGT GCTGAACCGC GCCGGCCAGG GCCGCGTCGC CATGCTGCTT
TCCGATCAGG GCTGGCTCTG GGCGCGCGGC TTCGAAGGCG GCGGTCCGAA CGTTTCGCTC
TATCGCCGTA TCGCCCATTG GCTGATGAAG GAGCCGGCGC TCGAGGAAGA AGCGCTGACG
GCGCGCGCCT CCGGCCGGAC CCTTGAAGTT ACCCGCCAGA CGATCGGCGA CAATCCCGGC
AACGCCACCG TGCGTTATCC CTCCGGCAAG ACCGAAACCC TGCCGCTCAC CCAGTCCGAA
CCGGGGCTCT ACAAGGCCGA GAAGAGGATG GACGAAATCG GTCTCTTCGA AGTCCGCAAC
GGCAAGCTGT CGACGCTTGT TCATATCGGC GCCGTCGACG CGCCGGAATT CAAGGCAATG
ATCTCGACGA CCGATGTGCT GCAGCCGGTC GCCGACAGAA GCAAGGGTCT CGTCACCCGC
GTCTCCAATG CAAATGGCGC GATCTCGGTC CCGCCGATCC TGCCGGTGCG CGGCCAGGTG
CGCGTCTCCG ACAACGATCG CATGATGATC CGCATGACCA GCGAGACGGT CTTGAAGGGC
ATCAACACGC TGCCGCTCTT TGCCGGGTTC GCCGGCGTCG GCATCCTGCT GCTCGCCTTC
GGCGCCATGT GGTGGCGCGA AGGGCGGTAA
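
The gene sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. A minimal sketch of doing that and computing a GC fraction follows; the helper names are hypothetical, and the record does not say how its own GC content figure was calculated.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the sequence, ignoring whitespace."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = (
    "ATGACCTTCG ACTTTTCGCC CTTCCTGCCC "
    "TGGTCGGTTC TGGCTGCACT CGCGGTCGTC"
)
print(len(clean_sequence(first_line)))
print(f"{gc_fraction(first_line):.2%}")
```

Passing the full gene sequence text to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.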
 
Protein sequence
MTFDFSPFLP WSVLAALAVV SAVIAAFAIW RGIRGAWIRT LAALAMLTAL ANPVLLQEDR 
DQLSTIVPVL VDRSQSQQTP DRVKMTDDAL AALKGQLARF PQIEPRFVDV EGDVNSDVPS
TRLFDALSAN IADVPPARIG GAIMLTDGEV HDAPAANQAL GFDAPIHGLI TGKANEFDRR
IEVIKGPRFG IVNEEQQVIL RVFDDGPGPG GTADVTVKLN GDEIATLQAT PGQDTPFSFK
VPGGGSNVLE FSVAALPGEV TTANNRAVHV IDGIRQNLRV LLVSGEPHAG ERAWRNLLKS
DASVDLVHFT ILRPPEKQDG TPINELSLIA FPTRELFVDK IKDFDLIIFD RYQHRGVLPL
LYYDYIAQYV DNGGALLIAA GPEHAGPDSI ALTPLSTVLP ATPTGEMIEK AFYPRLSEEG
RKHPVTRGLD GSGEDPPHWG RWFRSVDVER PQGETIMLGA DNHPLLVLNR AGQGRVAMLL
SDQGWLWARG FEGGGPNVSL YRRIAHWLMK EPALEEEALT ARASGRTLEV TRQTIGDNPG
NATVRYPSGK TETLPLTQSE PGLYKAEKRM DEIGLFEVRN GKLSTLVHIG AVDAPEFKAM
ISTTDVLQPV ADRSKGLVTR VSNANGAISV PPILPVRGQV RVSDNDRMMI RMTSETVLKG
INTLPLFAGF AGVGILLLAF GAMWWREGR