Gene Rleg_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1081 
Symbol 
ID8015519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1061329 
End bp1062267 
Gene Length939 bp 
Protein Length312 aa 
Translation table11 
GC content63% 
IMG OID644823664 
Productprotein of unknown function DUF6 transmembrane 
Protein accessionYP_002974915 
Protein GI241203819 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCGG TTCTCAGAAA CCCGATGAGA GGCATTGCGC TGAAAGTCTC GTCGGTCGTG 
GTCTTCCTGG CCATGCAGAC CTTCATCAAG CTGGCGGGCT CAGACATCCC GCCGGGTCAG
GTCACCTTCT GCAGGTCGTT CTTCGCGCTC TTCCCGATCA TGGCCTATCT GGCCTATAAC
AGGCAGCTGC GCGCCGCCTT CTACACCGCC AATCCAATCG GTCATCTGAA GCGCGGCACG
ATCGGCATCT TGTCGATGGC TTTCGGTTTC TACGGCCTCC TGCATCTGCC GCTGCCCGAG
GCGATCGCGC TCGGCTATGC GCTGCCGCTC GTCGCCGTCA TCTTCGCTGC CGTTTTTCTC
GGCGAGACCG TGCGCATCTA TCGCTGGAGC GCCGTCCTGG TCGGCATCGT CGGCGTCGCC
ATCGTTTCCT GGCCGAAACT CACGCTGTTT CGCGACGGCG GCATGGAAGC AGACCAGGCC
GTCGGTGCGC TCTGCGTGTT GTTCTCGGCC GTTCTCGGCG GCGTGGCGAT GATCCAGGTG
CGCCGGCTCG TCGAGGAAGA GAAGACTGCG ACGATCGTGC TGTATTTCTC GATCACCGCC
TCGGTCTTCT CGCTGGCTTC TCTTCCCTTT GGCTGGCTCA TCCTGCCATG GCCGACGGCG
CTCTATCTGA TCGCCGCCGG CTTTTGCGGC GGCGTCGCGC AGATCCTGCT GACGGAAAGT
TACCGCCATG CCGACGTCTC CACCATCGCA CCGTTCGAAT ATACCTCGAT CCTGCTCGGC
GGCATCGTCG CCTACTTCGT CTTCGGCGAC GTGCCGAGCG TGACCATGCT GATCGGCACC
GTCATCGTCG TCGCCGCCGG CATCTTCATC ATCTATCGCG AGCACCAACT GGGCATCGAG
CAAAGAGAGG CGCGCAAGGC CACGACGCCG CAAGCCTGA
 
Protein sequence
MHSVLRNPMR GIALKVSSVV VFLAMQTFIK LAGSDIPPGQ VTFCRSFFAL FPIMAYLAYN 
RQLRAAFYTA NPIGHLKRGT IGILSMAFGF YGLLHLPLPE AIALGYALPL VAVIFAAVFL
GETVRIYRWS AVLVGIVGVA IVSWPKLTLF RDGGMEADQA VGALCVLFSA VLGGVAMIQV
RRLVEEEKTA TIVLYFSITA SVFSLASLPF GWLILPWPTA LYLIAAGFCG GVAQILLTES
YRHADVSTIA PFEYTSILLG GIVAYFVFGD VPSVTMLIGT VIVVAAGIFI IYREHQLGIE
QREARKATTP QA