Gene Rleg_0625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0625 
Symbol 
ID8011806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp659294 
End bp660484 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content62% 
IMG OID644823215 
Productprotein of unknown function DUF1006 
Protein accessionYP_002974468 
Protein GI241203372 
COG category[S] Function unknown 
COG ID[COG3214] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.045012 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATC TGCTTTCCAA TTCCGACGCC CGCCGTGTCT TCCTTGCCAA GCAGGGCCTC 
AGCGCTCCGC CGAACCGCGC CTTGACCAAG GCCGGCTTGT TGCAGCTCAT CCATGAACTG
GGTTTCGTCC AGGTGGACAG CATCCAGACG GTGGAGCGGG CGCATCATCA GATCCTGTTT
TCCCGCAACC AGACCTACAG GCGCGAGCAT CTGACGGCGC TGCTCGAAAA GGACGGCGCG
CTGTTCGAGC ACTGGACGCA TGATGCTTCC ATCCTGCCGA GCGCCTTCTT CGTCTATTGG
AAGCACAAGT TCCTTCATCA GGAAAAGGTG CTCATCGAAC GCTGGCGCAA ATGGCGCGGC
GAGGGGTTCG AGGCGGCGTT TGCGGAGACC TATGAGCGCG TTGAGCGCGA CGGTGCAATC
CTCTCACGCG ATATCAAGGC CGATGGACAC GTTTCCGGCG GCTGGTGGAA CTGGCATCCG
AACAAGACGG CGCTCGAATA CTTCTGGCAC ACCGGCAAAT TCGCCATCGC CGGCCGCTCG
AATTTCCAGA AGATCTATGA CCTGGCGGAG CGCGTCATCC CGACCGAGTT CCGCGAGCCG
GAGGTGAGCC GCGAGGAATT CGTCGACTGG GCATGCCGCA GCGCGCTTAC CCGGCTCGGC
TTCGCCACCC ATGGCGAGAT ATCGGCCTTC TGGAACCTGG TCTCGCCCGA TGAGGCCAAG
GCCTGGGTAT CAGCCCATCG CGACGAACTG ATCGAAGTGC TGATCGAACC GGCGCTCGGC
GGCAAGGCGC GCCCATCCTG GGCTTTTGCC GATTTCCTCT CGACGCTCGA CACTTACCCC
GGCGCTCCGC CGCGCATTCG CGTGCTCAGC CCCTTCGACC CGATGATCCG CGACCGCAAC
CGCACCGAAC GCCTGTTCGG CTTCTTCTAC CGCATCGAGA TTTTCGTGCC CGAGCCCAAG
CGCGAATATG GCTATTACGT CTTCCCGCTG CTCGAAGGCG ACAGGCTGAT TGGCCGCATC
GACATGAAGG CGGATCGGAA GAAATCGACG CTCGACGTCA AGCGGCTCTG GCTGGAGCCC
GGTGTGAAGC CGTCGGCCGG GCGGCTGGAG AGACTGGAAG CGGAGTTGGA GCGGCTGGCG
CGGTTTGCCG GCGTGGAGAA GGTCGTGTTT CTGGAAGGGT GGAGGGGATA G
 
Protein sequence
MTNLLSNSDA RRVFLAKQGL SAPPNRALTK AGLLQLIHEL GFVQVDSIQT VERAHHQILF 
SRNQTYRREH LTALLEKDGA LFEHWTHDAS ILPSAFFVYW KHKFLHQEKV LIERWRKWRG
EGFEAAFAET YERVERDGAI LSRDIKADGH VSGGWWNWHP NKTALEYFWH TGKFAIAGRS
NFQKIYDLAE RVIPTEFREP EVSREEFVDW ACRSALTRLG FATHGEISAF WNLVSPDEAK
AWVSAHRDEL IEVLIEPALG GKARPSWAFA DFLSTLDTYP GAPPRIRVLS PFDPMIRDRN
RTERLFGFFY RIEIFVPEPK REYGYYVFPL LEGDRLIGRI DMKADRKKST LDVKRLWLEP
GVKPSAGRLE RLEAELERLA RFAGVEKVVF LEGWRG