Gene Rleg2_5521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5521 
Symbol 
ID6978615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1169650 
End bp1170831 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content62% 
IMG OID643394620 
Productectoine utilization protein EutD 
Protein accessionYP_002279438 
Protein GI209547520 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID[TIGR02993] ectoine utilization protein EutD 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00590765 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCAGC CCAACCTCAA ATTCTCGCTC GGCGAATATG CCGCGCGGCT GGAAAAGACG 
CGGCGCGCCA TGGAGGCGAA GGGCGTCGAC CTGCTTATTG TCAGCGACCC GTCGAACATG
GCCTGGCTGA CCGGTTATGA CGGCTGGTCT TTCTACGTGC ACCAGGCAGT GATCGTGCCG
CCGCAGGGCG AGCCGATCTG GTTCGGCCGC GGCCAGGATG CCAACGGCGC CAAGTTCACC
ACCTATCTGA AGCACGACAA CATCGTCGGT TATCCCGATC ATTACGTGCA GTCGACCGAG
CGCCATCCGA TGGATTACCT CTCGGGCATC CTGACGGAGC GCGGCTCTAG CAAGCTGACG
ATCGGCGTCG AGATGGACAA TTACTGGTTC TCGGCGGCCG CCTTCGCCGC GCTGCAGAAA
CATCTGCCGC ATGCGCGCTT CGTCGACGCG ACGGCGCTGG TCAACTGGCA GCGCGCGGTC
AAGAGCGAGA CCGAGATCAA ATATATGCGC AATGCCGCCC GCATCGTCGA AGCGATGCAT
GCCCGCATCT TCGACAAGAT CGAAGTTGGC ATGCGCAAAT GCGATCTAGT CGCGGAAATC
TATGATGCCG GCACTCGCGG CGTCGACGGC ATCGGCGGCG ATTATCCAGC GATCGTGCCG
CTGCTGCCGT CCGGCGTTGA AGCATCCGCG CCGCATCTAA CCTGGGACGA CCGGCCGCTG
AAGAAGGGCG AGGGCACCTT CTTCGAGATT GCCGGCTGCT ACCACCGCTA TCACCTGCCA
CTGTCGCGCA CCGTCTTCCT CGGCAAGCCG ACGCAGGCCT TTCTCGATGC CGAGAAGGCG
ACATTGGAAG GCATGGAGGC CGGTCTTGCC GTTGCCAAGC CCGGCAACAC CTGCGAGGAC
ATCGCCAACG CCTTCTTCGC CGTGCTGAAG AAATACGGCA TCGTCAAGGA CAACCGCACC
GGTTACCCGA TCGGTCTGTC CTATCCGCCG GACTGGGGCG AGCGCACCAT GAGCCTGCGG
CCGGGCGACC GGACCGAGTT GAAGCCCGGC ATGACTTTCC ATTTCATGAC TGGCCTCTGG
CTCGACGACA TGGGTTTCGA AACGACCGAG AGCATCCTGA TCACCGAGAG CGGTGTCGAA
TGTTTCGCCA ATGTGCCGCG CAGGCTGATG GTCAAGGATT GA
 
Protein sequence
MTQPNLKFSL GEYAARLEKT RRAMEAKGVD LLIVSDPSNM AWLTGYDGWS FYVHQAVIVP 
PQGEPIWFGR GQDANGAKFT TYLKHDNIVG YPDHYVQSTE RHPMDYLSGI LTERGSSKLT
IGVEMDNYWF SAAAFAALQK HLPHARFVDA TALVNWQRAV KSETEIKYMR NAARIVEAMH
ARIFDKIEVG MRKCDLVAEI YDAGTRGVDG IGGDYPAIVP LLPSGVEASA PHLTWDDRPL
KKGEGTFFEI AGCYHRYHLP LSRTVFLGKP TQAFLDAEKA TLEGMEAGLA VAKPGNTCED
IANAFFAVLK KYGIVKDNRT GYPIGLSYPP DWGERTMSLR PGDRTELKPG MTFHFMTGLW
LDDMGFETTE SILITESGVE CFANVPRRLM VKD