Gene Rleg_5398 details

Gene Information

Locus tag: Rleg_5398
Symbol:
ID: 8007356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 813501
End bp: 815021
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 61%
IMG OID: 644822302
Product: extracellular solute-binding protein family 5
Protein accession: YP_002973562
Protein GI: 241113727
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
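
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither assumption is stated in the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(813501, 815021)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```

Both values agree with the Gene Length and Protein Length fields listed in the record.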


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGAAGAAGA TGAAGGCAAT TGGTGCGCTG AGCATCGGAC TGGCATTTCT GCTGGGTCCG 
GGCAGCGCAA TTCGCGCCGA CGCTGCCTCG GACAAGCTGA CGGTCGTGGT GACCGACGAA
CCGAAGTCGC TCGATCCCTG TGACACCGAC CTTTCGGGCA ATTCTCGCAT TCTGCACAAC
AACATCACCG AAGCGCTGGT CAATCTGAGC CCGGCCGACG GATCGGTCGT CCCGAGCCTT
GCCGCCAGCT GGCGGCAAGT GGACGAGCTG ACCTGGGAAT TCAAGCTCCG CGACGACGTC
ACCTTCCACG ACGGCAAGGC GTTTGATGCC AGTGCAGTCG TTGCCGCTCT CAAGCGGGCG
CAGGATCCGG CACTGGCCTG CGAAGTCGGA CTTGCGACGC TGAAGGGCGT CAAGTTCAAT
GCAGAAGCGG TGAACCCCAC CACCCTCCTC ATCAAAACGG ATATCGTCGA GCCGATCCTG
CCGAACAAGA TGTCCGCCGT GGACATCGGT TCGCCGGCGA CCCCGAACGA CGGCAAGTCG
CGCTCCCCGG CCGGAACCGG ACCTTACAAG CTTGCAGCGT GGACGCCCGG GCAATCGGTC
GACCTCGTGG CCTATGACGG CTATTGGGGC GACAAGCCGG CGATCAAGAA CGCGACCATC
ATCTGGCGCG CCGAATCCGC TGTTCGCGCG GCAATGGTCG CCACCGGTGA GGCGCAGATC
GCCTATGAAA TAGCGCCCCA GGACGGCACG TCGGAACAGG ATCATGCCTT CCCGAACGCC
GAAACCTCGC TGCTGAGGAT CGACGCAGAA ATTGCGCCAC TCAACGACAA ACGCGTGCGC
GAAGCCCTTA ATCTTGCGAT CGACCGGGAC GGACTTGTCG GCACCATCTT CCACCAGGAT
GCCCAAAAGG CGATGCAGGC GGTGCCGCCG TCCGTCTTCG GCTTCAATCC CGACATCCCC
GTCTGGACGT ATGATCCCGA AAAAGCGAAG TCCCTGCTCG CCGCGGCGAA AGCCGATGGC
GTGCCGGTTG ACAAGGAAAT CGTCATCTAC GGCCGCATCG GCATCTATCC CAATTCGTCC
GAAAGCCTGG AAGCCATTCA GGCGATGCTC GCGGATGCAG GTTTCAATGC CCGGCTCGAA
ATGCTTGAAA CAAGCCCGTG GCTGAAGAAG CTTCTTAAGC CCTGGGACAA GGAACGTCAG
CCGTCGATCC TGCAGACGCA GATCGACAAC ACCGAAGGCG ACGCCGTATT CACGCTGCCG
AACCGTTTTA CTACCGACGG CAACCAGTCG ACCATCGCCG ATGCCAAACT CGACACATTG
ATTACCGACG CGTCGAAGGC AACCGGCGAC GAACGCAGGA AACTGTTCGA GGAGGCCTTC
AGCTACATCG CCGTCGATGC GGTCAATATC GTGCCGCTGT TCCACATGGT CACGATTGCC
CGCGTCGCCG AGAACGTCAC CTACACGCCC GATGTGCAGG CCGGCAACGA GATCAAGCTT
AAATCGATCA GCTACCGCTG A
 
Protein sequence
MKKMKAIGAL SIGLAFLLGP GSAIRADAAS DKLTVVVTDE PKSLDPCDTD LSGNSRILHN 
NITEALVNLS PADGSVVPSL AASWRQVDEL TWEFKLRDDV TFHDGKAFDA SAVVAALKRA
QDPALACEVG LATLKGVKFN AEAVNPTTLL IKTDIVEPIL PNKMSAVDIG SPATPNDGKS
RSPAGTGPYK LAAWTPGQSV DLVAYDGYWG DKPAIKNATI IWRAESAVRA AMVATGEAQI
AYEIAPQDGT SEQDHAFPNA ETSLLRIDAE IAPLNDKRVR EALNLAIDRD GLVGTIFHQD
AQKAMQAVPP SVFGFNPDIP VWTYDPEKAK SLLAAAKADG VPVDKEIVIY GRIGIYPNSS
ESLEAIQAML ADAGFNARLE MLETSPWLKK LLKPWDKERQ PSILQTQIDN TEGDAVFTLP
NRFTTDGNQS TIADAKLDTL ITDASKATGD ERRKLFEEAF SYIAVDAVNI VPLFHMVTIA
RVAENVTYTP DVQAGNEIKL KSISYR
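
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids as the standard code (the two differ only in permitted start codons), and it strips the 10-base grouping whitespace used in the listing. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid
# assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases of the gene sequence listed above.
print(translate("ATGAAGAAGA TGAAGGCAAT TGGTGCGCTG"))  # MKKMKAIGAL
print(translate("AAATCGATCA GCTACCGCTG A"))           # KSISYR
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the result to be compared against the listed protein sequence and the GC content field.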