Gene Rleg_4690 details


Gene Information

Locus tag: Rleg_4690
Symbol:
ID: 8007166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 54470
End bp: 56044
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 59%
IMG OID: 644821624
Product: extracellular solute-binding protein family 5
Protein accession: YP_002972884
Protein GI: 241113049
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
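
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(54470, 56044)
print(length_bp)                  # 1575
print(protein_length(length_bp))  # 524
```

Both results agree with the Gene Length (1575 bp) and Protein Length (524 aa) fields.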


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCAA AAATATTGAC GCGGGCAGCG ATCGGGCTGC TCGGAACGAT TGCTATTCCT 
GCAATGGTCG CCGCACAGGA AACCCCGCTG ACGATCGGGA TGTCGACGAC GCCGACGACG
CTGGATCCCC ATGAAGACAG TTCGGCGCCA AACAATGCGA CCTCGCGCCA TATCTGGGAC
AGCCTCATAA ACCTCACCGG AACGTCGGCA AACGCGCCGG AACTCGCCAC CGAATGGAAG
GTCGTCGACC CGACCCACTG GGAATTCAAG CTGCGGAAGG GCGTGAAATT CCATGACGGC
AGCGAGTTCA ACGCCGATGA CGTCATCGCT TCGCTACTGC GTGCCCGCGA CAAGCCCAGC
CAGAGCTTTG CTTCCTACAC CCGCAATATT GTCAATGTCA CGGCCACGGA TCCCTATACC
ATCGTCGTCG AGACGAAGGT TCCAGATCCG ATCCTGCTGA ATTCCGTCAG CCGAATTCGC
ATTATCAGCG CCGATTGCAA GGAGGCGCCG GTCCAGGATT TCGACAACGG CAAGTGCGCA
ATCGGAACCG GGGCCTATTC CTTCGTCTCC TATTCGCCAG GCAGCAACCT GACCTTGAAG
CGCAATGACA GCTATTTCGC TGGCCCCTCG CACTGGTCGA ACGTCACGCT GCGCTTCCTG
CCCGATGACG GCGCCCGCCT GGCATCGCTT CTTTCCAACG AGATCGACAT TGTCGAAACC
CTTCCTGCCG ATGGAATGGC ACGCGTGGAA GCGAGCGATA ATCTGCAAGT CATCAACGGC
CTGTCTTCCC GCTTCGTCTA TCTTGGCCTC GATGTCAGCC GCGATGTTTC GCCTTTCGTG
AAGGCTGCGG ATGGTTCCGA TCTCGACAAG AACCCGCTCA AGGATGAACG GGTGCGTCGC
GCAATGCTGA TGTCGATCAA CCGGCCGGCA ATCGTCGACC GCGTCATGCA GAAGAACGGC
ACGGTGGCCG ACCAATTCGT CACCCAAGGC TATGCGGGCT ATTCCGAAAA AGTCGAGAAG
GTGGGCTATG ATCCGGCCGC CGCCAAGGCG CTTCTGGCCG AGGCCGGTTA TCCCGATGGC
TTCAGCCTGA CCCTGCATGG TCCCTCTGGC CGTTATGTCA AGGATGCCGA AGTTCTGCAG
GCGGTCGGCC AGATGTTCAC GCGCATCGGC ATCAAATCGA AAGTCGAGGT CCTGCCCTGG
TCGATGTATT CGGAAGCCTA TTCAAAGGGC ACGTACAGCA CCTATTTCGG CTCCTGGGGT
GTGAACACCG GCGAAACCAC CAATCCGACT GTCGCGCTCG TCGCGACGCG CGACGAGAAA
AAGGGGACCG GCAGATATAA CGGCGGCGGC GTGTCCGATC CGAAGATCGA CGAGGTACTG
GCCAAGGCGA GTTCGACACT CGATGAAGCT GCACGCGCGC CCCTTCTCGA GGAGCTGTCG
ACCGAGACGT TCAACAACCT TTGGCTGCTC CCAATGCATT ACGAAAACGT CGTGCTGGGT
GCAAAGAAGA CCGTGTCCTA CACCCCGCGC GGCGACAAAT ATACGCTCGC TTATGACGTG
AAGCCGGCGG ACTAA
 
Protein sequence
MAAKILTRAA IGLLGTIAIP AMVAAQETPL TIGMSTTPTT LDPHEDSSAP NNATSRHIWD 
SLINLTGTSA NAPELATEWK VVDPTHWEFK LRKGVKFHDG SEFNADDVIA SLLRARDKPS
QSFASYTRNI VNVTATDPYT IVVETKVPDP ILLNSVSRIR IISADCKEAP VQDFDNGKCA
IGTGAYSFVS YSPGSNLTLK RNDSYFAGPS HWSNVTLRFL PDDGARLASL LSNEIDIVET
LPADGMARVE ASDNLQVING LSSRFVYLGL DVSRDVSPFV KAADGSDLDK NPLKDERVRR
AMLMSINRPA IVDRVMQKNG TVADQFVTQG YAGYSEKVEK VGYDPAAAKA LLAEAGYPDG
FSLTLHGPSG RYVKDAEVLQ AVGQMFTRIG IKSKVEVLPW SMYSEAYSKG TYSTYFGSWG
VNTGETTNPT VALVATRDEK KGTGRYNGGG VSDPKIDEVL AKASSTLDEA ARAPLLEELS
TETFNNLWLL PMHYENVVLG AKKTVSYTPR GDKYTLAYDV KPAD
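
The gene and protein sequences can be compared by translating the DNA. Below is a minimal sketch in plain Python, not the method used by the source database. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are allowed. The helper names `clean`, `gc_fraction` and `translate` are hypothetical, introduced here for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGCCGCAA AAATATTGAC GCGGGCAGCG"
print(translate(first_blocks))  # MAAKILTRAA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) checks that the two listings agree. In the same way, `round(100 * gc_fraction(gene))` can be compared with the GC content field.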