Gene Rleg_4180 details

Gene Information

Locus tag: Rleg_4180
Symbol:
ID: 8014970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4274550
End bp: 4276193
Gene length: 1644 bp
Protein length: 547 aa
Translation table: 11
GC content: 59%
IMG OID: 644826750
Product: extracellular solute-binding protein family 5
Protein accession: YP_002977960
Protein GI: 241206864
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
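
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are illustrative and not part of the record):

```python
# Coordinates copied from the record; treated as 1-based and inclusive (an assumption).
START_BP = 4274550
END_BP = 4276193


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1644
print(protein_length(length_bp))  # 547
```

Both values agree with the Gene length and Protein length fields in the record.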


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.336386
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGATGA CCAAACTCAG CCGCAATTTT CGCCTGCTTT CCGCGGGAGC CGCTCTTTCG 
CTCCTGATGA TGGCGGCACC CTCGGCCTTT GCCGAGACAC CGAAGGATAC GCTGGTCGAA
GGTTTCGCCA TCGACGATAT CATCACGATG GATCCGGGTG AGGCTTTCGA GCTTTCGACC
GCCGAAATCA CCACCAACAG CTACAGCCTG CTCGTCCGTC TCGACATGGA CGACACGTCC
AAGGTGAAGG GCGATCTGGC CGAGAGCTGG AGCGTTTCCG ATGACGGCCT TACCTATACG
TTCAAGCTGA AGTCAGGCCT GAAATTCGCC TCCGGCAACC CGATCACCGC CGAAGACGTC
GCCTGGTCGT TCGAGCGTGC CGTCAAGCTC GACAAGAGCC CAGCCTTCAT CCTCACCCAG
TTCGGTCTGA CCGGCGATAA CGTCGCGGAA AAAGCCAAGG CGGCCGATGC CGGCACTTTC
GTCTTCACGG TCGACAAGGC CTATGCGCCG AGCTTCGTGC TCAACTGCCT GACGGCAACC
GTCGCTTCCG TCGTCGACAA GAAGCTGGTG CTGGAGCATG TGAAGGCGGT GGCGCCCGAT
GCCGACCACA AATACGACAA CGACTTCGGC AATGAATGGC TGAAGACCGG CTATGCCGGC
TCCGGCGCCT ATAAGATGCG CGAATGGCGC GCCAACGAAG TCGTCGTGCT GGAGCGCAAT
GACAATTATT ATGGTGACAA GGCAAAGCTC AACCGCGTCA TCTACCGCTA TATGAAGGAA
AGCGCTGCCC AGCGGCTGGC GCTCGAAGCC GGCGACATCG ATATCGCCCG CAACCTCGAG
CCGGGCGACA TCGACGCGGT TTCGAAGAAT GCGGATCTGG CGACAACCAG TGCGCCGAAG
GGCACGATCT ATTATGTCAG CCTGAACAAC AAGAACGAGA ACCTGAAGAA GCCGGAAGTC
CAGGAAGCCT TCAAATATCT GGTCGATTAC GATGCGATCA GCGCAACGCT GATCAAGGGT
ATCGGCGAGA TCCATCAGAC CTTCCTGCCA AAGGGTCAGC TCGGCGCACT CGATGAGAAT
CCCTACAAGC TCGATGTCGC CAAGGCCAAG GAACTGCTGG CCAAGGCCGG CGTACCCGAC
GGTTTCTCGA TCACCATGGA CGTACGCAAC AGCCAGCCGG TGACCGGTAT CGCCGAATCG
ATGCAGCAGA CGCTGGCGCA GGCCGGGGTG AAGATGGAAA TCATCCCCGG CGACGGCAAG
CAGACGCTGA CCAAGTACCG CGCGCGTACG CACGACATGT ATATCGGCCA GTGGGGTTCG
GACTATTTCG ACCCGAATTC CAATGCCGAC ACCTTTACCG GCAATCCTGA CAATTCCGAT
GCCGGCACGG TGAAGACGCT CGCATGGCGC AACACCTGGG AAGCGCCGGA ACTCGACAAG
CAAGCCAAGG CAGCCCTTCT GGAACGCGAC GCTGCCAAGC GCGCCGCCAT ATATCAGGAC
ATCCAGAAGA AGTATCTGGC AAACAGCCCC TTCGTCTTCA TCTTCCAGCA GACCGAGGTG
GCCGGCTACC GCAAGAGTGT GAAGGACTTC AAGCTGGGTC CGAGCTTCGA CACCAATTTC
GTCGGTCCGA TCGCCAAGGA ATAG
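
The GC content field can be checked against the gene sequence directly. A minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing used above (`gc_content` is a hypothetical helper, not part of the record):

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# First two 10-base groups of the gene sequence, used here only as a short demo.
demo = "ATGATGATGA CCAAACTCAG"
print(gc_content(demo))  # 40.0
```

Passing the full gene sequence to the same function gives the value to compare with the GC content field.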
 
Protein sequence
MMMTKLSRNF RLLSAGAALS LLMMAAPSAF AETPKDTLVE GFAIDDIITM DPGEAFELST 
AEITTNSYSL LVRLDMDDTS KVKGDLAESW SVSDDGLTYT FKLKSGLKFA SGNPITAEDV
AWSFERAVKL DKSPAFILTQ FGLTGDNVAE KAKAADAGTF VFTVDKAYAP SFVLNCLTAT
VASVVDKKLV LEHVKAVAPD ADHKYDNDFG NEWLKTGYAG SGAYKMREWR ANEVVVLERN
DNYYGDKAKL NRVIYRYMKE SAAQRLALEA GDIDIARNLE PGDIDAVSKN ADLATTSAPK
GTIYYVSLNN KNENLKKPEV QEAFKYLVDY DAISATLIKG IGEIHQTFLP KGQLGALDEN
PYKLDVAKAK ELLAKAGVPD GFSITMDVRN SQPVTGIAES MQQTLAQAGV KMEIIPGDGK
QTLTKYRART HDMYIGQWGS DYFDPNSNAD TFTGNPDNSD AGTVKTLAWR NTWEAPELDK
QAKAALLERD AAKRAAIYQD IQKKYLANSP FVFIFQQTEV AGYRKSVKDF KLGPSFDTNF
VGPIAKE
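
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation, assuming the amino acid assignments of table 11 match the standard code (they differ only in which codons may act as start codons, which this sketch does not handle); `translate` and the table names are illustrative:

```python
import itertools

# Codons in NCBI order (T, C, A, G) paired with the amino acids of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGATGATGA CCAAACTCAG CCGCAATTTT CGCCTGCTTT CCGCGGGAGC CGCTCTTTCG"
print(translate(first_line))  # MMMTKLSRNFRLLSAGAALS
```

The output matches the first 20 residues of the protein sequence. Because this gene starts with ATG, the missing handling of alternative start codons (such as GTG or TTG, which table 11 reads as methionine at the first position) does not affect it.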