Gene Rleg_6676 details


Gene Information

Locus tag: Rleg_6676
Symbol:
ID: 8022586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012858
Strand:
Start bp: 106243
End bp: 107862
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 62%
IMG OID: 644833543
Product: extracellular solute-binding protein family 5
Protein accession: YP_002984677
Protein GI: 241666593
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
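
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are made up for the example, and it assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly.

```python
# Values copied from the Gene Information record.
start_bp = 106243
end_bp = 107862
gene_length_bp = 1620
protein_length_aa = 539

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is the stop codon
# and contributes no amino acid.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1620 540 539
```

Under that assumption the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.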


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.479975
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.420932
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAAAC GCTGGCTGCA ACAGACGACC ATGGCAACGA TGGTGGCGCT CGCTCCATTG 
TCCGTCATGG CTGATGAAAC GCCCAAGCAG GGCGGCGATA TCGTCGTCAC CTACAAGGAC
GACATCACCA CGCTCGACCC GGCGATCGGC TACGACTGGG TCAACTGGTC GATGATCAAG
AGCCTCTATT CCCGCCTGAT GGACTATACG CCCGGCACGC CGAACCCAGT TCCCTCGCTT
GCCGAAAGCT TCACCGTTTC GCCCGACGGC TTGACCTATA CCTTCAAGCT GCACAAGGGC
GTGAAGTTCT CGAACGGCCG CGAGGTGGTC GCCTCCGACG TGAAATATTC GATCGAACGC
GCCGTCGACC CGAAGACGCA AGGCCCCGGC GCCGGCTTCT TCGGCGCCAT CAAGGGCTTC
GAGGATGAAA CCGGCGGCAA GACGACGACG CTCTCCGGCA TCGATACGCC TGACGATAGC
ACCGTCATCT TCAACCTCTC TCGCCCAGAC GCCACCTTCC TGCACGTGCT TGCCATCAAC
TTCGCCTCGG TCGTGCCGAA GGAAGCCGTC GAGGCTGCCG CCGGCGACTT CGGCAAGAAG
CCGGTCGGCT CCGGCACCTT CATCCTGAAG GACTGGACGA TCGGCCAGCA GCTCGTTTTC
GAGCGCAACA AGGATTATTT CGTCAAGGGC GTTCCCTATA TCGACAGCTT CAAGGTCGAG
GTCGGCCAGG AGCCGCTGGT GGCGCTCTTG CGCCTGCAGA AGGGCGAGGT CGATATTGCC
GGCGACGGCA TTCCGCCGGC AAAGTTCCTC GAAATCAAGA ATTCGGCCGA TGGCGCACAG
ATGATCGTCG ACGGCGAACA GCTGCACACC GGCTACATCA CGCTGAACAC CAAGGTAAAG
CCCTTCGACA ACGTCAAGGT TCGCCAGGCG CTGAACATGG CGATCAACAA GGACCGCATC
ACCCGCATCC TCAACGGCCG CGCAACGCCT GCCAACCAGC CGCTGCCGCC GCTGATGCCG
GGTTACGACA AGGCCTTCAC CGGCTATACC TATGACGTGG CGAAAGCCAA GGCGCTGCTT
GCCGAAGCCG GTTATCCCGA TGGCTTCGAA ACCGTGCTCT ACTCCACCAA CACCGATCCG
CAGCCGCGTA TCGCCCAGGC AATCCAGCAG GATCTGGCCG CCGTTGGCGT CAAGGCCGAA
GTCCGGGCGC TGGCCCAGGC AAACGTCATC TCGGCCGGCG GCACGGAAGG CGAAGCGCCG
ATGATCTGGT CGGGCGGCAT GGCCTGGATC GCCGACTTCC CGGATCCGTC CAACTTCTAT
GGCCCGATCC TCGGTTGCGC CGGCGCGGTC CCGGGCGGCT GGAACTGGTC GTGGTACTGC
AACGCCGATC TCGACAAGCG CGCCGTTGCC GCCGACTCCA TGTCCGATCC GGCAAAGGCA
ACCGAACGCA CCGCCGCCTG GGGCAAGATC TTCACCGACA TCATGGCAGA TGCGCCGTGG
ATCCCTGTCA TCAACGAACG CCGCGTCGTC GCCAAGTCGC TGCGCATGGG CGGCGCTGAC
AACATCTACA TCGATCCGAC CCGCGTCATC AATTACGACG CGATCTACGT CAAGCAGTAA
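
The gene sequence is printed in blocks of ten bases separated by spaces, so it has to be cleaned before any computation. The following is a minimal sketch of how a GC content figure like the one in the record could be recomputed from the pasted text; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any tool behind this page.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, used here only to show the cleaning step.
first_line = "ATGTTCAAAC GCTGGCTGCA ACAGACGACC ATGGCAACGA TGGTGGCGCT CGCTCCATTG"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full sequence text to `gc_percent` would give a value to compare against the GC content field.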
 
Protein sequence
MFKRWLQQTT MATMVALAPL SVMADETPKQ GGDIVVTYKD DITTLDPAIG YDWVNWSMIK 
SLYSRLMDYT PGTPNPVPSL AESFTVSPDG LTYTFKLHKG VKFSNGREVV ASDVKYSIER
AVDPKTQGPG AGFFGAIKGF EDETGGKTTT LSGIDTPDDS TVIFNLSRPD ATFLHVLAIN
FASVVPKEAV EAAAGDFGKK PVGSGTFILK DWTIGQQLVF ERNKDYFVKG VPYIDSFKVE
VGQEPLVALL RLQKGEVDIA GDGIPPAKFL EIKNSADGAQ MIVDGEQLHT GYITLNTKVK
PFDNVKVRQA LNMAINKDRI TRILNGRATP ANQPLPPLMP GYDKAFTGYT YDVAKAKALL
AEAGYPDGFE TVLYSTNTDP QPRIAQAIQQ DLAAVGVKAE VRALAQANVI SAGGTEGEAP
MIWSGGMAWI ADFPDPSNFY GPILGCAGAV PGGWNWSWYC NADLDKRAVA ADSMSDPAKA
TERTAAWGKI FTDIMADAPW IPVINERRVV AKSLRMGGAD NIYIDPTRVI NYDAIYVKQ
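
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style amino acid string; translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits, so the same string is used here. The `translate` function is an illustrative helper and assumes the input is in frame and on the coding strand.

```python
from itertools import product

# Codons enumerated in TCAG order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence.
print(translate("ATGTTCAAAC GCTGGCTGCA ACAGACGACC"))  # MFKRWLQQTT
print(translate("GTCAAGCAGTAA"))                      # VKQ
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence.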