Gene Rleg2_6105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6105 
Symbol 
ID6983178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011370 
Strand
Start bp33381 
End bp34937 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content54% 
IMG OID643399130 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002283886 
Protein GI209551970 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000312198 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGATT TGAACCGTAG AACGCTACTA AAAGGTGCAG CCGCCGCTGC GGCATACACG 
TTTACTTCTT TGGGCCCGGC AAGGGCTACT TCAAGCTCGC CGCGCCGAGG AGGGCATCTT
CGCATCGGTC TTTGGGGAGG ATCCTCACAG GACACGCTAG ATCCGGCTAG CATCACTACC
GATGCGGGGT TCCTCACGGC GGCCACCGCA CGGAACAAGT TGTTAGAGGT CGAACCGAAT
GGTGAGCTTA CCCCAGCCCT CGCATTAAAG TGGGAGCCAT CGGATGACCT CATGCGTTGG
ACCTTTGAAA TTCGGCCTGG GGTGACTTTT CACAGCGGCA AGTCGCTTGA AATGTCGGAC
ATCGTCGCTT CGCTCAACCT GCATCGCGGA AAGGACTCAA CCTCTCCCGC AAAATCCTTC
CTTGACGCGG TCACTGATAT CAAGGCCGAG GGCAGTAACA GAGTTGTCGT CTCGCTCAAT
GCTCCGAACG TCGACTTTCC AAGTGCCCTC GCGGACCTCT CGCTGTCCAT CGTGCCCGCG
AAGGATGGGG TAGCAGATCG GAACACGATG GACGGTACTG GCCCGTACGC AATCGAGAGC
TTTGAGCCTG GCCAGCGCAT AAGGTTCAAG CGAAACCCTA ATTACTGGAA TCTGGATAAG
GCTGCATTCT TCGACTCGGC CGAGGTCCTG ATCCTCGCTG ATGCCGCTAC AAGAATGAAC
GCTTTGCGCT CGGGTCAGGT TGATTTGATA AACCAAGCCG ACCTTAAGAC ACTTAGCATG
CTTCAGCGGG TCCCGGGAAT AACCGTTGAA GACGTGCCCA GCGGCCGGTT TTATATTTTT
GGCATGATGT CGGACGTTGC TCCTTTCAAT GACAAGGATG TACGACAGGC TCTGAAATTT
GCGATCAACC GAAAGGAGAT GACCCAGAAG ATTCTCCTTG GGCATGGGAG CATTGGGAAT
GACCAGCCTA TCAAGCCCAG CCACAAGTAC TTCAATACGA ACCTTCCGCA ACGTGAGTAT
GATCCCGAAA AAGCGAAATT TCATCTTAAA CAAGCTGGCG TGACGTCACT TCAAGTACCT
TTGAGTGTGG CCGAGGCCGC ATTCGCCGGG GCTGTAAATG CGGGGCAGCT TTTTGTCGCC
TCGGCTGCTG AAGCAGGCAT CAACATCGTC GCAACGCGGG AGCCCGATGA TGGTTACTTC
GACAACGTTT GGCTGAAAAA GCCGTTTACC GCCGACTATT GGACTGAACT GCCGTCCGCT
GATGCGCAGT TCACGCAAGG CTATGCTAAG GGAGCAGCTT GGAACGAGAC GCACTTCGAC
AACCCTCGCT TCAACGAACT CTTGCTGAAG GCCCGAGCTA CGCTGGATGA GCAGCAGCGC
GCGGGGATGT ACCACGAAAT GCAACAACTC ATCCATGATG AAAGCGGTGC GATCATTCCA
ATGTTCGCGA ACAATACCTG GGCTTCCAAA TCAACGTTGA AGCATCAGGA CGGATTGTCC
AGCCATCGCG ATCTCGATGA TTTCCGTTGC ATTGAGAGGT GGTGGTTCGA ATCCTAA
 
Protein sequence
MLDLNRRTLL KGAAAAAAYT FTSLGPARAT SSSPRRGGHL RIGLWGGSSQ DTLDPASITT 
DAGFLTAATA RNKLLEVEPN GELTPALALK WEPSDDLMRW TFEIRPGVTF HSGKSLEMSD
IVASLNLHRG KDSTSPAKSF LDAVTDIKAE GSNRVVVSLN APNVDFPSAL ADLSLSIVPA
KDGVADRNTM DGTGPYAIES FEPGQRIRFK RNPNYWNLDK AAFFDSAEVL ILADAATRMN
ALRSGQVDLI NQADLKTLSM LQRVPGITVE DVPSGRFYIF GMMSDVAPFN DKDVRQALKF
AINRKEMTQK ILLGHGSIGN DQPIKPSHKY FNTNLPQREY DPEKAKFHLK QAGVTSLQVP
LSVAEAAFAG AVNAGQLFVA SAAEAGINIV ATREPDDGYF DNVWLKKPFT ADYWTELPSA
DAQFTQGYAK GAAWNETHFD NPRFNELLLK ARATLDEQQR AGMYHEMQQL IHDESGAIIP
MFANNTWASK STLKHQDGLS SHRDLDDFRC IERWWFES