Gene Rleg_3996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3996 
Symbol 
ID8014806 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4073267 
End bp4074877 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content61% 
IMG OID644826565 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002977776 
Protein GI241206680 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.70207 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0901295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGTCAG ACAATCGTTC CAGAAGGTTG CAGAGCCTGT TCATGCGCGG CGCCGCCGTG 
TTGCTGTTGT CGCTCATGGC GGCCGCACCC GTGATGCTCG CCGCGTCTCA AGCCGCCGCG
CAGACGGAAA AACCCGTTAG CGGCGGCGCG ATGACCATTA TCAACGGTTC GGACATCAAG
AGCTGGGATC CGGCGATCTC CGCCGGCACC TATCCCGGCG GGCCGATGGA TGTGCTCGAC
GCCGTCTACG GCTTTATCGT CTACGTCAAC GACAAGGGCG TCGTGACCGG CGGCATGGCC
GAAAGCCTGA CCAGCACAGA CGCCGTGACC TGGACCTTGA AGCTTCGCAA GGACATGAAG
TTCACAGACG GGACGCCCTA TGATGCGGAA GCCGTCAAAT ACAATTGGGA CCGCGCCGCC
GATTCGGCCA CGCTTTCACC GGCGCAGCCG TTCATATCCT CATGGAACAA GGCGATCACC
GTCGTCGATC CGCAGACGCT GACGATCAAG CTGTCCTCCC CGAACGCCAA TTTCGCAGCC
CAGGTCGCCG AGCTGTGCCC CTTCATCGCC TCGCCGGCAG CATTGAAGGC CGCTAAGGAA
AAGACCGACA TCAAGCCGGT GGGCGCCGGC GCCTTTACGC TGACCGAATG GAACCAGGGC
ATATCGATGA CCATGGCCCG CAATCCCGGT TATTGGGATC AGCCGCGCCC CTATCTGGAG
ACGATCAAGT TCGCGATCAT TCCCGAAACC AACAGCCGCA TCGCCACCGT CGTGCAGGGT
GGTGCAACGA TGATGGCCGG TTATCCCTAT CAGTTCGGCT CGAACGCAAC GGCGCCGGGG
GTTGCGACCC GCGAGATCCC GATCCGCGGC ATCAACCGCG CCTATCTCAA CCAGGCCAAG
GGTATCTTCA CGGATGTTCG CGCCCGCGAG GCCTTCTATT CCGCCATCGA CCGCGCGCGG
CTGATGCAGG CCTTCACGCA GATGCCCGGA TACAAGGCAC CCAGCAATTA CTTTGGAGAG
AATTCGCCCT ATTTCGACAG CGCTTCATCT CTTCCGGCCT ATGATCCGAA GAAGGCGCAG
GAACTGTTCG ACGCTCTGAA GGCGGACGGC AAGCCGTTCT CGATCAAGAT CGTCACCTAT
ACCAACTCGG ACCTGAAGCG TCTTGCCGCC TACATCCAGC AGGTGCTGAC CGGCTATGAA
GGCGCCTCGG CTGAGATCGT CGAGGTCGAC CAGGCCTCGC TGATCCAGCG ATGCAAGACG
CAGCTCGACT TCGACATCTG CGTCGAAGGC GGGGTGCTGG TGTCGAACGG CGCCGAGCCG
AACATTTCCA ATCTGCTGAG TTCCGGCGGT GCTTTCAACT GGGGACAGTA CAAGAGCGCC
GAGATGGACG CTGCATTGAA GGAGGCCAGC TCCACCCTCG ACCCTGCCGC TGTCAAAGCC
GCCTATGTCA AGGTTCAGAA GCTCGTCGCA ACCGAAATGC CGCTTTACAT CTTCGGTGAA
CAGACGCGCT CGCTGCTGCT GCGCGACAAT ACCGGCGGCG TCGTTCCTTC GAACGGCGGC
ATCCTGCAAA AGCAGTTCCT CTACGTCTGC ACGGATGTCT GCCAGAAATA G
 
Protein sequence
MLSDNRSRRL QSLFMRGAAV LLLSLMAAAP VMLAASQAAA QTEKPVSGGA MTIINGSDIK 
SWDPAISAGT YPGGPMDVLD AVYGFIVYVN DKGVVTGGMA ESLTSTDAVT WTLKLRKDMK
FTDGTPYDAE AVKYNWDRAA DSATLSPAQP FISSWNKAIT VVDPQTLTIK LSSPNANFAA
QVAELCPFIA SPAALKAAKE KTDIKPVGAG AFTLTEWNQG ISMTMARNPG YWDQPRPYLE
TIKFAIIPET NSRIATVVQG GATMMAGYPY QFGSNATAPG VATREIPIRG INRAYLNQAK
GIFTDVRARE AFYSAIDRAR LMQAFTQMPG YKAPSNYFGE NSPYFDSASS LPAYDPKKAQ
ELFDALKADG KPFSIKIVTY TNSDLKRLAA YIQQVLTGYE GASAEIVEVD QASLIQRCKT
QLDFDICVEG GVLVSNGAEP NISNLLSSGG AFNWGQYKSA EMDAALKEAS STLDPAAVKA
AYVKVQKLVA TEMPLYIFGE QTRSLLLRDN TGGVVPSNGG ILQKQFLYVC TDVCQK