Gene Rleg_6804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6804 
Symbol 
ID8022734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp245090 
End bp246994 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content60% 
IMG OID644833670 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002984804 
Protein GI241666720 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.910845 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAGT GGAAGAGGCT CGCATTTGGC GTCGCGCTGG CCGCACTCGG CCTGACAGCG 
ACGGCCAAGG CCGATGACTA CACCTCCTTG CCGCGCAAGG AGACGCTCAT CGTCGAAAAT
CCGGAAGGGA CGATCAAAAA TCCCGGCTGG TTCAACATCT GGGTCAATGG CGGCGGCGGC
GTATCGACCG GCCTGCAGCA GCTGACCATG GATACGCTCT GGTATATCGA CCCCGAACAA
GGGCTTGGCG GCGCGACCTG GGACAATTCT CTGGCTGCCG ACAAGCCGCA ATACAATGCC
GACTTTACCG AGATGACCGT GAAACTGCGC AAGGGGCTCT TCTGGAGCGA TGGCGTGGAG
TTCACGGCAG ACGACGTGGT CTATACCGTG AAGACGCAGA TGGATCACCC CGGAATGGTC
TGGAGTGCTG CCTTCTCGGT GCAGGTGGCA AGCGTCGAGG CGACCGACCC CTCGACTGTG
GTGTTCAAGC TAAAGAAGCC CAATTCGCGC TTCCATGCCA TCTTCACCGT TCGCTGGAAC
GGCGCCTGGA TCATGCCCAA GCATGTTTTC GAGAAAGTCG AAGATCCGCT TCGTTATGAT
TTCGCCAATC CCGTTTCGCT CGGCGCCTAC AAGCTCAAGT CCTACGACCC CCAGGGCAAG
TGGTATACCT GGGAGAAGCG CGACGACTGG CAGCGGACAT CGCTTGCCCG CTTCGGCGAG
CCGGCTCCGA AATATGTGAC CTATGCCGAT CCCGGCCCGC CGGATAAACG CACCATCGCT
CAGCTCGAGC ACAATCTCGA TATCATTCAC GACAACACGC CCGAGGGCAT GTTCACGCTG
AAGGAAAAAT CCAAGACGAT CGAAACCTGG TTCCCGGGTT TCCCCTTCGC CCATCCGGAC
CCGACGCTTC CGGCCGTGAT CTTCAACACC CAGAATCCGC CCTTCGACAA TGCCGATGTA
CGCTGGGCGC TTGCCCTGCT GATCGACATC AAGGCGGTGG ATATGGCGAG CTATCGCGGG
GCAGCGACGC TTTCGGCGCT CGGCGTGCCG CCGACGGCGG CCACGATGAA GGACTATCAG
GCGCCGATGC AGGATTGGCT GAAGAATTTC GAGATCGATA CCGGCAAGAG CAAGATCAAG
CCTTATGACC CGACGGTCGG GCAACAGATC GCCGATATCC TGCGCAAGCA GCCGAAGTTC
AAGGACCAGA TCCCGACCGA CGCGGAGGCG ATCAGCGGTG CCTTCGGCTA TGGCTGGTGG
AAACCGGATC CGAAGGCTGC CGGCGAACTG CTGGAGAAGG CAGGCTTCAA GAAATCCGGC
GGCAAATGGC TGACCCCTGA TGGACAGCCC TTCAAGATCC GGATGACGGT GGAAGGCGAC
ACACGCTCGG TCTTCACCCG CGCCGGGACG TTGATCGCAC AGCAATGGGC CGCATTCGGC
ATCGACGCCA AAGCCGTGCC GGCCGCGAAA CTTTGGCAGA CGGCGCTACA GCCCGGCGAT
TTCCAGGTTG CGATCGCCTG GAGCGTCGAG ACCTGGGGCG GCGATCCCGA CCTGTCGTTC
TTCCTGGACA GCTGGCATTC GCAGTTCGTG GCCAAGAAGG GTGACAATCA GCCGCCGCGC
AACTGGCAGC GCTGGAGCAA TCCGGAGCTC GACAAGATCA TCGAAAGCAT TCGCGGCATC
AGCGCCGACG ATCCGAAGGG CGTCGAGCTC GGCAAGGATT ATCTGAAGCT GGTCGCCCGC
GAAATGCCGA CGATCCCGCT GATGTCGTAT AACGTCTTCA CCTCGATGGA TACGACCTAT
TGGACCGGTT ATCCGACGAT CGCTGATCCC TATACCGATC CGGTGCCGAA TTGGGCCAAC
TCCAGGCTGA TGATGGTCAA GCTGAAGCCG GCACAACCGA AATAA
 
Protein sequence
MQQWKRLAFG VALAALGLTA TAKADDYTSL PRKETLIVEN PEGTIKNPGW FNIWVNGGGG 
VSTGLQQLTM DTLWYIDPEQ GLGGATWDNS LAADKPQYNA DFTEMTVKLR KGLFWSDGVE
FTADDVVYTV KTQMDHPGMV WSAAFSVQVA SVEATDPSTV VFKLKKPNSR FHAIFTVRWN
GAWIMPKHVF EKVEDPLRYD FANPVSLGAY KLKSYDPQGK WYTWEKRDDW QRTSLARFGE
PAPKYVTYAD PGPPDKRTIA QLEHNLDIIH DNTPEGMFTL KEKSKTIETW FPGFPFAHPD
PTLPAVIFNT QNPPFDNADV RWALALLIDI KAVDMASYRG AATLSALGVP PTAATMKDYQ
APMQDWLKNF EIDTGKSKIK PYDPTVGQQI ADILRKQPKF KDQIPTDAEA ISGAFGYGWW
KPDPKAAGEL LEKAGFKKSG GKWLTPDGQP FKIRMTVEGD TRSVFTRAGT LIAQQWAAFG
IDAKAVPAAK LWQTALQPGD FQVAIAWSVE TWGGDPDLSF FLDSWHSQFV AKKGDNQPPR
NWQRWSNPEL DKIIESIRGI SADDPKGVEL GKDYLKLVAR EMPTIPLMSY NVFTSMDTTY
WTGYPTIADP YTDPVPNWAN SRLMMVKLKP AQPK