Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6804 |
Symbol | |
ID | 8022734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 245090 |
End bp | 246994 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644833670 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002984804 |
Protein GI | 241666720 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.910845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACAGT GGAAGAGGCT CGCATTTGGC GTCGCGCTGG CCGCACTCGG CCTGACAGCG ACGGCCAAGG CCGATGACTA CACCTCCTTG CCGCGCAAGG AGACGCTCAT CGTCGAAAAT CCGGAAGGGA CGATCAAAAA TCCCGGCTGG TTCAACATCT GGGTCAATGG CGGCGGCGGC GTATCGACCG GCCTGCAGCA GCTGACCATG GATACGCTCT GGTATATCGA CCCCGAACAA GGGCTTGGCG GCGCGACCTG GGACAATTCT CTGGCTGCCG ACAAGCCGCA ATACAATGCC GACTTTACCG AGATGACCGT GAAACTGCGC AAGGGGCTCT TCTGGAGCGA TGGCGTGGAG TTCACGGCAG ACGACGTGGT CTATACCGTG AAGACGCAGA TGGATCACCC CGGAATGGTC TGGAGTGCTG CCTTCTCGGT GCAGGTGGCA AGCGTCGAGG CGACCGACCC CTCGACTGTG GTGTTCAAGC TAAAGAAGCC CAATTCGCGC TTCCATGCCA TCTTCACCGT TCGCTGGAAC GGCGCCTGGA TCATGCCCAA GCATGTTTTC GAGAAAGTCG AAGATCCGCT TCGTTATGAT TTCGCCAATC CCGTTTCGCT CGGCGCCTAC AAGCTCAAGT CCTACGACCC CCAGGGCAAG TGGTATACCT GGGAGAAGCG CGACGACTGG CAGCGGACAT CGCTTGCCCG CTTCGGCGAG CCGGCTCCGA AATATGTGAC CTATGCCGAT CCCGGCCCGC CGGATAAACG CACCATCGCT CAGCTCGAGC ACAATCTCGA TATCATTCAC GACAACACGC CCGAGGGCAT GTTCACGCTG AAGGAAAAAT CCAAGACGAT CGAAACCTGG TTCCCGGGTT TCCCCTTCGC CCATCCGGAC CCGACGCTTC CGGCCGTGAT CTTCAACACC CAGAATCCGC CCTTCGACAA TGCCGATGTA CGCTGGGCGC TTGCCCTGCT GATCGACATC AAGGCGGTGG ATATGGCGAG CTATCGCGGG GCAGCGACGC TTTCGGCGCT CGGCGTGCCG CCGACGGCGG CCACGATGAA GGACTATCAG GCGCCGATGC AGGATTGGCT GAAGAATTTC GAGATCGATA CCGGCAAGAG CAAGATCAAG CCTTATGACC CGACGGTCGG GCAACAGATC GCCGATATCC TGCGCAAGCA GCCGAAGTTC AAGGACCAGA TCCCGACCGA CGCGGAGGCG ATCAGCGGTG CCTTCGGCTA TGGCTGGTGG AAACCGGATC CGAAGGCTGC CGGCGAACTG CTGGAGAAGG CAGGCTTCAA GAAATCCGGC GGCAAATGGC TGACCCCTGA TGGACAGCCC TTCAAGATCC GGATGACGGT GGAAGGCGAC ACACGCTCGG TCTTCACCCG CGCCGGGACG TTGATCGCAC AGCAATGGGC CGCATTCGGC ATCGACGCCA AAGCCGTGCC GGCCGCGAAA CTTTGGCAGA CGGCGCTACA GCCCGGCGAT TTCCAGGTTG CGATCGCCTG GAGCGTCGAG ACCTGGGGCG GCGATCCCGA CCTGTCGTTC TTCCTGGACA GCTGGCATTC GCAGTTCGTG GCCAAGAAGG GTGACAATCA GCCGCCGCGC AACTGGCAGC GCTGGAGCAA TCCGGAGCTC GACAAGATCA TCGAAAGCAT TCGCGGCATC AGCGCCGACG ATCCGAAGGG CGTCGAGCTC GGCAAGGATT ATCTGAAGCT GGTCGCCCGC GAAATGCCGA CGATCCCGCT GATGTCGTAT AACGTCTTCA CCTCGATGGA TACGACCTAT TGGACCGGTT ATCCGACGAT CGCTGATCCC TATACCGATC CGGTGCCGAA TTGGGCCAAC TCCAGGCTGA TGATGGTCAA GCTGAAGCCG GCACAACCGA AATAA
|
Protein sequence | MQQWKRLAFG VALAALGLTA TAKADDYTSL PRKETLIVEN PEGTIKNPGW FNIWVNGGGG VSTGLQQLTM DTLWYIDPEQ GLGGATWDNS LAADKPQYNA DFTEMTVKLR KGLFWSDGVE FTADDVVYTV KTQMDHPGMV WSAAFSVQVA SVEATDPSTV VFKLKKPNSR FHAIFTVRWN GAWIMPKHVF EKVEDPLRYD FANPVSLGAY KLKSYDPQGK WYTWEKRDDW QRTSLARFGE PAPKYVTYAD PGPPDKRTIA QLEHNLDIIH DNTPEGMFTL KEKSKTIETW FPGFPFAHPD PTLPAVIFNT QNPPFDNADV RWALALLIDI KAVDMASYRG AATLSALGVP PTAATMKDYQ APMQDWLKNF EIDTGKSKIK PYDPTVGQQI ADILRKQPKF KDQIPTDAEA ISGAFGYGWW KPDPKAAGEL LEKAGFKKSG GKWLTPDGQP FKIRMTVEGD TRSVFTRAGT LIAQQWAAFG IDAKAVPAAK LWQTALQPGD FQVAIAWSVE TWGGDPDLSF FLDSWHSQFV AKKGDNQPPR NWQRWSNPEL DKIIESIRGI SADDPKGVEL GKDYLKLVAR EMPTIPLMSY NVFTSMDTTY WTGYPTIADP YTDPVPNWAN SRLMMVKLKP AQPK
|
| |