Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4180 |
Symbol | |
ID | 8014970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4274550 |
End bp | 4276193 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644826750 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002977960 |
Protein GI | 241206864 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.336386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATGA CCAAACTCAG CCGCAATTTT CGCCTGCTTT CCGCGGGAGC CGCTCTTTCG CTCCTGATGA TGGCGGCACC CTCGGCCTTT GCCGAGACAC CGAAGGATAC GCTGGTCGAA GGTTTCGCCA TCGACGATAT CATCACGATG GATCCGGGTG AGGCTTTCGA GCTTTCGACC GCCGAAATCA CCACCAACAG CTACAGCCTG CTCGTCCGTC TCGACATGGA CGACACGTCC AAGGTGAAGG GCGATCTGGC CGAGAGCTGG AGCGTTTCCG ATGACGGCCT TACCTATACG TTCAAGCTGA AGTCAGGCCT GAAATTCGCC TCCGGCAACC CGATCACCGC CGAAGACGTC GCCTGGTCGT TCGAGCGTGC CGTCAAGCTC GACAAGAGCC CAGCCTTCAT CCTCACCCAG TTCGGTCTGA CCGGCGATAA CGTCGCGGAA AAAGCCAAGG CGGCCGATGC CGGCACTTTC GTCTTCACGG TCGACAAGGC CTATGCGCCG AGCTTCGTGC TCAACTGCCT GACGGCAACC GTCGCTTCCG TCGTCGACAA GAAGCTGGTG CTGGAGCATG TGAAGGCGGT GGCGCCCGAT GCCGACCACA AATACGACAA CGACTTCGGC AATGAATGGC TGAAGACCGG CTATGCCGGC TCCGGCGCCT ATAAGATGCG CGAATGGCGC GCCAACGAAG TCGTCGTGCT GGAGCGCAAT GACAATTATT ATGGTGACAA GGCAAAGCTC AACCGCGTCA TCTACCGCTA TATGAAGGAA AGCGCTGCCC AGCGGCTGGC GCTCGAAGCC GGCGACATCG ATATCGCCCG CAACCTCGAG CCGGGCGACA TCGACGCGGT TTCGAAGAAT GCGGATCTGG CGACAACCAG TGCGCCGAAG GGCACGATCT ATTATGTCAG CCTGAACAAC AAGAACGAGA ACCTGAAGAA GCCGGAAGTC CAGGAAGCCT TCAAATATCT GGTCGATTAC GATGCGATCA GCGCAACGCT GATCAAGGGT ATCGGCGAGA TCCATCAGAC CTTCCTGCCA AAGGGTCAGC TCGGCGCACT CGATGAGAAT CCCTACAAGC TCGATGTCGC CAAGGCCAAG GAACTGCTGG CCAAGGCCGG CGTACCCGAC GGTTTCTCGA TCACCATGGA CGTACGCAAC AGCCAGCCGG TGACCGGTAT CGCCGAATCG ATGCAGCAGA CGCTGGCGCA GGCCGGGGTG AAGATGGAAA TCATCCCCGG CGACGGCAAG CAGACGCTGA CCAAGTACCG CGCGCGTACG CACGACATGT ATATCGGCCA GTGGGGTTCG GACTATTTCG ACCCGAATTC CAATGCCGAC ACCTTTACCG GCAATCCTGA CAATTCCGAT GCCGGCACGG TGAAGACGCT CGCATGGCGC AACACCTGGG AAGCGCCGGA ACTCGACAAG CAAGCCAAGG CAGCCCTTCT GGAACGCGAC GCTGCCAAGC GCGCCGCCAT ATATCAGGAC ATCCAGAAGA AGTATCTGGC AAACAGCCCC TTCGTCTTCA TCTTCCAGCA GACCGAGGTG GCCGGCTACC GCAAGAGTGT GAAGGACTTC AAGCTGGGTC CGAGCTTCGA CACCAATTTC GTCGGTCCGA TCGCCAAGGA ATAG
|
Protein sequence | MMMTKLSRNF RLLSAGAALS LLMMAAPSAF AETPKDTLVE GFAIDDIITM DPGEAFELST AEITTNSYSL LVRLDMDDTS KVKGDLAESW SVSDDGLTYT FKLKSGLKFA SGNPITAEDV AWSFERAVKL DKSPAFILTQ FGLTGDNVAE KAKAADAGTF VFTVDKAYAP SFVLNCLTAT VASVVDKKLV LEHVKAVAPD ADHKYDNDFG NEWLKTGYAG SGAYKMREWR ANEVVVLERN DNYYGDKAKL NRVIYRYMKE SAAQRLALEA GDIDIARNLE PGDIDAVSKN ADLATTSAPK GTIYYVSLNN KNENLKKPEV QEAFKYLVDY DAISATLIKG IGEIHQTFLP KGQLGALDEN PYKLDVAKAK ELLAKAGVPD GFSITMDVRN SQPVTGIAES MQQTLAQAGV KMEIIPGDGK QTLTKYRART HDMYIGQWGS DYFDPNSNAD TFTGNPDNSD AGTVKTLAWR NTWEAPELDK QAKAALLERD AAKRAAIYQD IQKKYLANSP FVFIFQQTEV AGYRKSVKDF KLGPSFDTNF VGPIAKE
|
| |