Gene Information |
Locus tag | Rleg_4801 |
Symbol | |
ID | 8007485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 170237 |
End bp | 171556 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644821731 |
Product | putative branched-chain amino acid ABC transporter, substrate-binding protein |
Protein accession | YP_002972991 |
Protein GI | 241113156 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
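The coordinate, gene-length, and protein-length fields above are linked by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(170237, 171556)
print(length)                  # 1320
print(protein_length(length))  # 439
```

Both values agree with the Gene Length (1320 bp) and Protein Length (439 aa) fields of this record.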
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.409444 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.455184 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGGCA TTTTCCGCAA CGGCAAATTC AACGCGGCAT CGCTGAGCCG CCGCTCTTTC ATCGCATCGA CCGTGGCGGG TGGTGCGGCG CTGGCGCTTT CCGGCCGCAC GGCCTTCGCT CAGAGCGGCG ATACGTTGAA GGTCGGCTTC ATCAGCCCGC GCACCGGCCC TCTCGGCGGC TTCGGTGAGA CGGACGGCTA CGTGCTGGAA CTGGCGCGCA AGGCGCTGGC GAACGGCCTG CAGGCGGGCG GCAAGACTTG GAAGGTTGAG ATCCTCGACC AGGACACCCA ATCCGATCCC TCGCGCGCCG GCCAGCTGGC GAAGGACCTG ATCAACAACC AGGCGATCGA TCTGATGCTT GCCGTCTCGA CGCCCGAAAC CATCAATCCC GTGGCTGACG CATGCGAAGC AGCCGGCATT CCCTGCCTCT CGACGGTCAT GCCCTGGGAA GCCTGGTATT TCGGCCGCGG CGCCAAGCCG GGCGCGCCCT CGCCGTTCAA GTGGACCTAT CATTTCGGCT TCGGTGTCGA AGAGTTCCAC AAGGCCTATG TTTCGCAGTG GAACCTGATC GAGACCAACA AGAAGGTCGG CGTCATGTAT CCCAACGACG CCGACGGCAA TGCGATCCGC ACCCATCTGG CGCCGGCGCT CGCCAAGGCC GGCTTCACCA TCGTCGATCC CGGAGCCTAT GAAACCGGAA CCACCGACTT TACCGCGCAG ATCGCTCTCT TCAGGCAGGA GGGCGTGGAG ATCTTCAACT CGTTCCCGAT CCCGCCCGAC TTCGCCGCCT TCTGGCGTCA GGCCGCGCAG CAGGGCCTCA CCCAGCAGAT CAAGATCTGC CAGATCGCCA AGACCGGCCT GTTTCCCTCC GACATCGAGG CGCTCGGCGA CCTCGGCCTG AACATCGGCA GCGCCGCCTA CTGGCACAAG GCCTTCCCCT ATAAATCCAC GCTGACCGGC GTCTCCGGAA CCGAACTCGC CGACGGCTAT GAAACGGCAA GCGGCAAGCA GTGGACGCAG CAGCTCGGCG CCAGCCTTGC GCTTCTCGAC GCCGGCTTCG ATGCGCTGAA GGCGAGCACC GACGTCAAGA GCAAGGAGGC TGTGGCCAAG GCGATCAGCA CGCTGAAGAC CACGACCATC GCCGGCAAGG TCGACTTCAC CAGCGGCCCC GTCGCCAACG TCTCTCCCGG ACCGATCATC GGCACGCAAT GGGTGAAAGC GCCGGAGGGC TCGAAGTTCG CGCTCGACTA TGTCGTCACC GAAAACGCCA CCGACCCCAA TGTCCCGGTC GGCGCCAAGC TCACCGCCTA TAACGGGTAA
|
Protein sequence | MNGIFRNGKF NAASLSRRSF IASTVAGGAA LALSGRTAFA QSGDTLKVGF ISPRTGPLGG FGETDGYVLE LARKALANGL QAGGKTWKVE ILDQDTQSDP SRAGQLAKDL INNQAIDLML AVSTPETINP VADACEAAGI PCLSTVMPWE AWYFGRGAKP GAPSPFKWTY HFGFGVEEFH KAYVSQWNLI ETNKKVGVMY PNDADGNAIR THLAPALAKA GFTIVDPGAY ETGTTDFTAQ IALFRQEGVE IFNSFPIPPD FAAFWRQAAQ QGLTQQIKIC QIAKTGLFPS DIEALGDLGL NIGSAAYWHK AFPYKSTLTG VSGTELADGY ETASGKQWTQ QLGASLALLD AGFDALKAST DVKSKEAVAK AISTLKTTTI AGKVDFTSGP VANVSPGPII GTQWVKAPEG SKFALDYVVT ENATDPNVPV GAKLTAYNG
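The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC percentage, and translate the gene sequence. It rests on stated assumptions: the gene sequence is taken to be the coding strand (it begins with ATG and ends with TAA, although the gene lies on the minus strand), and the amino-acid assignments of translation table 11 are taken to be the same as those of the standard table, with the two differing only in permitted start codons. All names (`clean`, `gc_percent`, `translate`) are hypothetical helpers, not functions of an existing library.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGAACGGCA TTTTCCGCAA CGGCAAATTC"
print(translate(prefix))  # MNGIFRNGKF
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.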