Gene Information |
Locus tag | Rleg_7091 |
Symbol | |
ID | 8022377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | - |
Start bp | 500823 |
End bp | 501653 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644833928 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002985062 |
Protein GI | 241666978 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.51636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAAGT CTATGCTAAC GCGTCGAAAC GCGATGCTCG GAGCCGCCGC CCTCGTGGCA GCCGTCACCC TGGCGCAGCC GGCCGCCGCC GTCACGCCCG ACGAAATCAA GGCTCGTGGC AAGATCATCG TCGGAATTCA GGGCGACAAT CCGCCTTGGG GCTTTGTGAC CAGCGGCGGC AAGCAGGACG GCCTCGACGC CGACATCGCA ACGCTGTTTG CCAAGGAACT CGGCGTTTCC GTCGAGTTCG TGCCGCTTGA AGTCAACAAC CGCATTCCCG CACTGACGGC CGGCCGGGTC GACGTTCTGT TCGCAACGAT GGCGATGCTG CCGGATCGCG CAAAAGCCGT GCAGTTCAGC AAGCCCTATG TTGCCAATGC CATCGTCCTG ATCGGTCCGA AAAAGGCTGA GATCAAGACG AATGCCGACA TGGCCAAGTT CACGGTCGGC GTCGCCAAGG GGGCTGCTCA GGACACGCAG GTCACCAAGA ACGCGCCGCC CAGCACCACA ATCCGCCGAT ATGACGGAGA CGCCGCAAGC GTCCAGGCGC TGGTGTCCGG CCAGGTCGAA ACGCTTGGTG GCAACATCTT CTACATGGAC CGGCTGGAGA AGGCCCGTCC GGGCGAATTC GAAAACAAGC TTGAATTCCA GAAGCTCTAC AACGGTGCTT GCACCCGTCT CGGCGAAAAG GAAATCAATG CGGCGCTGAA CACCTTCATC GACAAGATCA AGGCCAACGG CGAACTCAAA ACCGTCTACG ACAAGTGGAT GAAGGTTCCG GTACCGGAAT TCCCGGAAAC ACTGGAAGGC ATTCCGTTCG CGGCGAAGTG A
|
Protein sequence | MFKSMLTRRN AMLGAAALVA AVTLAQPAAA VTPDEIKARG KIIVGIQGDN PPWGFVTSGG KQDGLDADIA TLFAKELGVS VEFVPLEVNN RIPALTAGRV DVLFATMAML PDRAKAVQFS KPYVANAIVL IGPKKAEIKT NADMAKFTVG VAKGAAQDTQ VTKNAPPSTT IRRYDGDAAS VQALVSGQVE TLGGNIFYMD RLEKARPGEF ENKLEFQKLY NGACTRLGEK EINAALNTFI DKIKANGELK TVYDKWMKVP VPEFPETLEG IPFAAK
|
| |
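The coordinate, length, and sequence fields in this record can be cross-checked against one another. The sketch below is a minimal illustration, not part of the source database: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TGA even though the gene lies on the minus strand). The names `span_length` and `translate` are hypothetical helpers. Translation table 11 assigns the same amino acids to codons as the standard genetic code, so a standard codon table is used here. Only the first 30 bases are translated to keep the example short.

```python
from itertools import product

# Values copied from the record above.
START_BP, END_BP = 500823, 501653
GENE_LENGTH_BP = 831
PROTEIN_LENGTH_AA = 276

# First 30 bases of the gene sequence and first 10 residues of the protein,
# copied from the Sequence section (spaces are display grouping only).
GENE_PREFIX = "ATGTTCAAGT CTATGCTAAC GCGTCGAAAC".replace(" ", "")
PROTEIN_PREFIX = "MFKSMLTRRN"

# Standard codon assignments (shared by translation table 11),
# ordered by first, second, third base over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # Gene length follows from the coordinates.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Protein length is the codon count minus the terminal stop codon.
    print(GENE_LENGTH_BP // 3 - 1 == PROTEIN_LENGTH_AA)
    # The start of the gene translates to the start of the listed protein.
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
```

Passing the full gene sequence (with spaces removed) to `translate` in place of `GENE_PREFIX` would allow the same comparison against the complete protein sequence.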