Gene Information
Locus tag | Rleg2_5951 |
Symbol | |
ID | 6977337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 364626 |
End bp | 365456 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643393403 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002278221 |
Protein GI | 209546331 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
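The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both are assumptions, not statements from the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 364626
end_bp = 365456
gene_length_bp = 831
protein_length_aa = 276

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon, which does not
# contribute a residue to the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the span equals the listed gene length (831 bp), and 277 codons minus the stop codon gives the listed protein length (276 aa).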
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0245803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0107841 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTCAAAT CTATGCTGAC GCGCCGAAAC GCGATGCTCG GTGCCGCGGC CCTGGTGGCT GCCGTCACCT TGGCGCAACC GGCCGCCGCC ATCACGCCTG ACGAAATCAA GGCTCGCGGC AAGATCATCG TCGGAATTCA GGGCGACAAC CCGCCTTGGG GCTTTGTGAC CAGCGGCGGC AAGCAGGACG GCCTCGACGC CGACATTGCG ACGCTGTTCG CCAAGGAACT GGGCGTTTCC GTCGAGTTCG TGCCGCTCGA AGTCAACAAC CGCATTCCGG CGCTGACGGC CGGCCGCGTC GATGTTCTGT TTGCAACCAT GGCAATGCTG CCGGATCGCG CCAAGGCGGT GCAGTTCAGC AAGCCCTATG TTGCCAATGC CATCGTTCTG ATCGGCCCCA AATCGGCGGA GATCAAGACC AACGCCGACA TGGCCAAGTT CACGGTCGGC GTCGCCAAGG GCGCTGCGCA GGACACGCAG GTGACGAAGA ACGCGCCTGA GGGCACGACG ATCCGCCGCT ATGACGGAGA CGCTGCGAGC GTCCAGGCCC TGGTGTCCGG CCAGGTCGAC ACGCTGGGCG GCAACATTTT CTATATGGAC CGGGTGAACA AGGCGCGCCC GGGCGAATTC GAAAACAAGC TTGAATTCCA GAAGCTCTAC AACGGTGCTT GCACGCGTCT CGGGGAGAAG GAAATCAATG CGGCGCTGAA CACCTTCATC GACAAGATCA AGACAAACGG CGATCTCAAG GCCGTCTACG ACAAGTGGAT GAAGGTCCCG GTTCCGGAGT TCCCGGAAAA GCTGGAAGGC ATTCCGTTCG CGGCGAACTG A
Protein sequence | MFKSMLTRRN AMLGAAALVA AVTLAQPAAA ITPDEIKARG KIIVGIQGDN PPWGFVTSGG KQDGLDADIA TLFAKELGVS VEFVPLEVNN RIPALTAGRV DVLFATMAML PDRAKAVQFS KPYVANAIVL IGPKSAEIKT NADMAKFTVG VAKGAAQDTQ VTKNAPEGTT IRRYDGDAAS VQALVSGQVD TLGGNIFYMD RVNKARPGEF ENKLEFQKLY NGACTRLGEK EINAALNTFI DKIKTNGDLK AVYDKWMKVP VPEFPEKLEG IPFAAN
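The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC percentage computed, and its codons translated. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and not part of any database API. The codon table is the standard one, whose codon-to-residue assignments translation table 11 shares; alternative start codons, which table 11 permits, are not handled here.

```python
# Sketch only: helper names are hypothetical, not part of any database API.
BASES = "TCAG"
# Residues for the 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-residue assignments as the standard table.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence above (10 codons).
fragment = clean("ATGTTCAAAT CTATGCTGAC GCGCCGAAAC")
peptide = translate(fragment)
print(peptide)
```

The translated fragment is `MFKSMLTRRN`, which matches the first ten residues of the protein sequence listed above. Passing the full gene sequence through `clean` and `gc_content` would allow a comparison with the listed GC content.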