Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3007 |
Symbol | |
ID | 8013924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3004983 |
End bp | 3005951 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644825576 |
Product | aliphatic sulfonates family ABC transporter, periplsmic ligand-binding protein |
Protein accession | YP_002976804 |
Protein GI | 241205708 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.7861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGTGT TTCGTTCGGC GATAATCGGC GCCGCGGTTG CCTTGGGACT GGGGCTGTCA TCGGCTGAGG CTGCCGATCT TTCGGAAATC AGGATCGACT GGGCGACGTA TAATCCCGTC AGCGTTCTTC TGAAAAAGGA AGGCCTGCTC GAAAAGGAAT TCGCCAAGGA CAATATCAGC ATTCGCTGGG TGCAGTCCGC CGGCTCGAAC AAGGCGCTGG AATTCCTGAA CGCCGGATCG ATCGATTTCG GCTCGACGGC GGGTGCGGCG GCACTGATCG CGCGGGTCAA CGGCAATCCG ATCACATCCA TCTATGTGTA TTCGCGCCCC GAATGGACGG CTCTGGTCAC CCGCGCCGAC AGTCCGATTT CCACCGTTCA GGATCTGAAG GGCAAGACGA TCGCGGTGAC GCGCGGCACC GATCCGCATG TCTTTCTCGT CCGTGCCCTG GCCGATGCCG GCCTGAAACA ATCGGACGTG AAGCTGGTCC TGCTGCAGCA TGCCGATGGC AAGCTGGCGC TTCTGCGCGG CGATGTCGAT GCGTGGGCCG GGCTCGATCC GCTGATGGCG GCCGCCGAAG TGGACGACAA GGCAAAGCTC TTCTATCGCA AGCCGGAAAA CAACAGCTGG GGCGTGCTCA ATACGACCGA GACCTTTGCC GCGAACCATC CCGATATCAT CAAGCGCGTC ATCGCCGTCT ATGAGCAGGC ACGGGCGGAG GCGCTCGCCG ATCCTGCGGC ACTCAAGGCG GCACTGGTGG AAGCGGCAAA GCTGCCGGAC GACGTGATCG CCAAGCAGCT GGAGCGGACG GACATATCGC AATCGACGAT CGGCGATCTG CAGCGCGATA CGATCTCGAA AGCGGGCATC GCCCTGCAAA GCGCCGGTGT CCTGCCAGCA GACGTCGACA TTCCCAAAGT CACGAACGAG CTGATCGACG ATCGTTTTGC GGTCGGGAAA ACCCAATAA
|
Protein sequence | MGVFRSAIIG AAVALGLGLS SAEAADLSEI RIDWATYNPV SVLLKKEGLL EKEFAKDNIS IRWVQSAGSN KALEFLNAGS IDFGSTAGAA ALIARVNGNP ITSIYVYSRP EWTALVTRAD SPISTVQDLK GKTIAVTRGT DPHVFLVRAL ADAGLKQSDV KLVLLQHADG KLALLRGDVD AWAGLDPLMA AAEVDDKAKL FYRKPENNSW GVLNTTETFA ANHPDIIKRV IAVYEQARAE ALADPAALKA ALVEAAKLPD DVIAKQLERT DISQSTIGDL QRDTISKAGI ALQSAGVLPA DVDIPKVTNE LIDDRFAVGK TQ
|
| |