Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3423 |
Symbol | |
ID | 8014296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3445152 |
End bp | 3446138 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644825981 |
Product | putative sugar ABC transporter, substrate-binding protein |
Protein accession | YP_002977208 |
Protein GI | 241206112 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.490297 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAAGG CATTACTACT TACAGCCGCA GTTTTAGCAC TGACCGCCGG GCAGGCCTTG GCCAAGAAGC AACTCGTCAT CGTCGTCAAA GGCCTCGACA ATCCGTTCTT CGAAGCAATC AATCAAGGTT GCCAGAAATG GAACAAGGAG AACCCCACGG CGGAATACGA ATGCTTCTAC ACCGGTCCGG CATCGACATC CGATGAAGCC GGCGAAGCGC AGATCGTTCA GGATATGTTG GGCAAGGCGG ACACGGCTGC AATCGCAATC TCGCCGTCAA ATGCAAAGCT GATCGCCCAG ACCCTGAAGA CGGCCAATCC GACGGTTCCG GTGATGACGC TCGATGCCGA TCTCGCCGCC GAGGATGCAG CGTTGCGCAA GACCTATCTC GGCACCGACA ACTATCTGAT GGGCGCCCGC ATCGGCGAAT ACATCAAGAA GGGCAAGCCG AAGGGCGGCA AGATCTGCAC CATCGAAGGC AATCCGGGAG CGGACAATAT TCTACGCCGC GCACAAGGAA TGCGCGACAC GCTTACTGGC CAGAAAGGCT TGACCGAGCT GAAAGGCGAG GGTGGCTGGA CCGAGGTCGC CGGTTGCCCG GTGTTCACCA ATGACGACGG TGCCAAGGGC GTTCAGGCAA TGACCGATAT CCTCGCGGCC AATCCCGACT TGGACGCATT CGGCATTATG GGTGGCTGGC CATTGTTCGG TGCACCGCAA CCCTATCGCG ACCTGTTCAA GCCGCTGGCC GACAAGATCG CCAGCAACGA TTTCGTCATT GGTGCCGCCG ATACGATCGG CGACGAGGTC GCCATTGCGA AGGAAGGGCT GGTGACAGCG CTCGTCGGCC AGCGGCCATT CGAGATGGGC TACAAGGCAC CGTCGGTGAT GATGGATCTG ATTGCCGGCA AGCCGGTTGA AGATCCGGTC TTCACCGGGC TCGATGAGTG CACGAAGGAT ACAGTCGATA CCTGCATTCA AAAGTAG
|
Protein sequence | MRKALLLTAA VLALTAGQAL AKKQLVIVVK GLDNPFFEAI NQGCQKWNKE NPTAEYECFY TGPASTSDEA GEAQIVQDML GKADTAAIAI SPSNAKLIAQ TLKTANPTVP VMTLDADLAA EDAALRKTYL GTDNYLMGAR IGEYIKKGKP KGGKICTIEG NPGADNILRR AQGMRDTLTG QKGLTELKGE GGWTEVAGCP VFTNDDGAKG VQAMTDILAA NPDLDAFGIM GGWPLFGAPQ PYRDLFKPLA DKIASNDFVI GAADTIGDEV AIAKEGLVTA LVGQRPFEMG YKAPSVMMDL IAGKPVEDPV FTGLDECTKD TVDTCIQK
|
| |