Gene Information |
Locus tag | Rleg_6384 |
Symbol | |
ID | 8016998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012854 |
Strand | - |
Start bp | 95171 |
End bp | 96310 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644828179 |
Product | sugar ABC transporter, substrate-binding protein |
Protein accession | YP_002979379 |
Protein GI | 241554166 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
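The coordinate and length fields in the table above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon. The table does not state either convention, so both are assumptions. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 95171
end_bp = 96310
reported_gene_length = 1140      # bp
reported_protein_length = 379    # aa


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length, protein_length)  # 1140 379
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)  # True True
```

Under these assumptions the reported 1140 bp and 379 aa agree with the coordinates: 1140 bp is 380 codons, one of which is the stop codon.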
Plasmid Coverage Information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.108159 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.0518769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACATTA TATTGGACAC CAATGGCAAG CCGTCAACGC TGCCGCACAC AAAAAAGGGG ATCGACATGA AGAATCTTGA AAACGGCATT TCCGCTTCGC TACGCCGCCA GCTTCTCGCC GGTGCTGCCG CCGCGGCCGC ACTCCTCGTC TTTTCGGCCG GTACGGCCTC GGCCGCCGCC AATTGCATCA AGGGTGACAG GAAAGCGCCC TATACGATCG GCTGGGCAAA CATCTATTCG GTGCCGACCT GGATGAAGCA GACCGAAGGC ACCATCACGG CCGAAGTGGA GGAGCTGAAG AAGGCGGGCC TGGTGAAGGA CCTGATGATC ACGGACGCGC AGGGTAACGC CCAGACCCAG ATCCAGCATA TCCAGTCGAT GATCGACGCC AATGTCGACG CCATCGTCGT GATCGCCGGT TCCTCCAACG CGCTCGACCG CGTCATATCA GATGCCTGCG ACAAGGGCAT TGCCGTCGTG AATTTCGACA GTCTGGTCAA TACCGACAAG GTGACGGCGA AGATCAACAC CGATTCCAAC GAATGGGGCG CGACCGCTGC CAAGTGGATG GTCGGCCAGC TCGGCGGCAA GGGCAAGATC ATCATCATGA ACGGCCCGGC CGGCATTTCG GTGAGCGACG ACCGCCGCAA GGGCGCCCAG CCGGTCCTTG ACGCCAATCC CGGTCTCCAG GTGATCACCG AGACGAACAC GGAATATAAC GTCGCCCCGG CACAGGAAGC GATGACCAGT CTGCTCTTTG CCAATCCCGA AATCGACGGC GTGCTGTCGC TCGGCGGCGC GCTATCGGCC GGCTCGGTGC TGGCCTTCGA GCGTCAGGGC CGCGACCAAG TGCCGACAAC AGGCGAAAAC GCAAGGCAGT TCCTGGAGCT CTGGAAGGAG AAGGGACTGA AGGGCTGGGC CACCATGCAG CCCAACTGGC TCGGCGCGCT GTCTGTTTAC ACCGCCGTGC AGGCGCTGGA AGGCAAGGAC GTTCCGGCCT TCGTCAAGGT GCCGCTGCCT GTCATCGACG ACAGCACGAT CGGCAGCTAC CTCGCCCGGG CCGACAAGTT CCCGGCGGAC GGCTATATCT ACTCGGACTA CGACAAGGCG CTCTTCGACA AGCTGCTTGC CGCCAAGTAA
|
Protein sequence | MYIILDTNGK PSTLPHTKKG IDMKNLENGI SASLRRQLLA GAAAAAALLV FSAGTASAAA NCIKGDRKAP YTIGWANIYS VPTWMKQTEG TITAEVEELK KAGLVKDLMI TDAQGNAQTQ IQHIQSMIDA NVDAIVVIAG SSNALDRVIS DACDKGIAVV NFDSLVNTDK VTAKINTDSN EWGATAAKWM VGQLGGKGKI IIMNGPAGIS VSDDRRKGAQ PVLDANPGLQ VITETNTEYN VAPAQEAMTS LLFANPEIDG VLSLGGALSA GSVLAFERQG RDQVPTTGEN ARQFLELWKE KGLKGWATMQ PNWLGALSVY TAVQALEGKD VPAFVKVPLP VIDDSTIGSY LARADKFPAD GYIYSDYDKA LFDKLLAAK
|
| |
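The gene and protein sequences above can be compared by translating the nucleotide sequence. The following is a minimal sketch rather than the database's own method. It builds the codon table in plain Python; translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons it permits. The sketch does not treat alternative start codons specially, which makes no difference here because the gene begins with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first and last 30-base blocks of the gene sequence are used as input.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last 30 bases of the gene sequence listed above.
first_block = "ATGTACATTA TATTGGACAC CAATGGCAAG"
last_block = "CTCTTCGACA AGCTGCTTGC CGCCAAGTAA"

print(translate(first_block))  # MYIILDTNGK
print(translate(last_block))   # LFDKLLAAK
```

Both results match the start and end of the listed protein sequence. The gene lies on the minus strand, but the sequence is shown in coding orientation (it begins with ATG and ends with TAA), so no reverse complement is needed before translating. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.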