Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3393 |
Symbol | |
ID | 8014270 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3409991 |
End bp | 3411229 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644825951 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002977178 |
Protein GI | 241206082 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.205631 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAT CCCTTAACAA GACGCTTCTC GGTGCGGCAT TGATCGGCGC ATCCCTTGCG CCGCATGCTT TCGCCGAAAC GACGCTGAAC GCGCTTTTCA TGGCCCAGGC CGCCTATAGC GAGGCCGATG TGCGCGCCAT GACCGACGCC TTCGCCAAGG CGAACCCCGA TATCAAGGTC AATCTCGAAT TCGTTCCCTA TGAAGGCCTG CACGACAAGA CGGTGCTGGC GCAGGGTTCC GGCGGCGGTT ACGACGTTGT CCTCTTCGAC GTCATCTGGC CGGCAGAATA CGCCAGCAAC AAGGTGCTGG TCGACGTCTC CTCTCGCGTC ACCGACGAGA TGAAGAAAGG TGTGCTGCCG GGAGCCTGGA CCACCGTGCA ATATGATAGC AAATATTACG GCATGCCGTG GATCCTCGAC ACCAAATACC TGTTCTACAA CAAGGAGATC CTCGAAAAGG CCGGCATCAA GACTCCGCCC AAGACCTGGG ACGAGCTGAC CGAACAGGCA AAGACCATCA AGGACAAGGG CCTGCTCGCC ACGCCGATCG CCTGGAGCTG GTCGCAGGCC GAAGCCGCGA TCTGCGATTA CACCACGCTC GTCAGCGCCT ATGGCGGCGA TTTCCTGAAG GACGGCAAGC CGGCCTTCCA GACCGGCGGT GGCCTCGATG CACTGAAATA CATGGTCTCC AGCTATTCCT CGGGCCTCAC CAATCCGAAC TCCAAGGAAT TCCTCGAAGA GGACGTCCGT AAGGTCTTCG AAAACGGCGA TGCCGCCTTC GCGCTGAACT GGACCTACAT GTACAACATG GCCAACGATC CGAAGGACAG CAAGGTCGCA GGCAAGGTCG GCGTCGTGCC GGCGCCGGGT GTTGCCGGCA AAAGCGAGGC TTCGGCCGTC AACGGCTCGA TGGGCCTCGG CATCACCTCG GCCAGCAAGC ATCCTGATGA GGCCTGGAAA TACATCACCT TCATGACCTC GCAGGCGACG CAGAATGCCT ATGCCAAGCT CAGCTTGCCG ATCTGGGCGT CCTCCTATGA GGACCCTGAT GTCACCAAGG GTCAGGAAGA ATTGATCTCC GCCGCCAAGA TCGGCCTTGC CGCGATGTAT CCGCGTCCGA CGACGCCGAA ATATCAGGAG CTCTCGACCG CGCTGCAACA GGCGATCCAG GAATCGCTGC TCGGCCAGTC CTCTCCCGAA GATGCGCTGA AGTCGGCGGC CGACAATAGC GGCCTCTGA
|
Protein sequence | MLKSLNKTLL GAALIGASLA PHAFAETTLN ALFMAQAAYS EADVRAMTDA FAKANPDIKV NLEFVPYEGL HDKTVLAQGS GGGYDVVLFD VIWPAEYASN KVLVDVSSRV TDEMKKGVLP GAWTTVQYDS KYYGMPWILD TKYLFYNKEI LEKAGIKTPP KTWDELTEQA KTIKDKGLLA TPIAWSWSQA EAAICDYTTL VSAYGGDFLK DGKPAFQTGG GLDALKYMVS SYSSGLTNPN SKEFLEEDVR KVFENGDAAF ALNWTYMYNM ANDPKDSKVA GKVGVVPAPG VAGKSEASAV NGSMGLGITS ASKHPDEAWK YITFMTSQAT QNAYAKLSLP IWASSYEDPD VTKGQEELIS AAKIGLAAMY PRPTTPKYQE LSTALQQAIQ ESLLGQSSPE DALKSAADNS GL
|
| |