Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3910 |
Symbol | |
ID | 8015868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3977137 |
End bp | 3978375 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 644826480 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002977691 |
Protein GI | 241206595 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.198843 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAGC GTCTATTGGC TGCGACCAGC ATCGCTACCT TATGTCTGGT GTCGGCCGCG TCAGCTGCCG AAAATGTCGA AATGTGGGTT CGCTCGGGGA TTGGCGATGC CTTCAAGAAG GTCGTCGAAG CCTATAATTC CGGTCACGAG AACAAGGTCG TGATGACCGA GGTGCCGTTC TCCGAGCTGG TACAGAAATA TGCGACGGCA ATCGCCGGCG GACAGGCGCC GGATGCCTTG TCGATGGATC TCATTTATAA TCCCGCCTTT GCCGCGGCCG GCCAGCTGGA AGACCTGACG GACTGGGCAA AATCCCTGCC CTATTTCAAT TCGCTTTCGC CATCGCATGT TCGCCTCGGC ACCTATCAGG ACCGGATTTA CGGCCTGCCG CTTTCGGTCG AAACCTCCGT CTTCGCCTGG AACAAGGATC TCTACAAGAA GGCCGGTCTC GACCCGGAAA AAGCGCCGGC GAATTGGGAC GAAATTACCG CCAATGCTGA GAAGATCCGG GCGCTGGGTG ACGATACCTA CGGCTTCTAT TTCTCCGGTG GCGGCTGCGG CGGCTGCATG ATCTTCACAT TCACGCCGCT TGTCTGGGGT GCCGGCGCTG ATATTCTGTC GGCCGACAGC AAGACGGCGA CGCTCGATAC GCCTGAGATG CGCAAGGCCG TCGATATCTA CCGCAACATG GTCAAGAAGG ACCTCGTACC GGCGGGTGCC GCCAGCGACA ACGGCGCCAA CTTCCTGACC TTCACCAACG GCAAGATCGG CCAGCAAAGC CTCGGCGCCT TTGCCATCGG CACGCTGGTA ACCGAGCATC CCGATATCAA CTTCGGCGTG ACCCTCATTC CTGGCGTCGA CGGCAAGCCC TCGTCCTTTG CCGGCGGCGA CAACTTCGTC ATCACCAAGG GCACGAAGAA GATCGACGCG GTGAAGGAGT TCCTCGAATA TATCTATTCG ATGGACGGCC AGAAGATCAT GGCGAAGTAT GGCAGCCTGC CGACGCGCGG CGATATCGCC GACAAAGTGC TTGAGGGCCT CGATCCGCGC ATGCAGGTCG GCCTGAAGGC GATCGGCGTC GCCAAGACAC CCTATACGCT GCAGTTCAAC GACCTGATCA ACAGCGCCAA CGGGCCTTGG GCCAGCTTCA CCAACGCCTC GATCTTCGGC GACGATGTCG ACGGGGCGTT TTCGAGCGCC CAGTCGGAGA TGCAATCGAT CATCGATAGC GGCCAATAA
|
Protein sequence | MIKRLLAATS IATLCLVSAA SAAENVEMWV RSGIGDAFKK VVEAYNSGHE NKVVMTEVPF SELVQKYATA IAGGQAPDAL SMDLIYNPAF AAAGQLEDLT DWAKSLPYFN SLSPSHVRLG TYQDRIYGLP LSVETSVFAW NKDLYKKAGL DPEKAPANWD EITANAEKIR ALGDDTYGFY FSGGGCGGCM IFTFTPLVWG AGADILSADS KTATLDTPEM RKAVDIYRNM VKKDLVPAGA ASDNGANFLT FTNGKIGQQS LGAFAIGTLV TEHPDINFGV TLIPGVDGKP SSFAGGDNFV ITKGTKKIDA VKEFLEYIYS MDGQKIMAKY GSLPTRGDIA DKVLEGLDPR MQVGLKAIGV AKTPYTLQFN DLINSANGPW ASFTNASIFG DDVDGAFSSA QSEMQSIIDS GQ
|
| |