Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2041 |
Symbol | |
ID | 5322900 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2093525 |
End bp | 2094469 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640790978 |
Product | ABC transporter binding protein |
Protein accession | YP_001327709 |
Protein GI | 150397242 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00512093 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.331849 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGAAGA AAGCTATAAC ATTGCTCTCG GCGGCGGTCA TCGCTGCGGG AGCAGGTTTC GTTTCAAGCG CATATGCGCA AGGCAAAACC TACTACTGGG TCTCGCACGG CTCTCCGGCG GACCCGGTGT GGACCTATTT TCTGGCAGGT GCAAAACAGT GGGCCGAGGA TACCGGCAAT ACGGTAAACA CGTCATTCCA GAACGGTGAC GTCCCTGCGC AGCAGGAAGC GATTAGAGCG GCGATTTCGG CGGGTGCGGC CGGCATCGCA ACCACCACGC CCGATCCGGG CAGCCTCAAC GAGGTCGTCA AGGAAGCAAA AGCGGCCGGC ATTCCGATCA TCAATTTCAA CACACCGGAC CCGCAGGCCG GCTTCGACGC CTATGTCGGA GGCGACAACA GAGCTTTCGG AAAGAACTGG GCGCAATATC TCGTCGACAA GAAGCTCGTG AAATCCGGCG ATTTCGTCTG GATGCCCGTC GAAGTTCCCG GCGCGACCTA CGGCGTTCAG GAAGAGGAAG GCATTGCCAG CGTCTTCAAG CCGTTGAACA TCACCTGGGA AGTCACCGAC GCAACGCTTG ACCAGGCGGA AATCATCACC CGCATGTCAG ACTACCTGAC TGCAAACCGT TCGAAGATCA AGGCGATCAT CGGACTGGGC GACCTCGTCA CCGGCAGCAT CAAGCGCGTC TTCGACCAGG TCGGCGTAAA ACCGGGAGAA ATTCCCGTGG TCGGCTGGGG CAATTCGATA GACACCACCC AGGAGGTTCT GACGGGCTAC GTGAATGCCG CCCAATGGCA AGACCCGCAG GCGACCAGCT ATATGGCACT GTCCATGGCT GCCATGGCAT CGAGCAAGAT ACCGCCAGGT TTCAATATCA TCACCGGCGC GCTCTACGAA AAGGATACGG CAGAGCTCTA CGACAAGATC CTCTCCGGCA AGTAA
|
Protein sequence | MLKKAITLLS AAVIAAGAGF VSSAYAQGKT YYWVSHGSPA DPVWTYFLAG AKQWAEDTGN TVNTSFQNGD VPAQQEAIRA AISAGAAGIA TTTPDPGSLN EVVKEAKAAG IPIINFNTPD PQAGFDAYVG GDNRAFGKNW AQYLVDKKLV KSGDFVWMPV EVPGATYGVQ EEEGIASVFK PLNITWEVTD ATLDQAEIIT RMSDYLTANR SKIKAIIGLG DLVTGSIKRV FDQVGVKPGE IPVVGWGNSI DTTQEVLTGY VNAAQWQDPQ ATSYMALSMA AMASSKIPPG FNIITGALYE KDTAELYDKI LSGK
|
| |