Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4029 |
Symbol | |
ID | 5318329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 488012 |
End bp | 489520 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640775837 |
Product | extracellular solute-binding protein |
Protein accession | YP_001312770 |
Protein GI | 150376174 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATCC GCTATTGTAC CGGCGCCTTG GCGCTATTTG CAGCGACGCT GCTCGCTGGC GCCGCCATCG CAGCCGAAGG GGAAGTGAAG ATCGTATTGC CGGAGCAGCC GGCCAATCTC GATCCCTGTC GCTCCATCCG GTCGGATATC GGCCGCATCA TCAATTCCAA CATCACCGAG ACCCTGACCG TCATCGATAC GGAGAAAGGC ACCGTTGGCC CCTGGCTCGC GGAAAAATGG GAGCAGGTGG ACGATCTTAC CTGGCGCGTG CATCTCAAGA GCGGCGTCAA GTTCCAGGAT GGCGTGGAGT TCAATGCCGA AGCGGTGGTC AAGTCCATCA ATCGCCTGAT GAATCCGAAC ATCACCTGCG ACAGCCGCTC CAAATTCGGC GACGTCAAGC TGACGCCGAA GGCGGTCGAC GCGCAGACGG TGGAGATTTC CTCCGACAGC CCGGTGCCGA TCATGCCGAC ACTGCTCGGC ACCGTCCAGA TCGTCTCGCC GAACATGCCC TTCGACAAGG AAAGCAACGA TCCCGTCGGC ACGGGCCCCT ATGTGGTCGA AAGCAAGTCC AATGAACAGA TCGTGCTGAA GCGTCACGAT GCCTACTGGG GTGCCAAGCC GGACGTCACG CGTGCAACCT ATGTCTGGCG CAGCGAGTCG GCTATCCGCG CGGCCATGGT CGAATCCGGT GAAGCGGATA TGACTCCGTC CATCGCGGTA CAGGACGCGA CCAATGCAGA GTCGGACTTC GCCTATCTGA ATTCCGAGAC GACACGCATG CGGATCGATG CGCAAATCCC ACCGCTCGAC GACGTTCGTA TCCGCAAGGC CCTGAACATG GCCATCGACT GGGACGGGAT GGGCGAAGCG CTGTTCGGCA AGGATGTGCT CAGGGCGTCG CAAATGGTCG TTCCGGGCGT GCGCGGACAC AATCCCGATA TCAAGCCATG GACCTACAAT CCCGAAGAGG CGATGAAGCT GGTCGAGCAG GCCAAGGCCG ACGGCGCTCC GGTCGACAAG GAGATCGTGC TGATCGGGCG CAACGGCTTT TTCCCGAATT CGGCTGAAAG CCTGGAAGCG ATCCGGTCCA TGTGGCAGGA AATCGGGCTC AACGTTTCGA TACGCCAGCT CGAGGCGGCA GACTGGGTAC GCTATCTCGA CAAGCCTTTT CCCGAAGGGC GCGGCCCGAC CCTGTTCCAG CAGCAGCACG ACAACAATAC CGGCGATGCC GGGTTCACCG CGCCGGTCAT GTATCTGAGC GAGGGGCAAT ATTCGACGAT CGCAGACAAG GATCTCGATG CCGTCCTCAA GAAAGCCATG GCTGCGACCG GCGACGAGCG CGAAAAGCTC TTCCAGGAGG TCTTCGCAAA GGTGCACGAC GAGATCGTTG CAGACGTGCC GATGTACCAC ATGATCGGAT ATGTCCGCGT CGGCTCGCGC CTCGACTGGA AGCCGGATCT CAAAACGAAC AGCGAGATCG CGCTCTCCGA GATTCACTTC AAGGACTGA
|
Protein sequence | MIIRYCTGAL ALFAATLLAG AAIAAEGEVK IVLPEQPANL DPCRSIRSDI GRIINSNITE TLTVIDTEKG TVGPWLAEKW EQVDDLTWRV HLKSGVKFQD GVEFNAEAVV KSINRLMNPN ITCDSRSKFG DVKLTPKAVD AQTVEISSDS PVPIMPTLLG TVQIVSPNMP FDKESNDPVG TGPYVVESKS NEQIVLKRHD AYWGAKPDVT RATYVWRSES AIRAAMVESG EADMTPSIAV QDATNAESDF AYLNSETTRM RIDAQIPPLD DVRIRKALNM AIDWDGMGEA LFGKDVLRAS QMVVPGVRGH NPDIKPWTYN PEEAMKLVEQ AKADGAPVDK EIVLIGRNGF FPNSAESLEA IRSMWQEIGL NVSIRQLEAA DWVRYLDKPF PEGRGPTLFQ QQHDNNTGDA GFTAPVMYLS EGQYSTIADK DLDAVLKKAM AATGDEREKL FQEVFAKVHD EIVADVPMYH MIGYVRVGSR LDWKPDLKTN SEIALSEIHF KD
|
| |