Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4140 |
Symbol | |
ID | 5319136 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 610706 |
End bp | 612787 |
Gene Length | 2082 bp |
Protein Length | 693 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640775945 |
Product | extracellular solute-binding protein |
Protein accession | YP_001312878 |
Protein GI | 150376282 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0860796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAC ATCGATCGAA TAGAACGGCG TACCTGCTGA TCGGCATATC CGCATTTGCC GTGCAGGCCT TCGCATCAGA GCCTACGATC GTGCCCGAAC AGCCGCCATT CCCGGCGCAG GGCAAAATCA CCTATGTATC CCGCGAATCC ATTCTCGAGT TCAAGGCGCT GCCCGAGTAC AGGGAGCCCG AATGGGTCAC GGAGAAATTC GTCAAGGCCG GCAAGCTGCC GCCGCTATCC GAGCGCCTGC CGAAAGAACC GATGGTCTTC AAAGCAGGCA ACATGCCGGA CGGAATGGGC GTCTATGGCG ATGTTATGCG CCATGTGATC GGCGGCCGGC CGGAAGGCTG GAACTACAGT GCCGGCCAGA CCCAGGGATG GGGCGGCATC GACATTGGAA TGTTCGAGTG CCTGACCCGC ACCGCACCGC TTTTCCAGGT TGAGGCGGAC GACATGGAGC CGCTGCCGAA CCTGGCGAAG AGCTGGGAAT GGTCCGAGGA CGGCCACAAG CTCACCATGC ATCTGATCGA GGGCGCGAAA TGGTCGGACG GCGATCCCTT CGACGCCGAA GACGTCATGT TCTATTGGGA GGACAATGTC CTCGATCCGA GCGTTTCGCC ATTGAACGGC GCGACGCCCG AGACCTTCGG CGAGGGAACG ACGCTGAAAG AGGTCGATAA ATACACGGTC GAATGGACCT TCAAGGAGGC CTTCCCGCGT CAGCATCTTT TCGCAATGGC CTATGGCACC TTCTGCCCCG GCCCGTCGCA CATCCTCAAG ACCAAGCACC CGAAATATGC AGGCACGACC TATAACGAAT ACAAGAACGG CTTCCCGGCC GAATATCTGA ATCTGCCGGT GATGGGCGGA TGGGTACCGG TCGCCTACCG CCCGGATGAT ATCATCGTGC TGCGCCGCAA CCCCTATTAC TGGAAGGTCG ACGAGGCCGG AAACCAGCTG CCCTACCTCA ATGAGCTTCA CTACAAGCTC TCGACCTGGG CCGACCGAGA CGTGCAGGCA ATCGCCGGAT CGGGGGACTT CTCGAATCTG GAGCAGCCGG AAAACTTCGT TGAATCGCTG AAGCGAGCCG CCAATGAGAG CGCGCCGGCA CGGCTTGCCT TCGGTCCGCG CGTCATCGGC TACAACATGC ACATGAACTT CTCCGGCAAT GGTTGGGGAG ATCCCGATGA ACGCGCCAAG GAGGTGCGTG AGCTGAATCG CAGCCTCGAC TTCCGCAAGG CGGTCACCAT GGCGGTCGAC CGCAAGAAGC TGGGGGAAGC GCTGGTAAAA GGTCCCTTCA CGGCGATCTA TCCGGGCGGG CTTTCCTCCG GTACGAGCTT CTACGACCGG AACTCCACCA TGTACTACCC CCACGATCTC GAAGGTGCGA AGGCCCTTCT TGAAAAGGTT GGACTGAAGG ACACCGACGG CAACGGCTTC GTCAATTTTC CGGCCGACAA ACTTGGCGGC CGCGACGTCG AAATCGTGTT GCTCGTCAAT TCCGACTACA GCACCGACCG AAATCTTGCC GAAGGCATCG TCGGACAGAT GGAAAAGCTC GGGCTCCGGA TCGTTCTGAA TGCGCTCGAC GGCAAGCAGC GGGACGCGGC AAATTATGCG GGCCGCTTCG ATTGGATGAT TCATAGAAAC ACGGCGGAAT TCGCTTCCGT TGTGCAGAAC ACGCCTCAGC TCGCCCCGAC CGGCCCGCGC ACCAGCTGGC ATCACCGCGC TCCGGAAGGC GGCGAGGTCG ACGCCTTGCC CCACGAGACG GAGCTCGTCG ACATCGTCAA CAAGTTCATC GCCAGCAACG ACAATGACGA GCGCGCAGAG CTGATGAAGC AGTACCAGAA GGTGGCGACC ACGAATGTCG ATACCGTCGG CCTGACGGAA TATCCGGGCG CGCTGATCAT CAACAAGCGC TTCTCGAACA TTCCTCCGGG CGCGCCGATC TTCATGTTCA ACTGGGCGGA AGACACGATC ATCCGCGAAA GGGTCTTCGT TGCCGCCGAC AAGCAGGGCG ATTTTGAGCT CTATCCCGAG CAGCTTCCCG GCAAGCCCGG CGAAAGCGGC CCGATCAACT GA
|
Protein sequence | MKTHRSNRTA YLLIGISAFA VQAFASEPTI VPEQPPFPAQ GKITYVSRES ILEFKALPEY REPEWVTEKF VKAGKLPPLS ERLPKEPMVF KAGNMPDGMG VYGDVMRHVI GGRPEGWNYS AGQTQGWGGI DIGMFECLTR TAPLFQVEAD DMEPLPNLAK SWEWSEDGHK LTMHLIEGAK WSDGDPFDAE DVMFYWEDNV LDPSVSPLNG ATPETFGEGT TLKEVDKYTV EWTFKEAFPR QHLFAMAYGT FCPGPSHILK TKHPKYAGTT YNEYKNGFPA EYLNLPVMGG WVPVAYRPDD IIVLRRNPYY WKVDEAGNQL PYLNELHYKL STWADRDVQA IAGSGDFSNL EQPENFVESL KRAANESAPA RLAFGPRVIG YNMHMNFSGN GWGDPDERAK EVRELNRSLD FRKAVTMAVD RKKLGEALVK GPFTAIYPGG LSSGTSFYDR NSTMYYPHDL EGAKALLEKV GLKDTDGNGF VNFPADKLGG RDVEIVLLVN SDYSTDRNLA EGIVGQMEKL GLRIVLNALD GKQRDAANYA GRFDWMIHRN TAEFASVVQN TPQLAPTGPR TSWHHRAPEG GEVDALPHET ELVDIVNKFI ASNDNDERAE LMKQYQKVAT TNVDTVGLTE YPGALIINKR FSNIPPGAPI FMFNWAEDTI IRERVFVAAD KQGDFELYPE QLPGKPGESG PIN
|
| |