Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5794 |
Symbol | |
ID | 5320096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 765805 |
End bp | 767367 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640777499 |
Product | extracellular solute-binding protein |
Protein accession | YP_001314431 |
Protein GI | 150377836 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000436692 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000177852 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTTGAAGA AACTGGGCCT CGCGGCCACG ATACTGGTTG GCCTGACCGT GTCCGGCCAC GCCGAAACGC TTAAATGGGG CGCTGCGCGC GATATCTACT CGCTTGACCC CTATTCCTAT GGCGACAGCT ATACGCTGTC CTTCCTCAAT CATATCTACG AAGGCCTGGT GCGTTACGAC GCCAATCTCA AGATCGAACC GGCCCTCGCC GAATCCTGGG AAACGGTCTC CGATACCGTC TGGCGCTTTC ACCTGCGCAA GGGCGTAAAA TTTCACGATG GAGCAGAATT CACTGCCGAC GACGTGCTCG CATCGCTGAA GCGCGTCAGT GATCCGGATT CTCCGCTGCG CGGCAACCTG CCGGCTTACA AGTCCTCCAA GAAGGTGGAC GACTATACCA TCGACGTCGA GCTGAGCGGG CCCTATCCGC TGCTTCTCAA CGACCTCACC AATATCCATG TTTTCGATGC CGGTTGGCTG AAGACCAACA ATTCGGAAAA GCCGACCGAT GTCGGCGCGA AGATCGAAGG CTACGCCACC TATCACACCA ACGGCACCGG CCCGTTCAAG CTGGAAAGCC GGATGCCGGA TTCGAAGACG ATCCTCGTGA AAAACCCGGA CTGGTGGGAC AAGACCTCCA AGTCGAACAT CGACCGCATC GAGTTCACGC CCATCACCTC GGCGGCGACG CGCGTGGCCG CGCTCCTTTC CGGCGAAATC AATTTCACCG AAAACGCGCC GTCGCAGGAC CTGCCGCGTC TGTTGGCGCA GCCTGAGCTG AAGGTCATGG AACGCACCGA TCTGCGTACC GTAATGGTCG GCTTCAACCG CAAGCCCAAA CTCGCCAACG GCTCGGAGAA CAAGTTCAAC GATCTGCGTG TGCGCCAGGC TTTCGCCCAT GGGCTCGACC GGGAACTGAT CCAGAAGCGC GTCATGCGCG GCAAGTCGCG CACCGCAGGA GCGGTGGTCG CGCCGGAGAT TCCGGGATAC GCGCCCGAGT TGGACACCAC GCTCCCCTAC GATCCGGCCC TGTCGAAAAA GCTCCTGACC GAGGCAGGCG CTGCGGAATA TCCGTTCACG CTGGTCTGCA CGACCGATGC CTATGTGAAC GAGGAGGAGC TGTGCCAGGG GCTGGTGAAT ATGCTGAGCC GCGCCGGCTT CAAGCCGCAG CTCGACATTG CGCCCACCGC CGCACAGGCG CCGAAGCGTA CCAGCGGCAA GTCCGACGTT TATCTGATCG GCTGGGCCAC TGAACCGATG CTCGACAGCT ATTCGATCCT TCTCCAGATG ATGCAGACCA AGACGGCCAA TGCCGGCGTC TTCAACTGGG GCGGCTGGAG CTATCCGGAG ATCGACGAGC TGATCGTCCA GGCATCGACC GAAATGGATC GCGCCAAGCG TCTGGCGCTG CAGACCAAAG CGCTGCAGAT GGTGAAGGAA GAGATCGTGA TGCTGCCCCT GCACCAGCAG CCGATGGCCT GGGTCATGTC GAACAAGATC GACAAGATCG TGCAGTTGGC GGACAACAAG CCGCGTCACT GGCTGACGCA GTTTTCCGAG TAA
|
Protein sequence | MLKKLGLAAT ILVGLTVSGH AETLKWGAAR DIYSLDPYSY GDSYTLSFLN HIYEGLVRYD ANLKIEPALA ESWETVSDTV WRFHLRKGVK FHDGAEFTAD DVLASLKRVS DPDSPLRGNL PAYKSSKKVD DYTIDVELSG PYPLLLNDLT NIHVFDAGWL KTNNSEKPTD VGAKIEGYAT YHTNGTGPFK LESRMPDSKT ILVKNPDWWD KTSKSNIDRI EFTPITSAAT RVAALLSGEI NFTENAPSQD LPRLLAQPEL KVMERTDLRT VMVGFNRKPK LANGSENKFN DLRVRQAFAH GLDRELIQKR VMRGKSRTAG AVVAPEIPGY APELDTTLPY DPALSKKLLT EAGAAEYPFT LVCTTDAYVN EEELCQGLVN MLSRAGFKPQ LDIAPTAAQA PKRTSGKSDV YLIGWATEPM LDSYSILLQM MQTKTANAGV FNWGGWSYPE IDELIVQAST EMDRAKRLAL QTKALQMVKE EIVMLPLHQQ PMAWVMSNKI DKIVQLADNK PRHWLTQFSE
|
| |