Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5167 |
Symbol | |
ID | 5319469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 119261 |
End bp | 120853 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776945 |
Product | 4-phytase |
Protein accession | YP_001313877 |
Protein GI | 150377282 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.978534 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.719428 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAA GACTTACACT GGCGGCAATG CTCAGCCTCG GCGTGGCTGG CGGGGCGCTT GCGGCGGGCG AACGCCACGG CGGGACCCTT GTCTTCACCG CCCCCTACGG TTCGAGCTTC GCCACGCTCG ACGTGCAGTC GAGCCCCAAC ACGCAGGAAG AATTCATCAC CCAGGCCATC CACCGCGCGC TCTACAGCTG GGACGCGGTC CAGAACAAGC CGGTGCTGGA ACTGGCGACC TCGGAAGACG TTTCCGAGGA CGGCACGGTC CACACCTATC ACCTGCGTGA AAACGCCGTA TTCCACAACG GCAAGCCGCT GACAGCCGAC GACATCATCT ACAGCTACAA ACGAATTGCC AATCCGAAGA ACGCCTTTCC CGGCGCGAGC TTCATTGCGG TCATCAAGGG GGCGGAGGAC TATACCGCCG GCAAGGCAGA CGAGATTTCG GGTCTGAAGA AGATCGACGA TCACACGCTG GAGATCACCT ATGCCAGCAC GATCAATCCC GGCTTCCCGC TGATGCAGAA CACGACTGTC ATCTATCCGT CCGATGTCGA GGACGAAGCG GAGTTCGGCA AAAACCCCGT CGGCCTCGGC GCTTTCGTCT TCAAGGAGCA CGTACCCGGC TCCCGGCTCG TCGTGGAAAA GTTCGACAAA TATTATGAGG AGGGCAAACC CTATCTCGAC CGCATCAATA TCGTGTTGAT GGCCGAGGAC GCCGCGCGCG ACATCGCCTT CCGCAACAAG GAGATCGATG TTTCGATTCT CGGGCCCACC CAGTACCAGG CCTATCAGGG CGAGGACGGA TTGAAGGATC ACCTCCTGGA GGTCGCAGAG GTCTATACCC GTAATATAGG CTTCAATCCC GCCTTCGAAC CCTTCAAGGA CAGGCGGGTG CGCCAGGCGA TCAACCATGC CATCAACGCG CCGCTGATCA TCGAGCGCCT GGTCAAGAGC AAAGCCTATC CCGCCTCGGG ATGGGTGCCG ATTTCCTCGC CGGCCTTCGA CAACGACAAG GCGCCCTACG CCTACGATCC GGACAAGGCG CAGGCGCTGC TCGCCGAGGC CGGCTACGCG GACGGCTTCG AATTCGAGGT GACGGCCAGC CCGAACGAAA GCTGGGGCGT GCCGATCGTC GAAGCCATCC TGCCCATGCT GAAGAAGGTC GGCATCACCG TGAAGCCGAA GCCGGTCGAA AGCTCGGCAC TCGGCGAGGC GGTGACGACC AACAACTTCC AGGCCTTCAT CTGGTCGAAC CTTTCCGGCC CGGATCCGCT GAACGCGCTG CGCTGCTACT ATTCAAAGAC GCCGCAATCC GCCTGCAACT ACACGAGCTA TGCGAGCCCG GAATTCGACA AGCTCTATGA GGCGGCCAAA CAGGAGCGTG ACCCGGCCAA GCAGAACGAT CTCCTGCGCC AGGCCAACAA TATCGTGCAG GACGACGCGC CGGTCTGGTT CTTCAACTAC AACAAGGCGG TGATCGCCTA CCAGCCGTGG GTCCACGGCC TCGTTCCGAA CGCGACGGAA CTGGCGATCC AGCCCTATGA CGAGATCTGG ATCGACGATA AGGCACCGGC CTCGCGCCAG TAA
|
Protein sequence | MLKRLTLAAM LSLGVAGGAL AAGERHGGTL VFTAPYGSSF ATLDVQSSPN TQEEFITQAI HRALYSWDAV QNKPVLELAT SEDVSEDGTV HTYHLRENAV FHNGKPLTAD DIIYSYKRIA NPKNAFPGAS FIAVIKGAED YTAGKADEIS GLKKIDDHTL EITYASTINP GFPLMQNTTV IYPSDVEDEA EFGKNPVGLG AFVFKEHVPG SRLVVEKFDK YYEEGKPYLD RINIVLMAED AARDIAFRNK EIDVSILGPT QYQAYQGEDG LKDHLLEVAE VYTRNIGFNP AFEPFKDRRV RQAINHAINA PLIIERLVKS KAYPASGWVP ISSPAFDNDK APYAYDPDKA QALLAEAGYA DGFEFEVTAS PNESWGVPIV EAILPMLKKV GITVKPKPVE SSALGEAVTT NNFQAFIWSN LSGPDPLNAL RCYYSKTPQS ACNYTSYASP EFDKLYEAAK QERDPAKQND LLRQANNIVQ DDAPVWFFNY NKAVIAYQPW VHGLVPNATE LAIQPYDEIW IDDKAPASRQ
|
| |