Gene Smed_5167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5167 
Symbol 
ID5319469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp119261 
End bp120853 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content61% 
IMG OID640776945 
Product4-phytase 
Protein accessionYP_001313877 
Protein GI150377282 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.978534 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.719428 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAAA GACTTACACT GGCGGCAATG CTCAGCCTCG GCGTGGCTGG CGGGGCGCTT 
GCGGCGGGCG AACGCCACGG CGGGACCCTT GTCTTCACCG CCCCCTACGG TTCGAGCTTC
GCCACGCTCG ACGTGCAGTC GAGCCCCAAC ACGCAGGAAG AATTCATCAC CCAGGCCATC
CACCGCGCGC TCTACAGCTG GGACGCGGTC CAGAACAAGC CGGTGCTGGA ACTGGCGACC
TCGGAAGACG TTTCCGAGGA CGGCACGGTC CACACCTATC ACCTGCGTGA AAACGCCGTA
TTCCACAACG GCAAGCCGCT GACAGCCGAC GACATCATCT ACAGCTACAA ACGAATTGCC
AATCCGAAGA ACGCCTTTCC CGGCGCGAGC TTCATTGCGG TCATCAAGGG GGCGGAGGAC
TATACCGCCG GCAAGGCAGA CGAGATTTCG GGTCTGAAGA AGATCGACGA TCACACGCTG
GAGATCACCT ATGCCAGCAC GATCAATCCC GGCTTCCCGC TGATGCAGAA CACGACTGTC
ATCTATCCGT CCGATGTCGA GGACGAAGCG GAGTTCGGCA AAAACCCCGT CGGCCTCGGC
GCTTTCGTCT TCAAGGAGCA CGTACCCGGC TCCCGGCTCG TCGTGGAAAA GTTCGACAAA
TATTATGAGG AGGGCAAACC CTATCTCGAC CGCATCAATA TCGTGTTGAT GGCCGAGGAC
GCCGCGCGCG ACATCGCCTT CCGCAACAAG GAGATCGATG TTTCGATTCT CGGGCCCACC
CAGTACCAGG CCTATCAGGG CGAGGACGGA TTGAAGGATC ACCTCCTGGA GGTCGCAGAG
GTCTATACCC GTAATATAGG CTTCAATCCC GCCTTCGAAC CCTTCAAGGA CAGGCGGGTG
CGCCAGGCGA TCAACCATGC CATCAACGCG CCGCTGATCA TCGAGCGCCT GGTCAAGAGC
AAAGCCTATC CCGCCTCGGG ATGGGTGCCG ATTTCCTCGC CGGCCTTCGA CAACGACAAG
GCGCCCTACG CCTACGATCC GGACAAGGCG CAGGCGCTGC TCGCCGAGGC CGGCTACGCG
GACGGCTTCG AATTCGAGGT GACGGCCAGC CCGAACGAAA GCTGGGGCGT GCCGATCGTC
GAAGCCATCC TGCCCATGCT GAAGAAGGTC GGCATCACCG TGAAGCCGAA GCCGGTCGAA
AGCTCGGCAC TCGGCGAGGC GGTGACGACC AACAACTTCC AGGCCTTCAT CTGGTCGAAC
CTTTCCGGCC CGGATCCGCT GAACGCGCTG CGCTGCTACT ATTCAAAGAC GCCGCAATCC
GCCTGCAACT ACACGAGCTA TGCGAGCCCG GAATTCGACA AGCTCTATGA GGCGGCCAAA
CAGGAGCGTG ACCCGGCCAA GCAGAACGAT CTCCTGCGCC AGGCCAACAA TATCGTGCAG
GACGACGCGC CGGTCTGGTT CTTCAACTAC AACAAGGCGG TGATCGCCTA CCAGCCGTGG
GTCCACGGCC TCGTTCCGAA CGCGACGGAA CTGGCGATCC AGCCCTATGA CGAGATCTGG
ATCGACGATA AGGCACCGGC CTCGCGCCAG TAA
 
Protein sequence
MLKRLTLAAM LSLGVAGGAL AAGERHGGTL VFTAPYGSSF ATLDVQSSPN TQEEFITQAI 
HRALYSWDAV QNKPVLELAT SEDVSEDGTV HTYHLRENAV FHNGKPLTAD DIIYSYKRIA
NPKNAFPGAS FIAVIKGAED YTAGKADEIS GLKKIDDHTL EITYASTINP GFPLMQNTTV
IYPSDVEDEA EFGKNPVGLG AFVFKEHVPG SRLVVEKFDK YYEEGKPYLD RINIVLMAED
AARDIAFRNK EIDVSILGPT QYQAYQGEDG LKDHLLEVAE VYTRNIGFNP AFEPFKDRRV
RQAINHAINA PLIIERLVKS KAYPASGWVP ISSPAFDNDK APYAYDPDKA QALLAEAGYA
DGFEFEVTAS PNESWGVPIV EAILPMLKKV GITVKPKPVE SSALGEAVTT NNFQAFIWSN
LSGPDPLNAL RCYYSKTPQS ACNYTSYASP EFDKLYEAAK QERDPAKQND LLRQANNIVQ
DDAPVWFFNY NKAVIAYQPW VHGLVPNATE LAIQPYDEIW IDDKAPASRQ