Gene Smed_4875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4875 
Symbol 
ID5318922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1380029 
End bp1381303 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content62% 
IMG OID640776660 
Productphosphonoacetate hydrolase 
Protein accessionYP_001313592 
Protein GI150376996 
COG category[R] General function prediction only 
COG ID[COG1524] Uncharacterized proteins of the AP superfamily 
TIGRFAM ID[TIGR02335] phosphonoacetate hydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.546493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGA TGTCCAAAAT TTCGGTAACG GTTAATGGCC GCCGCTATCC ATGGCCCCGC 
GTTCCGGCAA TTGCGATCTG CCTCGACGGG TGCGAGCCCG CCTATCTCGA TGCGGCGATG
GAGGCGGGAT TGATGCCCGC CCTCAAGCGC ATCAAGGAAC GCGGTGCCGT GCGGCTGGCG
CACAGCGTCA TTCCGAGCTT CACAAATCCC AACAATCTTT CGATCGCAAC GGGAACTCCG
CCGGCCGTCC ACGGGATCTG CGGCAACTAC CTTTACGAGC CTGCGACGGG CGAAGAGGTG
ATGATGAACG ACCCGAAGTT CCTTCGCGCG CCGACCATTT TCCAGGCCTT CTACGACGCC
GGCGCGCGGG TCGCGGTCGT CACGGCGAAG GATAAGCTTC GCGCGCTTCT CGGAAAGGGG
CTTCGTTTCG ACGAAGAACG CGCGATCTGC TTCTCGTCTG AAAAATCGGA CAAGGCGACG
CCCGAGGAAC ATGGCATCGA CAATGCCTCG GGCTGGCTCG GCCGGCCGGT TCCGGAAGTC
TATTCGGCAG CGCTTTCGGA ATTCGTCTTC GCTGCGGGCG TGAAACTGCT CAAGGAGTTC
CACCCCGACA TCATGTATCT GACCACGACG GACTATGTTC AGCACAAATA CGCTCCCGGC
GTAGCGGAGG CCAACGCCTT TTACGCGATG TTCGATCGCT ACCTGTCAGA GCTGGACGGA
CTAGGAGCGG CCATCGTCGT CACCGCTGAT CACGGCATGA AGCCGAAGCA CAAGGCGGAC
GGCTCGCCCG ACGTCATCTA TGTCCAGGAC CTGCTCGATG AGTGGTTGGG CGAGGAAGCG
GCCCGTGTCA TCCTGCCGAT CACCGATCCA TATGTCGTGC ACCACGGCGC GCTCGGATCC
TTCGCCACCG CCTATCTCCC CGGGCAATGC GACCGCGGAG AGATCATCGA TCGGCTAAAG
GCGCTCGATG GCATCGACGT CGTGCTGGAC CGCGAAGAGG CATGCCGAAG GTTTGAGCTC
CCGGAGGATC GCATCGGCGA CATTGTCCTC GTTTCGTCCG AAAACAAGAC ACTCGGCACC
AGCGAGCACC GCCACGACCT GGCGGCACTG GACGAGCCGT TGCGCTCCCA TGGCGGCCTC
ACAGAGCAGG AGGTTCCCTT CATCGTCAAC CGGGTGCTGC CCGATCTCCC GGGCGCTCCT
TGTCTTCGCA ACTTCGATGC GTTCTTCTAC GCTGTTGCGG CAGCGGCGTT GATCAGTGCG
GAGGACGCGA TATGA
 
Protein sequence
MNQMSKISVT VNGRRYPWPR VPAIAICLDG CEPAYLDAAM EAGLMPALKR IKERGAVRLA 
HSVIPSFTNP NNLSIATGTP PAVHGICGNY LYEPATGEEV MMNDPKFLRA PTIFQAFYDA
GARVAVVTAK DKLRALLGKG LRFDEERAIC FSSEKSDKAT PEEHGIDNAS GWLGRPVPEV
YSAALSEFVF AAGVKLLKEF HPDIMYLTTT DYVQHKYAPG VAEANAFYAM FDRYLSELDG
LGAAIVVTAD HGMKPKHKAD GSPDVIYVQD LLDEWLGEEA ARVILPITDP YVVHHGALGS
FATAYLPGQC DRGEIIDRLK ALDGIDVVLD REEACRRFEL PEDRIGDIVL VSSENKTLGT
SEHRHDLAAL DEPLRSHGGL TEQEVPFIVN RVLPDLPGAP CLRNFDAFFY AVAAAALISA
EDAI