Gene Smed_3972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3972 
Symbol 
ID5319070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp422990 
End bp424867 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content62% 
IMG OID640775781 
Productamidohydrolase 3 
Protein accessionYP_001312714 
Protein GI150376118 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG ATCTGATCCT GCACCGTGGT CTGGTGACCA CGCTGGACGC CGCGAAACCG 
AACGCCACGG CCATCGCCGT CAAGGACGGC AGGTTCCTTG CCGTCGGGCT GGACCAAGAG
GTCATGGCGC TTGCCGGGCC GGACACGAAG GTCGTCGACC TGAAGGGCAA GCGCGTCCTG
CCTGGCCTCA ATGACAACCA TACCCACGTT GTGCGCGGTG GGCTTAATTT CAATATGGAA
CTGCGCTGGG ACGGCGTGCG CTCTCTCGCC GACGCCATGA ACATGCTGAA GCGGCAAGTG
GCGATCACGC CGCCTCCGCA ATGGGTGCGC GTCGTCGGCG GCTTCACCGA ACATCAGTTC
ATTGAAAAGC GGCTTCCGAC GATCGACGAG ATCAACGCGA TCGCCCCCGA CACGCCGGTG
TTCCTCCTGC ACCTTTATGA CCGGGCACTG CTCAACGGCG CGGCATTGCG CGCGGTTGGT
TACGGCAAGG ACACGCCGGA TCCGCCCGGA GGTGAGATCA CCCGGGACGC CAGGGGCAAT
CCGACCGGGC TGCTCATCGC CAAGCCGAAT GCGGGCATTC TCTATTCGAC GCTTGCCAAG
GGACCGAAGC TTCCATTGGA ATATCAGGTC AATTCGACTC GGCACTTCAT GCGCGAGCTC
AATCGGCTCG GGATCACCAG CGTCATCGAC GCGGGCGGCG GCTTCCAGAA CTATCCAGAC
GACTACGCCG TCATTCAGAA GCTCGCCGAC GAAGGGCAGA TGACGGTGAG ACTTGCCTAC
AACCTCTTCA CCCAGAAGCC GAAGGAGGAG AAGGAAGACT TTCTCAAGTG GACGTCCTCC
GTAAAATACA AGCAGGGCGA TGACTTCTTC CGTCACAACG GCGCTGGCGA AATGCTCGTC
TTCTCTGCGG CAGACTTCGA GGATTTCCGC GAACCGCGCC CGGAAATGGC GCCCGAAATG
GAAGGTGAGC TTGAGGAAGT CGTGCGAATT CTAGCGGAGA ACCGCTGGCC CTGGCGCCTG
CACGCGACAT ACGACGAGAC GATCAGCCGT GCGCTCGACG TCTTCGAGAA AGTCAACCAG
GACATCCCGC TTGCCGGCAT CAACTGGTTC TTCGACCATG CCGAGACGAT CTCGGACAGC
TCAATCGACC GCATCGCGGC GCTGGGCGGC GGCATAGCGG TACAGCATCG CATGGCCTAT
CAGGGCGAGT ATTTCGTCGA ACGCTATGGC CATGGTGCCG CCGAAGCTAC GCCGCCGGTC
GCGCGCATGC TGGATAAAGG CGTCAAGGTC TCCGCGGGCA CGGACGCCAC GCGCGTCGCC
TCCTACAATC CCTGGGTTTC GCTTTCCTGG CTGGTGACCG GTAAGACGGT CGGCGGTATG
CAGCTCTATC CGCGCGCCAA CTGCCTCGAC CGCGAAACGG CGCTGCGGAT GTGGACCGAG
AAAGTCCAAT GGTTTTCCAA CGAGGAGGGC CGGAAGGGCC GCATCGAAAA GGGGCAGCTC
GCCGACCTTA TCGTGCCGGC CAAAGACTAT TTCACCTGTG CCGAGGACGA GATCTCGTTT
CTGACTGCCG ATCTGACGAT GGTCGGGGGC AGGATCGTCT ATGCGGCAAA CGATTTCGCC
AGCCTCGACG AGAACCCTCT GCCGCCGGCG ATGCCCGACT GGTCGCCGGT AAGGAACTAT
GGCGGCTATG CGGCGTGGGG CGAACCGGAG GGCGCGGGAA GGCATTCGCT GAAGCGAACG
GCAATCGCGT CCTGCGGCTG CGCCAGCAAT TGCGGGGTCC ATGGGCACGA CCATGCCGGC
GCCTGGACAT CGAGACTGCC GGTTGCAGAC CTGAAAGGGT TCTTCGGCGC GCTTGGCTGC
TCTTGCTGGG CCGTGTGA
 
Protein sequence
MSADLILHRG LVTTLDAAKP NATAIAVKDG RFLAVGLDQE VMALAGPDTK VVDLKGKRVL 
PGLNDNHTHV VRGGLNFNME LRWDGVRSLA DAMNMLKRQV AITPPPQWVR VVGGFTEHQF
IEKRLPTIDE INAIAPDTPV FLLHLYDRAL LNGAALRAVG YGKDTPDPPG GEITRDARGN
PTGLLIAKPN AGILYSTLAK GPKLPLEYQV NSTRHFMREL NRLGITSVID AGGGFQNYPD
DYAVIQKLAD EGQMTVRLAY NLFTQKPKEE KEDFLKWTSS VKYKQGDDFF RHNGAGEMLV
FSAADFEDFR EPRPEMAPEM EGELEEVVRI LAENRWPWRL HATYDETISR ALDVFEKVNQ
DIPLAGINWF FDHAETISDS SIDRIAALGG GIAVQHRMAY QGEYFVERYG HGAAEATPPV
ARMLDKGVKV SAGTDATRVA SYNPWVSLSW LVTGKTVGGM QLYPRANCLD RETALRMWTE
KVQWFSNEEG RKGRIEKGQL ADLIVPAKDY FTCAEDEISF LTADLTMVGG RIVYAANDFA
SLDENPLPPA MPDWSPVRNY GGYAAWGEPE GAGRHSLKRT AIASCGCASN CGVHGHDHAG
AWTSRLPVAD LKGFFGALGC SCWAV