Gene Smed_6429 details

Gene Information

Locus tag: Smed_6429
Symbol:
ID: 5320732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009622
Strand:
Start bp: 84145
End bp: 85755
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 60%
IMG OID: 640778005
Product: monophenol monooxygenase
Protein accession: YP_001314937
Protein GI: 150378343
COG category:
COG ID:
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
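
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 84145
end_bp = 85755
gene_length_bp = 1611
protein_length_aa = 536


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions both comparisons hold: 85755 - 84145 + 1 = 1611 bp, and 1611 / 3 - 1 = 536 aa.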


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACGCA CGCGATTTCT AACCGTTTCA CGCCGCTCTT TCGTCAAAGG CGCAACCGCT 
GCCGCCGGTA CAGCCCTTTT CGCACCAAGC ATTCTGCGGG CGGCGACCAA GCATAGGCGC
AAGAACATGA CCAGCGCCGA TGGGCAGAAG GACTTGCAAA GTTATATGGA TGCCGTGACC
GCGATGCTGA AGCTTCCGCC TTCGGATCCG CGCAACTGGT ACCGCAATGG CTTCATCCAC
CTGATGGACT GCCCCCATGG CGACTGGTGG TTCACCAGCT GGCACCGCGG CTATCTTGGC
TATTTCGAAG AGACTTGCCG CGAACTGAGC GGCAATCCGG ATTTCGCCCT CCCATATTGG
GACTGGACGG CCAATCCCGA GGTTCTGCCG CCGCTGTTCG GCACGATTCT CGATCCCGTC
AACAGCTCCG CCTACATTCC CGACCACAAC CGCTTCCAGG ACATCATGCA GGAGCCGATC
AAGGCCTGTT GGGACAGTCT CAGCTCCGCC CAATTGCAGC AGCAGAACCT GCGCGGCTAT
CCGGATTTTG ATGCGCTATG GAGCGACGCA ATGGCGAGCT TCGCCAACCA GCCGAACGCC
CGCTTCCTGA CGGCGCAGAA TCCGAAACTC AATCCCGCCA CCCAAACCGC AGTCGACATC
GACACCATCA AGGCATCGCT GGCGCCAACA ACCTTCGCCA ACGACGCGGG CGCTCCGGGT
CTCGCTTTCA ACAGTCCGGT ATCGTCCAGC CACCAGGTGG CACCGGTCGG CTTCTCCATT
CTTGAAGGCC AGCCGCATAA CCGCGTCCAT ATGAGCGTCG GCGGCCAGAG CGCTCCCTAT
GGGCTGATGT CACAGAACCT GTCACCGCTC GACCCGATCT TCTTTCTGCA TCATTGCAAC
ATCGATCGGC TGTGGGATGT CTGGACCCGC AAGCAGCAGG CGATGGGCCT GCCCGTCGGG
CCAACGGCTG ACCAACAGAC GCAGTACGAT CCGGAACCCT ATCTCTTTTA TGTCAACGCT
GACGGTAGCC CGGTCAGCGA TAAGACCAGG GCCGCCGACT ATCTCGCAAT CGGCGCCTTT
GACTATGATT ATGAGCCCGG CAGCGGCGAC GAGGTGATCC CGGTTGCAAC CGCCGGCCGC
TCGGCCCCCA TTCCGGCATT GGAAGCAGCC GTGCCTGCGT CCGCGGCCGT GGCCATAAAC
AAACCGGCGA CTGCCAAGCT CACCGTTTCG CAGGAGCTCG TGGATGTTGC CGCGAAGCCT
TCGGAACAGT CGCGTCAATT CGCTAAGGTC AGCATCGCCC CGCCCATGGA CGTTGGCGGC
CTGAATTTCC TCGTTTTCAT TTCCCCCGAG GGAACGACGC CTGATCTCAA CCCGGACGGA
CCGGATTTCG CCGGCAGTTT CGAGTTCTTC GGTGTTCGTC ATCATCATAC CGACACGGTC
AGCTTCACCA TACCGATCGA CAAGGCGCTC GATAGGCTGA TCGACGATGG TAGGCTCAAA
GCGGGCGAAC CGATCGACTT CAACGTTGTG GTCGCGCAAG CGGGCAAACG TATCGAAGGC
AGCATGCCGG CGAAGGCACA GCTGACCGAC ATTCAGGTTG GGTCGTTCTG A
 
Protein sequence
MTRTRFLTVS RRSFVKGATA AAGTALFAPS ILRAATKHRR KNMTSADGQK DLQSYMDAVT 
AMLKLPPSDP RNWYRNGFIH LMDCPHGDWW FTSWHRGYLG YFEETCRELS GNPDFALPYW
DWTANPEVLP PLFGTILDPV NSSAYIPDHN RFQDIMQEPI KACWDSLSSA QLQQQNLRGY
PDFDALWSDA MASFANQPNA RFLTAQNPKL NPATQTAVDI DTIKASLAPT TFANDAGAPG
LAFNSPVSSS HQVAPVGFSI LEGQPHNRVH MSVGGQSAPY GLMSQNLSPL DPIFFLHHCN
IDRLWDVWTR KQQAMGLPVG PTADQQTQYD PEPYLFYVNA DGSPVSDKTR AADYLAIGAF
DYDYEPGSGD EVIPVATAGR SAPIPALEAA VPASAAVAIN KPATAKLTVS QELVDVAAKP
SEQSRQFAKV SIAPPMDVGG LNFLVFISPE GTTPDLNPDG PDFAGSFEFF GVRHHHTDTV
SFTIPIDKAL DRLIDDGRLK AGEPIDFNVV VAQAGKRIEG SMPAKAQLTD IQVGSF
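
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked_text):
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked_text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGACACGCA CGCGATTTCT AACCGTTTCA CGCCGCTCTT TCGTCAAAGG CGCAACCGCT"
dna = clean_sequence(first_line)

print(len(dna))
print(round(gc_fraction(dna), 2))
```

Running the same two functions over the full gene sequence would allow a comparison with the reported gene length and GC content.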