Gene Smed_6129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6129 
Symbol 
ID5320431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1053805 
End bp1055415 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content60% 
IMG OID640777763 
Productmonophenol monooxygenase 
Protein accessionYP_001314695 
Protein GI150378100 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.174842 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.452911 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACGCA CGCGATTTCT GACGGTTTCA CGCCGCTCTT TCGTCAAAGG CGCAACCGCT 
GCCGCCGGCA CAGCCCTTTT CGCACCAAGC ATTCTGCGGG CGGCGACCAA GCATAGGCGC
AAGAACATGA CCAGCGCCGA TGGGCAGAAG GACTTGCAAA GTTATATGGA TGCCGTGACC
GCGATGCTGA AGCTTCCGCC TTCTGATCCG CGCAACTGGT ACCGCAATGG CTTCATCCAC
CTGATGGACT GCCCCCATGG CGACTGGTGG TTCACCAGCT GGCACCGCGG CTATCTTGGC
TATTTCGAAG AGACTTGCCG CGAACTGAGC GGCAATCCGG ATTTCGCCCT CCCATATTGG
GACTGGACGG CCAATCCCGA GGTTCTGCCG CCGCTGTTCG GCACGATTCT CGATCCCGTC
AACAGCTCCG CCTACATTCC CGACCACAAC CGCTTCCAGG ACATCATGCA GGAGCCGATC
AAGGCCTGTT GGGACAGTCT CAGCTCCGCC CAATTGCAGC AGCAGAACCT GCGCGGCTAT
CTGGATTTTG ATGCGCTATG GAGCGACGCA ATGGCGAGCT TCGCCAACCA GCCGAACGCC
CGCTTCCTGA CGGCGCAGAA TCCGAAACTC AATCCCGCCA CCCAAACCGC AGTCGACATC
GACACCATCA AGGCATCGCT GGCGCCAACA ACCTTCGCCA ACGACGCGGG CGCTCCGGGT
CTCGCTTTCA ACAGTCCGGT ATCGTCCAGC CACCAGGTGG CACCGGTCGG CTTCTCCATT
CTTGAAGGCC AGCCGCATAA CCGCGTCCAT ATGAGCGTCG GCGGCCAGAG CGCTCCCTAT
GGGCTGATGT CACAGAACCT GTCACCGCTC GACCCGATCT TCTTTCTGCA TCATTGCAAC
ATCGATCGGC TGTGGGATGT CTGGACCCGC AAGCAGCAGG CGATGGGCCT GCCCGTCGGG
CCAACGGCTG ACCAACAGAC GCAGTACGAT CCGGAACCCT ATCTCTTTTA TGTCAACGCT
GACGGTAGCC CGGTCAGCGA TAAGACCAGG GCCGCCGACT ATCTCGCAAT CGGCGCCTTT
GACTATGATT ATGAGCCCGG CAGCGGCGAC GAGGTGATCC CGGTTGCAAC CGCCGGCCGC
TCGGCCCCCA TTCCGGCATT GGAAGCAGCC GTGCCTGCGT CCGCGGCCGT GGCCATAAAC
AAACCGGCGA CTGCCAAGCT CACCGTTTCG CAGGAGCTCG TGGATGTTGC CGCGAAGCCT
TCGGAACAGT CGCGTCAATT CGCTAAGGTC AGCATCGCCC CGCCCATGGA CGTTGGCGGC
CTGAATTTCC TCGTTTTCAT TTCCCCCGAG GGAACGACGC CTGATCTCAA CCCGGACGGA
CCGGATTTCG CCGGCAGTTT CGAGTTCTTC GGTGTTCGTC ATCATCATAC CGACACGGTC
AGCTTCACCA TACCGATCGA CAAGGCGCTC GATAGGCTGA TCGACGATGG TAGGCTCAAA
GCGGGCGAAC CGATCGACTT CAACGTTGTG GTAGCGCAAG CGGGCAAACG TATCGAAGGC
AGCATGCCGG CGGAGGCAAG GCTGACCGAC ATTCAGGTTG GGTCGTTCTG A
 
Protein sequence
MTRTRFLTVS RRSFVKGATA AAGTALFAPS ILRAATKHRR KNMTSADGQK DLQSYMDAVT 
AMLKLPPSDP RNWYRNGFIH LMDCPHGDWW FTSWHRGYLG YFEETCRELS GNPDFALPYW
DWTANPEVLP PLFGTILDPV NSSAYIPDHN RFQDIMQEPI KACWDSLSSA QLQQQNLRGY
LDFDALWSDA MASFANQPNA RFLTAQNPKL NPATQTAVDI DTIKASLAPT TFANDAGAPG
LAFNSPVSSS HQVAPVGFSI LEGQPHNRVH MSVGGQSAPY GLMSQNLSPL DPIFFLHHCN
IDRLWDVWTR KQQAMGLPVG PTADQQTQYD PEPYLFYVNA DGSPVSDKTR AADYLAIGAF
DYDYEPGSGD EVIPVATAGR SAPIPALEAA VPASAAVAIN KPATAKLTVS QELVDVAAKP
SEQSRQFAKV SIAPPMDVGG LNFLVFISPE GTTPDLNPDG PDFAGSFEFF GVRHHHTDTV
SFTIPIDKAL DRLIDDGRLK AGEPIDFNVV VAQAGKRIEG SMPAEARLTD IQVGSF