Gene Smed_5278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5278 
Symbol 
ID5319580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp238511 
End bp241849 
Gene Length3339 bp 
Protein Length1112 aa 
Translation table11 
GC content61% 
IMG OID640777055 
Productpeptidase domain-containing protein 
Protein accessionYP_001313987 
Protein GI150377392 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACC CTAGCCAGAT CAGCGACAAT TGGTGGATCA ACGATCAGAC GCCGGATAAT 
CCTTATGGAA AGAGCGAGGA TTACGAGTCC TTTCTCGACT ATCTTGGCGC ATTACCCCGC
AGCCTGACGC CGGAAATGGC TGCGGGAACG CTGAGGATCA ACGTTTACGA GCTCAACGAT
CACCCGGAAT ACAAGGCGGC TGCGATCGGC GCCCTGCAGA TGTGGGCGTC TGTCACCCCC
CTGAAATTCG AGGTCGTCGA CGACACGCCG TTCGACAGCT CAACGGACTG GATGGAGGTC
GTCAGCCCCG AACTCGGTGA AGAGGACGAC GGCAGCGCCT ATTCGAGCAA TCGTTATGTC
AGCATCGGCC AGCGTTTCCA CGACACCGAA CCCAACAAGA CGGATATCGG CGGTTATGTC
TTCGATTCCT TCATCCATGA ATTCGGCCAT GAATTCGGGC TGAACCATCC GGGGCTCTAC
AATTACAGCG GTCCCGGCGG CGTGCAGATC AACTATCTGA ACAATGCGAC CTGGACCTAT
GATCGCCAGC AATACAGCGT GATGTCCTAT TTCGACGGAA TCGATGTCGG GGAAACCAGC
CGCTGGTCTG CCTCCACGCC GCTCATGGCC GACATCGAGG CCGTCATCCG CCGCTTTTTC
TCGACCGTCG ACGAGAATGG AGTGCGGACC TATCAGCATA TCGAGCTCAA CACGGGAGAC
AATGTCTACG GCTTCGGGAG CACGCAGTAC GGCTACCAGC TGACCTCCTC GGGCATGCAG
CATGACATCG GCTTTGCGAT CCATGACACG GGTGGGGCCG ACACGATCGA CTTCTCCGGC
TCGACTGCGG GTACCATTCT CGACTTGCGT GCCGGACAGT TCTCCAGCGT CAATGGGCAC
AGCAACAACG TGTCGATCTT CGCCGGACAC AATGCGGATG AGACCGACTA TTACATTGAT
AACGGCATCG GCAGCAGGTT TGACGACATC CTGATCGGCA ATGACGGCGC CAACACGCTG
GATGGCCGCG GCGGCGGCGA CCGCATGGCA GGCAACGGCG GTGACGACAC ATATTTCGTC
GATTCAATTG ACGATATCGT TCGTGAGGAG GCCAACGGCG GCAACGACAC GGTCATCCTC
CTGTCCAGGC ACCTGAAAAT CAGGAAAATT GCCAACGTCG AAAACATCAT CTATGCCGAC
GAGACGACGA CGCAGCCTGG CGGTGGCGCC GAGACCCCGC CCAGCGCCGG CGATAACACG
ATGACCAGCA TCATCTTCGG CGGTAGCGGA AATGACATTC TCGACGGCGG CGCCGGTGAC
GACACGATCT TTGGCCAGGG AGGCGACGAC CTGATCATAG GCGGCCGCGA CTCGCTCGCC
AGCCGCGACA TCAACAATAC GATCGACGTC GAAGACCTCG AAGACCAGAC CGAGAGCGAC
GACGGGAACG ATACGCTCTA CGGCGGCGGT GGCAACGACA CCATCCTTGG CGGGCAGGGC
AACGACATTC TCGACGGCGG CGCCGGAGAC GATACGCTGA GCGGTCAGGA CGGCATAGAT
ATATTCAGGG GTGGCGAGGG CACCGATACC GTCGATTTCA GCAAGGAGAG CCCGTTCCAG
CTGCTCGTCA ACCTTGCCAC GAATGTCGCC AGCGGCGGAA CCGCCTCGGG CGACACTTTC
TACAGCATCG AGAACCTGAT CGGATCCGAC GATCGCATCG ACCGGTTCAT CGGCACATCC
GCCGCAAACC ACTTCTGGGG GCAGGGCGGC GGCGATTATT TCAATGGCGG CGGCGGCAAC
GACACCCTGG ACGGCGGCAA TGACGGCGAC ATCCTTTATG GCGAAGCCGG CAACGACACG
ATCATCGGTG GCGCCGGCCA AGACTATCTG GACGGTGGTT CCGGTGTCGA CACGGTCGTC
TACGCGGGCA GCCCCGATGG TGTGACGATC GATCTTGCCG AGGGCACGGC CTGCGGTGGC
GACGGTGACG GTCCGGTGCA AATCGTCGGT CGGGGCGCGG TTATCCGGCA CGATATTCTC
GCCGGCTTTG AAAATGCCGT CGGTTCGTCA TTTGACGACC ATCTCATCGG CAATGCCCTG
GCCAACGAGC TTTCCGGCGG CGCCGGCGAC GACAGATTGA CCGGTGGAGG CGGCGCCGAC
AGATTGAACG GCGGCGCCGG CAGCGATACG GCAGACTACG CCGACGCGAC GAGCGGCGTC
AGACTGAGCC TCGCTGGACG CAAGTCCGGC GGAGATACAT ATGTCTCCAT CGAAAATCTT
GCCGGTTCGG GATTCAATGA CCGGCTGACC GGCGACGGTG CCGCCAACGT GTTGACCGGC
CAGGGCGGAA ACGACACGAT CGACGGCGGC CGCGGCGATG ACACGCTGCT CGGCGATTTC
GCCTATCAGG GCGACGCGCC CCCACGCCCG GGAATGGGAA CCGGCTACGC CACGCTGGGA
CCGGACGCGA CCAACAATTC GATAGCGGCC GCGTTCGACA TCTCCGACAA CTTCTCCTTG
ACCAGCGATC CCGATATCTT CGATTCGACC ACGACCCTTC ATACGACCGT CAACGCCACC
GGAAATGGCC AAGGCGGATA CTATAAAATC GATCTGGCGG CCGGTACGGT CATTACGATC
GACATTGACG GCATTGCCGA TCCTGATGTC CATGACAGTT GGGTCAGGCT GCTCGACAGC
GCCGGCAACA TCGTTGCGCA GAACGATGAC GGCGGCGGCG ACCCCGGTTC TACCAGCAAC
CGGGATTCAA GCCTCGTATT CGTGGCTCAG GAGACCGGAA CCTATTACAT ATTGGAAGGC
AGCTGGTCAC CGACGGCGCC GGGCGATGGC TGGGCGGAGG CCGTGCCGGC AGGCTCGACA
TATGAGTTGA ACGTATCGGT CGAGTTTCCC CCTGCGCCGG CCCAGCCGGG CGTTTCCGGC
TCTGACACTC TCATTGGCGG CCGAGGCAGC GATCTGCTGG ATGGCGGCCT GGGGGCGGAC
ACGCTTATCG GCGGAGCGGG GGAAGATACG TTCCGCTTCT CGACGGCGCT TGGGAACGGC
AATGTCGACG GTATCAGGGA GTTCGATGTC ATCGACGATA CGATCCTGCT GGACAACCTC
ATCTTCGAGA GCGTGGGTGG CGACGGCGCT CTCGCCTTTG GCGCATTTTT CGGGAGCGCG
GGCGGCCTCG CCCATGATGC CGACGACCGC ATCATCTACG ATACCGATAG CGGCGTCCTG
TCCTATGACG CCGATGGCGC CGGCGAGATC GCGGCAATTC AGTATGCGCA GCTGAGCACA
AATCTGAAGC TATCGGCGGC CGACTTCATT ATTATTTAG
 
Protein sequence
MPNPSQISDN WWINDQTPDN PYGKSEDYES FLDYLGALPR SLTPEMAAGT LRINVYELND 
HPEYKAAAIG ALQMWASVTP LKFEVVDDTP FDSSTDWMEV VSPELGEEDD GSAYSSNRYV
SIGQRFHDTE PNKTDIGGYV FDSFIHEFGH EFGLNHPGLY NYSGPGGVQI NYLNNATWTY
DRQQYSVMSY FDGIDVGETS RWSASTPLMA DIEAVIRRFF STVDENGVRT YQHIELNTGD
NVYGFGSTQY GYQLTSSGMQ HDIGFAIHDT GGADTIDFSG STAGTILDLR AGQFSSVNGH
SNNVSIFAGH NADETDYYID NGIGSRFDDI LIGNDGANTL DGRGGGDRMA GNGGDDTYFV
DSIDDIVREE ANGGNDTVIL LSRHLKIRKI ANVENIIYAD ETTTQPGGGA ETPPSAGDNT
MTSIIFGGSG NDILDGGAGD DTIFGQGGDD LIIGGRDSLA SRDINNTIDV EDLEDQTESD
DGNDTLYGGG GNDTILGGQG NDILDGGAGD DTLSGQDGID IFRGGEGTDT VDFSKESPFQ
LLVNLATNVA SGGTASGDTF YSIENLIGSD DRIDRFIGTS AANHFWGQGG GDYFNGGGGN
DTLDGGNDGD ILYGEAGNDT IIGGAGQDYL DGGSGVDTVV YAGSPDGVTI DLAEGTACGG
DGDGPVQIVG RGAVIRHDIL AGFENAVGSS FDDHLIGNAL ANELSGGAGD DRLTGGGGAD
RLNGGAGSDT ADYADATSGV RLSLAGRKSG GDTYVSIENL AGSGFNDRLT GDGAANVLTG
QGGNDTIDGG RGDDTLLGDF AYQGDAPPRP GMGTGYATLG PDATNNSIAA AFDISDNFSL
TSDPDIFDST TTLHTTVNAT GNGQGGYYKI DLAAGTVITI DIDGIADPDV HDSWVRLLDS
AGNIVAQNDD GGGDPGSTSN RDSSLVFVAQ ETGTYYILEG SWSPTAPGDG WAEAVPAGST
YELNVSVEFP PAPAQPGVSG SDTLIGGRGS DLLDGGLGAD TLIGGAGEDT FRFSTALGNG
NVDGIREFDV IDDTILLDNL IFESVGGDGA LAFGAFFGSA GGLAHDADDR IIYDTDSGVL
SYDADGAGEI AAIQYAQLST NLKLSAADFI II