Gene Smed_4058 details


Gene Information

Locus tag: Smed_4058
Symbol:
ID: 5318881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 517850
End bp: 521068
Gene Length: 3219 bp
Protein Length: 1072 aa
Translation table: 11
GC content: 63%
IMG OID: 640775865
Product: hemolysin-type calcium-binding region
Protein accession: YP_001312798
Protein GI: 150376202
COG category:
COG ID:
TIGRFAM ID:
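
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 517850
end_bp = 521068
gene_length_bp = 3219
protein_length_aa = 1072

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("span divisible by 3:", span_bp % 3 == 0)
print("expected protein length (aa):", expected_protein_aa)
```

Under these assumptions the computed span and protein length agree with the reported Gene Length and Protein Length fields.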


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.402749
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTCA TCAACGGCAC TTCTGGAAAC AATATTCTCA TCGGCACGGG TTCGGACGAC 
AGCCTCAATG GTTTAGCCGG CGACGACCTG CTTCAGGGAC TTGGAGGCGC AGACATCATC
AATGGCGGCG CGGGTGTGGA CACGGCCGAC TATCGCGAGA AGGCGGCCCC AATCGTAGTC
ACATTGACCG GTGCGACGGC GGCGACCGTC TTTATCAATG GCGTCGCGGA GGACACGCTT
GCCAACGTAG AGAACGTTTA CGGGGGCTCC GGCGACGACT TTATCACCGG CGACGCGCTG
AACAATCTCT TCCGGGGAGG CGGCGGAAAC GACGTGCTCG ACGGCGGCGG CGGGAATGAT
ACGGCCGACT ATACCGACAA GACAGCCTCC GTCGTGGTCA CGCTGGCCGG GGCGACACCT
GTGACAGTTT TCGTGAACGG CATCGCCGAG GACATGATCA GCAACTTCGA AAACGTCTAT
GGCGGTTCGG GCGACGACAT CCTAACGGGC GATGATCGGG ACAACATTCT TCGCGGTGAG
GTAGGAAACG ACATTCTCAA TGGCGGCATT GGCGCCGATT CCCTGTTCGG TGGCGAGGGC
AACGATACCG TCGATGGCGG CAACGGGGTC GACACCTTCG ACCTGCGCGA AAAGACATCC
TCCGTCCTCG TGCAGCTCAA TGGTGCGAAT GGAGCTACAG TCTTCGTCGG CGGCGTTGCG
GAGGATACGA TCCGCAACGT CGAGAACATC GTAGGTGGAT CGGCAGGCGA TACGCTTACC
GGCGATGCCG CCGCCAACAA GCTCTCGGGT GCGCGCGGCA ACGACTGGCT CATGGGCGGC
AGCGGTGCCG ACATACTGGA CGGCGGCGAG GACAGCGATA CGGCGGACTA CAGCGACAAG
CAGGCTGCCA TCGTCGTGGC GCTGAATGGC GGCAATCCCG TCACAGTGAC AGTCGGCGGC
CTTGCCGAGG ACTCGATCGC CAAGATCGAA AACATCGTCG GGGGTTCGGG AGACGATACG
CTCAGCGGCG ATGCCGCTGC CAACACGTTT CGCGGTGGTC TTGGCGCCGA CGCGCTTGAC
GGCGGCTCCG GCAGCGATAC CGCGGACTTC AGCGACAAGG CGCAATCCGT CGTGCTTGCC
CTCAACGGAG CGATCGATGC CGTTGCGACC GTCGGCGGAA CGGCGGAGGA CTCGGTCCGA
AACATCGAAA ACATCATCGG CGGTGCGGGA AACGATCAGC TCACGGGCGA TGCCGCCGCC
AATATGTTCC GTGGCGGCCT CGGCGCCGAT GTGCTGGATG GCGCTGCCGG CAGCGACACG
GCGGACTTCA GCGACAAGAC GTCATCGCTT GTCGTAACTC TGGCCGGCGC GAGCCCGGCA
ACGGTCCTCG TCGGCGGCAT CGCGGAGGAT ACGCTGCGCA ACATCGAGAA CATCATCGGC
GGTTCGGGCA ACGACGTGTT CGTTGGCGAC GGCTTGCAGA ACGTGTTCGA TGGCGGGGCG
GGTACCGACA CGGCCGACTA TTCGGCCTCG GCGAAGGCGA TCGCTGTAAC GCTCAACGGC
GCCATTGACG CGAGGGTGTT CATAGGCAAC GCGGCCGAGG ACACGTTGCG CAACGTCGAA
AGCATCACTG GAAGCGCCCT GGCGGATGTG ATCACCGGCG ATGCGCTGGC AAACAGCCTC
CTCGGAGGCG GCGGCGCCGA CCTGCTAAAG GGGGGCGGCG GCCAGGACGT TGTGGATGGC
GGCAGCGGAT CAGACACCAT AGACTTCGGC GACAAGACCG CTGCCGTCGT CCTGACCCTT
GCGGGTGTGG CTAATACGAC AGCCACTGTC GGCGGGGTGG CCGAGGACAC GGTCCGCAAT
ATCGAGAACA TCTTCGGAGG CGCCGGCGCC GACGTGCTGA CAGGCGACGG CAACAGCAAC
ACAATCCGCG GCGGGGCCGG GGCAGACGGC CTGGACGGCG GCGCGGGGGT CGATACGGCC
GATTATCGGG ACAAGGTGAC GTCGGTTGTC GTCACGCTCA GTGGCGCGAA TGCCGCGCTT
GTCAAGGTGG GCGGCCTTAA CGAAGATACG ATCCGCAACT TCGAAAACGT TGCAGGCGGT
TCGGCCGGCG ACACGCTTGT CGGCGACGAC CTGGCCAATG TGCTGCTTGG CAATGACGGT
GCCGATACGC TGAAAGGCGG CTTGGGCAGT GACGTGCTGG ATGGCGGCAA TGGCATCGAC
ACTGCGGATT ACCTCGAGAA AGCCGACGCG ATCTCCGTCA CGCTCAACGG GACCACGAAC
GCGACGGTCT TGGTCGGCGG AGTTGCGGAA GACGTCATTC GCGGTGTAGA AAACATATTG
GCGGGGTCCG GTGCCGACAC ACTTGTCGGC GATGGCGCGA GCAACACGTT TCGTGGAGCG
CTCGGCGCTG ACTTCATCGA TGGCGGGGCG GGGTCCGATA CGGCCGATTA TCGCGAGAAG
GCGGCGGCTG TCGATGTGAC CCTCTTCGGC GCCGGTGACA GCTTCGTCTT CGTCGGCGGG
GTCGCGGAAG ACACAATCCG AAACATTGAA AACGTATTCG GCGGCAAGGG CAACGATACG
CTGACGGGCG ACGACTCCGC CAACACCCTC AATGGCAATG ACGGCAAGGA TTTGCTTACC
GGCGGCGGCG GAGCGGACAT TCTTGACGGT GGGGCGGCCT CCGACACGGC GAACTACCGC
GACAAGAGTG CATCGGTTTC CGTCACCCTC AACGGTGCCG CCTCCACGGC TGTCATGGTC
GGCGGGGTGG CCGAGGATAC GGTCCGCAAT ATCGAGAACG TCTGGGGCGG GACCGGCAAT
GACAGCCTGA GCGGCGACGG CAATGCAAAT CTGCTGTCGG GCGGCGGCGG AAGCGACATG
CTGTCCGGCG GCGCGGGGGC GGATATTTTC CAGTTCGACT TCGCTTTGGG AGCGAGCAAT
GTCGACATGG TTCTGGATTT TACCGCCGGA GACCGGCTCT TTCTGTCGAA GAGCGTTTTC
ACCAGCCTGA GCGGCGGTTC GCTGGCAAGC AACCAGTTCT ACGCGGCGGC CGGCGCAACG
GAGGCGCAAG GCGTGAACCA AAGAGTCGTT TACGATACGA CGACGGGCGC GCTCTACTAT
GATGCTGACG GCAACCTTTC GGGGCATACG GCCGTGCAGT TTGCAGTCCT CTCTACACAG
CCGGGACTGA CTGCAGGAGA TTTCGTGCTC GTCGTGTGA
 
Protein sequence
MAVINGTSGN NILIGTGSDD SLNGLAGDDL LQGLGGADII NGGAGVDTAD YREKAAPIVV 
TLTGATAATV FINGVAEDTL ANVENVYGGS GDDFITGDAL NNLFRGGGGN DVLDGGGGND
TADYTDKTAS VVVTLAGATP VTVFVNGIAE DMISNFENVY GGSGDDILTG DDRDNILRGE
VGNDILNGGI GADSLFGGEG NDTVDGGNGV DTFDLREKTS SVLVQLNGAN GATVFVGGVA
EDTIRNVENI VGGSAGDTLT GDAAANKLSG ARGNDWLMGG SGADILDGGE DSDTADYSDK
QAAIVVALNG GNPVTVTVGG LAEDSIAKIE NIVGGSGDDT LSGDAAANTF RGGLGADALD
GGSGSDTADF SDKAQSVVLA LNGAIDAVAT VGGTAEDSVR NIENIIGGAG NDQLTGDAAA
NMFRGGLGAD VLDGAAGSDT ADFSDKTSSL VVTLAGASPA TVLVGGIAED TLRNIENIIG
GSGNDVFVGD GLQNVFDGGA GTDTADYSAS AKAIAVTLNG AIDARVFIGN AAEDTLRNVE
SITGSALADV ITGDALANSL LGGGGADLLK GGGGQDVVDG GSGSDTIDFG DKTAAVVLTL
AGVANTTATV GGVAEDTVRN IENIFGGAGA DVLTGDGNSN TIRGGAGADG LDGGAGVDTA
DYRDKVTSVV VTLSGANAAL VKVGGLNEDT IRNFENVAGG SAGDTLVGDD LANVLLGNDG
ADTLKGGLGS DVLDGGNGID TADYLEKADA ISVTLNGTTN ATVLVGGVAE DVIRGVENIL
AGSGADTLVG DGASNTFRGA LGADFIDGGA GSDTADYREK AAAVDVTLFG AGDSFVFVGG
VAEDTIRNIE NVFGGKGNDT LTGDDSANTL NGNDGKDLLT GGGGADILDG GAASDTANYR
DKSASVSVTL NGAASTAVMV GGVAEDTVRN IENVWGGTGN DSLSGDGNAN LLSGGGGSDM
LSGGAGADIF QFDFALGASN VDMVLDFTAG DRLFLSKSVF TSLSGGSLAS NQFYAAAGAT
EAQGVNQRVV YDTTTGALYY DADGNLSGHT AVQFAVLSTQ PGLTAGDFVL VV
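
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that blocking, compute GC content, and translate the coding sequence so it can be compared with the listed protein. It is an illustration under stated assumptions rather than the database's own procedure: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon assignments are those shared by the standard code and translation table 11. Table 11 differs mainly in its set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (third base varies fastest).
# These assignments are shared by the standard code and translation table 11.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq_text.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in an unblocked DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above, and its final nine bases.
first_blocks = clean("ATGGCAGTCA TCAACGGCAC TTCTGGAAAC")
last_codons = clean("GTCGTGTGA")

print(translate(first_blocks))  # MAVINGTSGN, the first protein block
print(translate(last_codons))   # VV, the last two residues before the stop
```

Passing the full gene sequence through `clean` and then `gc_content` or `translate` gives values that can be compared with the GC content field and the protein sequence listed in this record.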