Gene Smed_4306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4306 
Symbol 
ID5319310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp801139 
End bp803187 
Gene Length2049 bp 
Protein Length682 aa 
Translation table11 
GC content63% 
IMG OID640776111 
Producthemolysin-type calcium-binding region 
Protein accessionYP_001313044 
Protein GI150376448 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0651981 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCAA TCCTGGGTTC GACCGGCTAC GATATACTTC AAGGTACGGA TGAGGCAGAC 
CATATCTGGG GACTGCACGG TGGCGATACG ATCGACGCGG GCGGCGGCGA CGATTTCGTC
GATGGCGGCG ATTTAAGCGA CGTGTTGACG AGTTCGAGCG GCTATGACCG GCTGCACGGC
GGCGCCGGCG ATGACCAGAT CGTCCTGACC GGTACCGGCG GGGCCGTCAC CGGAGGCCTC
GGCTTTGACA GGTTGACCGT GGATATCTCG ATGCTGTCCG ATCAGCTGGT CTTCAACGGT
GTAAGCGGCC ACGGCATCAT CGGATACGGG AGCGCCGAGG AACGGCACAT CTTTTTCCAT
GACGTCGAAT GGCTCAGCAT CGTTTCAGGA AGCGGCAACG ATCGCATCGT GGGGACGGCG
GGTCACGACA GGATCTCGAC CGGAGTCGGC ATCGACGTCG TCGTGGCCGG GGCGGGAAAC
GACGTCATCA CGAATACGGG CGGCCAGGAC CGGCTCGAGG GCGGCGACGG GAACGATCGC
TTCGTGCTCG TGGGCACCGG CAGCACCGTC TTCGGCGGTG CTGGAGACGA TACATTGACA
CTCGATCTGG CTGCGACCTT GGGCGATATC GTGTTCAACG CGAACACCGG CCACGCGATT
ATCGCAAGCC GAACGCCTGA CGAGCGCCAC GTCTTCGCCC AGCATAGTGA GATCGAGCGG
ATTGAAGTGA TCACTGGCAG CGGAAACGAC CGGGTCCAGG GGACCACCTC CGGGGACGTC
ATCAAGACGG ACGCCGGAGA CGACGTGGTC GATGGCGGAG CGGGCGACGA TTCGATCACC
GACGGGAGCG GCGCCAACCA CCTTTCAGGC GGCGACGGCT CCGACCTCAT TACGACCACG
CTTTATTCCG CTTCGGTCGA CGGGGGAGCG GGCAGGGATT CGGTGCGAAT CGAGGAACGC
TCACGTATGA GCGACGTCGC CATTGACTTC GTTTCGGGGA CGGCGTCGAC CCGCACGGTC
TTTCGCAACA TTGAAACGGC CAGTCTCGCA CTCGGAGCCG GCAACGACGC GGTGATCACG
ACGGATTTCC TTGCCATCTT CGTAGATGCC GGCGCGGGCG ACGACCGCCT CGTAGGCGGA
GCCCGGAGCG ATGCGTTCCA CGGCGAGGAC GTCAACGACC GCATCGATAG CGGCGCGGGC
AACGACACGA TCACGACCGG CCTTGGCGAT GACCTTGCTT CCGGCGGCGA CGGCAATGAC
TCGCTCGCCA ATGACGGCGG CAAGGATATC CTGGATGGCG GGGCGGGCGA CGACTGGATT
GCCGACGCCA TTCCCGGCAG CGGCAGCCTG GGCGATGGAT CGGTCATGCT CGGCGGCGAC
GGAAATGATG CATTCAGGGC TTCTTTCACG GGAGAGGTGG ATGGAGGGGA TGGGCGCGAC
TTGCTCCAGT TGCGCCTCGG ATCGCTCTCG GCTGCAATCG ACTTTGATGC GGCGCAGGGA
GCGATCCAGA CCGGATTGAC CTTCGCCAAT ATCGAGGACT TCACGGTAAC GACCGGAGTT
GCGAACGATG TGCTCCGGGG TGGAGACGGT AACGATGAGT TCGATTCTCT GAGCGGCAAC
GACCTTCTCG AAGGGCACGG CGGAAACGAC ATCCTGCGCG GATATTCGGA TTGGGACCGG
CTGTTCGGCG GCGATGGCGA CGACCGGCTC GAGGGCGGCC CGCATGACGA CCTTTTGAGC
GGCGGCAACG GCACCGATAC GCTGATCGGC GGCACCGGCG CGGACACGTT TCTCTGGTCG
GACGAGGTCG CCGGCCAGAG CGGCATCGAT CACATCGTAG ACTTGGACTT TGGCGACGTC
ATAGCCTTTT CCGCTGCCGC TTGTGAGAGG ACCGGCATTC ACGACCATGC GGGCTTCGTT
GCCGCTGCAA CCAACACGGC CGACGGTGTC TATGTCGCCT TCAACGGCTC CGCTGCCGAC
GGCATCCTGA TCGAAAATGC TTTGATTGCA ACTCTGACCG CGGACGACCT GGTTTTCGGT
TTCGCTTGA
 
Protein sequence
MAAILGSTGY DILQGTDEAD HIWGLHGGDT IDAGGGDDFV DGGDLSDVLT SSSGYDRLHG 
GAGDDQIVLT GTGGAVTGGL GFDRLTVDIS MLSDQLVFNG VSGHGIIGYG SAEERHIFFH
DVEWLSIVSG SGNDRIVGTA GHDRISTGVG IDVVVAGAGN DVITNTGGQD RLEGGDGNDR
FVLVGTGSTV FGGAGDDTLT LDLAATLGDI VFNANTGHAI IASRTPDERH VFAQHSEIER
IEVITGSGND RVQGTTSGDV IKTDAGDDVV DGGAGDDSIT DGSGANHLSG GDGSDLITTT
LYSASVDGGA GRDSVRIEER SRMSDVAIDF VSGTASTRTV FRNIETASLA LGAGNDAVIT
TDFLAIFVDA GAGDDRLVGG ARSDAFHGED VNDRIDSGAG NDTITTGLGD DLASGGDGND
SLANDGGKDI LDGGAGDDWI ADAIPGSGSL GDGSVMLGGD GNDAFRASFT GEVDGGDGRD
LLQLRLGSLS AAIDFDAAQG AIQTGLTFAN IEDFTVTTGV ANDVLRGGDG NDEFDSLSGN
DLLEGHGGND ILRGYSDWDR LFGGDGDDRL EGGPHDDLLS GGNGTDTLIG GTGADTFLWS
DEVAGQSGID HIVDLDFGDV IAFSAAACER TGIHDHAGFV AAATNTADGV YVAFNGSAAD
GILIENALIA TLTADDLVFG FA