Gene Smed_5000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5000 
Symbol 
ID5318649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1515366 
End bp1517090 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content55% 
IMG OID640776782 
Producthemolysin-type calcium-binding region 
Protein accessionYP_001313714 
Protein GI150377118 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00410599 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGCTT ATAGTTGCTC GAAAACGCAT GCATTTTTTA TTCATTTTCA CCATTTTTTT 
TGGCCGGAAA CGTCCCGGCT CCCGGAGGAA ATCATGGCTA CAATTACGTA TCATTTCCCA
GCAGGACTTT ACCCAAATCA ATACAACCCA CCGTTGAGTG GCGTCAGCGT TAACCCGTCC
TTTGGCGAGC TCGTGGACAT GAGCACGGGG AGTCGCTCAA CCACTACCAG CACTTCGGTG
CTTTATCGGC TCGACAACGG TCTCAAGATG AAGCTCGTGG GCACGGGGTT CAGTTTTGAC
GCGAGTGGCG ATGCCGTCGG AGGAACAATC ACATCGATTG AGGTTCTTCT GAACAACGGC
ACGGGATTGA TACAAACAAT TAGCGGGCTG AATATTTCGC TTGAACTTTT CCAAGACGCA
TCGGCCGCGT TCGATAGTTC TGGACTCGAG AGTTGGCTCG CGAGCGGCAA CGATACGATC
AACGGCTCCC TCGGTAATGA TGAAATCTGG GGGCGCCTTG GAAACGATGT CCTCAACGGC
AATGCGGGTA ATGACCTTGT CACAGGCGGC GGCGGTGCAG ACACCTACGA TGGAGGAGCC
GGTTTTGATA TTCTGAACTT CCAGGACGCT TATGACTCCC CGACAGCGAT AAGAGGCATC
AACCTCAACG CCACGGCCGG GACCGTCGTT GATCAGTTCG GTTTTTCGGA AACCTTCCAG
AATTTCGAGG AATTCAGAGG AACCCAGTTC TCGGACTCGA TGATGGGGTC CTCGGTCGAT
GAGACGTTTT TTGGGTTTGG CGGACGCGAC AATATCAATG GTGGTGCCGG CATCGACACC
GTTCGATACG ACCGCGATTT CCAGCGAGGC GCCACAAAGG GTGTGAGCAT CGATCTCAGT
ACCGGTATTG CGACCGACGG CTTTGGATCT CGAGATACTC TCACCAGCAT CGAGAATATC
CGAGCAACTG AATTCAAGGA TACAATCGTC GGCAGTTCGG CAGCCAACTT CCTCCGCACG
TTCGCGGGCA ATGATTCAAT CAATGGCGGT GGCGGCTCAG ACAACATGCG CGGCGGCATA
GGAAACGATA CGTATTTCGT AGACAACACC GGCGATATCG TTGACGAAGC CGCCGACTCG
GGCGCGGGTA CGGACACCGT TCAATCCACA GTCTCGTTCA ATCTCGGCAA CACCGCGGTC
GCCAAAGGTG GCGTGGAGAA CCTTGTGCTC CTTGGCACCG GGAACATAAA CGGCACCGGC
AATGCCTTGA ACAACACCCT GAGTGGCAAT ACCGGCAACA ACACTTTCAG CGGCTTTGCC
GGCAACGATA CAATTGACGG GGGCCTCGGC AACGACCTGA TCAATGGGGG CCTCGGCAAT
GACAGCCTGA CCGGCGGAGC CGGCTTAGAC ACATTTAATT TCAGCAACGC TCTGGATGCC
ACGAACAACG TCGACACGAT CAACGGGTTC GTTGTCGCGG ATGATACGAT CCGATTGGAA
AATGCGGTCT TTACGGGGAT CGTCGGTACC GGCACGTTGA CTGCGGCGCA GTTCGTAACG
AATACCACTG GCCTAGCTGC TGACGCCGAC GACCGAATTA TTTATGATAG CGACACCGGA
AGGCTGCTCT ACGACAGCGA TGGCGACGGA GCGGGCGGTT CTGTTCATTT CGCCACGGTC
GGCACGAACC TCGGCATGAC TGCGAGCGAT TTCTTTGTTG TGTAG
 
Protein sequence
MAAYSCSKTH AFFIHFHHFF WPETSRLPEE IMATITYHFP AGLYPNQYNP PLSGVSVNPS 
FGELVDMSTG SRSTTTSTSV LYRLDNGLKM KLVGTGFSFD ASGDAVGGTI TSIEVLLNNG
TGLIQTISGL NISLELFQDA SAAFDSSGLE SWLASGNDTI NGSLGNDEIW GRLGNDVLNG
NAGNDLVTGG GGADTYDGGA GFDILNFQDA YDSPTAIRGI NLNATAGTVV DQFGFSETFQ
NFEEFRGTQF SDSMMGSSVD ETFFGFGGRD NINGGAGIDT VRYDRDFQRG ATKGVSIDLS
TGIATDGFGS RDTLTSIENI RATEFKDTIV GSSAANFLRT FAGNDSINGG GGSDNMRGGI
GNDTYFVDNT GDIVDEAADS GAGTDTVQST VSFNLGNTAV AKGGVENLVL LGTGNINGTG
NALNNTLSGN TGNNTFSGFA GNDTIDGGLG NDLINGGLGN DSLTGGAGLD TFNFSNALDA
TNNVDTINGF VVADDTIRLE NAVFTGIVGT GTLTAAQFVT NTTGLAADAD DRIIYDSDTG
RLLYDSDGDG AGGSVHFATV GTNLGMTASD FFVV