Gene Smed_1796 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1796 
Symbol 
ID5322654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1878987 
End bp1880600 
Gene Length1614 bp 
Protein Length537 aa 
Translation table11 
GC content60% 
IMG OID640790734 
Producthemolysin-type calcium-binding region 
Protein accessionYP_001327466 
Protein GI150396999 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.47796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00183267 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCACAG TAGTGGGCAC GGCCGGAGTC GATATACTGG TTGGAACAGA GGAGAAGGAC 
CGCATTTGGG GCCTTGCAGG CGCGGATGAT ATCGAGGCCG GTGGCGGCGA CGATCTCGTC
GACGGTGGCG CCGGGGACGA TCGTCTGGCA AGTACGAGCG GATACGACCG CCTCGACGGC
GGCGAGGGGG ATGATCAAAT AACCCTGATC GGTACCGGCG GCGCGGTTAC GGGCGGGGCC
GGTACCGATA CGTTGGTGAT CGACCTTGCG AACGTCAGCA CTTCCGTCCG GTTCAGCGGC
GAGCACGGTC ACGGCATCAT CGGCTACAAC ACGTCAGACT GGGAGCATAT ATTCTTCAAC
CGCATCGAAA GGCTAGTGCT GACCACCGGC AGCGGCAATG ACCGAATATT CGGCGCCTCG
ACTGATGATA TCATCTCGAC CGGTGGTGGT AACGACATTG TCGGGCCGTA TGGCACCGAC
GGCGATGACG GCACAAGCAT GCAGGGCGAC GACACGATCA ACACTGGCAC CGGGGGCGAC
ATCATCAATG ACACTGTTGG CTCGAACCGG ATTTTTGCCG GTGACCATGA CGACATCATC
TTTACGACGC TTTCTTCCAC GGTGATCGAC GGCGGGACCG GCTGGGACAG ACTCACCCTG
CTCGATGAAG AAAGAACCGG GGACGTGACC CTCGATTTTG CGCGGGGATT CGCCTCGACG
GGTACTCTGA TCAACGGCAT CGAGGTCGCC AACATCAATC TCGGTAATGG CAGCGACACG
CTGATCGCCG GCAATCTCCT TTCGCTCAGC GCTCATCTGG GCGAAGGTGA CAATTACGCA
GCCGGCAGCA GCGGCAGGGA TTACATCGCT TCCGGGAGCG GAGACGATGC CCTCTATGGC
GACAGTGGCG ACGACATCCT GATCAGCAAC GGCGGTAATG ACGTGCTCGT CGGGGGCGAT
GGGAACGACG AGATTCATGA CTCGGGATCG CGCTTCGATG ACGGTGACAC GTATATTGAC
GGCGGCTCCG GCGACGATCT CATCCAGATG GTTGCGCCCT CCGGCTTCAT CGATGGCGGC
GCCGGCAGCG ATACATTGCG CGTCGTCGGC CCTCTGACGG GCACACATTT CGATGCCTCG
ACGGGCATGC TGGGCACATC TCTTGTCTTT ACCAATATCG AGAGGTTCGA ACTTTCCGGC
GACTCAGGGG ACGATACAAT CCGCACGCTT GGAGGCGACG ATCAGCTCGC CGGCAATGAG
GGTAACGACC GGCTCGACGG AGGAGCCGGC AAGGATGTGC TCTGGGGCGG GGGCGGTGAC
GATGTGATGA CGGGCGGCGC CGGTGCCGAT ACCTACCTCT GGTCTTCGGA CACGTTTTCG
TTCTCGGGTG TCGACCGCAT TACCGATTTC GATTTCGACG GCGGGGATGT GCTGCGCTTT
ATCGGCAATG CATCGGATTC GACACGAATC GAGAGCTTCG CCGATCTGGT TGCTGCTGCG
ACCGAGACGG ACGACGGGCT CTATATCGCC TTCAACGGGT CCGACAATTT CGGACTGTTC
CTGGACAACG TTGCGCTTCA AGACCTTTCC GCGGACGATA TCGTTTTTGT CTGA
 
Protein sequence
MTTVVGTAGV DILVGTEEKD RIWGLAGADD IEAGGGDDLV DGGAGDDRLA STSGYDRLDG 
GEGDDQITLI GTGGAVTGGA GTDTLVIDLA NVSTSVRFSG EHGHGIIGYN TSDWEHIFFN
RIERLVLTTG SGNDRIFGAS TDDIISTGGG NDIVGPYGTD GDDGTSMQGD DTINTGTGGD
IINDTVGSNR IFAGDHDDII FTTLSSTVID GGTGWDRLTL LDEERTGDVT LDFARGFAST
GTLINGIEVA NINLGNGSDT LIAGNLLSLS AHLGEGDNYA AGSSGRDYIA SGSGDDALYG
DSGDDILISN GGNDVLVGGD GNDEIHDSGS RFDDGDTYID GGSGDDLIQM VAPSGFIDGG
AGSDTLRVVG PLTGTHFDAS TGMLGTSLVF TNIERFELSG DSGDDTIRTL GGDDQLAGNE
GNDRLDGGAG KDVLWGGGGD DVMTGGAGAD TYLWSSDTFS FSGVDRITDF DFDGGDVLRF
IGNASDSTRI ESFADLVAAA TETDDGLYIA FNGSDNFGLF LDNVALQDLS ADDIVFV