Gene Smed_5172 details

Gene Information

Locus tag: Smed_5172
Symbol:
ID: 5319474
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 125869
End bp: 127419
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 60%
IMG OID: 640776950
Product: hemolysin-type calcium-binding region
Protein accession: YP_001313882
Protein GI: 150377287
COG category:
COG ID:
TIGRFAM ID:
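
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(125869, 127419)
print(length, protein_length(length))  # prints "1551 516"
```

Both values match the Gene Length and Protein Length fields of this record.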


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.484438
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.743168
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAGT TTATTGGAAC GCGTGCGGGC GATATTCTGA GTGGTACAAG CGAAGGCGAC 
CGGATCTGGA GTCTCGACGG CAATGATGTC GTGGATGGCG GCAAAGGCGA TGACTTCGTC
GATGGTAGGG CCGGCGACGA TGCGCTGACA AGTTCAAGCG GCTTTGACGA GTTTACCGGC
GGCGAAGGCA ATGATCGGCT GTCGTTCATC GGCGTCGGCG GCGCGGCACG GGGTGGCACG
GGGGTCGACA CGCTTGTGGG CGATTACGCC GCAATAACCG ATGCCTTCCT GTTCGATGGC
ATGAATGGCC ACGCAGCCTT CGGCGATCTT TCGGTTAAAG CAAACCACCT TTATTTCCTC
GATATCGAGC GACTCAATCT GACGACCGGG ATCGGCGATG ACAGAATCAT TGCCACGGGC
TTCAGCTTCG TCAACATCCA TACCGGTGCG GGTGACGACC GTATCGAAAC CGGCATCGGC
GATGACCAGA TCTATGCCGG CGACGGTCGG GATCTACTGT TTGGCGGCGC AGGCGACGAT
TTTATTAGCG GCGGTCAGGG CGACGACTAC GTTAACGGCG GCAACGACGA CGACAGGCTC
GAGGGGGAGG ACGGCAATGA CAGTCTTGTG GGCGGTCGTG GCAACGACCG GCTCGATGGC
GGCAGCGGCG ATGACGACGT CAATGGCGGG GACGGCAACG ACTCTCTGAC CGGAGGCCTT
GGATCGGATA CGGTTACGGG CGGTGCCGGG GATGATTACC TGAGCAACGG TTTTGCCGCC
GGAGACATAC TGCTCGGCGG CGACGGCAAT GACACTCTCT CGGCGGGTGG GGAAGACACC
GCCTATGGCG GATGGAGTGA GCTCTATGGC GGCGCCGGCG ATGACAGACT TCACGTCTAT
ACGGATGGTA TAATCGGCGC ATTGGACGGC GGCGATGGTT TCGACAGAGC GAGCATCGCA
CTCGATGATG TGCCCGCCGG CTTCGTTCTC GATGCATCGC GTTTTGGCTC GATCGAGGAG
TTCAACATCA CCGTTAAATC GGCCTATCTT GGCGTCCACC TCTCCGGCGG GAATGGCAAC
GATAGGCTCT TCTGTTTCGA CACCTACAGG GAAGGCCCCA GCGGAAACGA TGTTTTGAAC
GGGCGCAGCG GCGATGACAT ACTCGTCGGC GGCAGTGGAG CGGATAGTCT GCTTGGCGGG
GATGGCAACG ATTCGCTGAG TGGCGAATAT CACTCGGACA GGCTGCTCGG CGGTGCTGGC
GCCGATCTTT TGACAGGTGG ATCCGACGCC GACACTTTCA TTTGGGACGA AGCCTCTGTC
CGCAATGACA GCAGCATCGA TCGGATCATC GACTTTCGCA GCGGGGACGG TGATGTGCTT
CTATTCCGCG GCTTTGGCGG TACCGAGTTT CGCGACTTCG AAAGCTTCCT CGCCGCCTCC
CGTGATACGC CCGAAGGGGT TTACGTCAGT TTCGATGGCG ACGCCCACGG GATATTGATC
CAGAATACCC TGCTCGCTGG TTTTTCGGCC GCAGACGTCC TCTTCGCCTG A
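
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the string passed in the example is only the first line of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only (assumption: used here for illustration).
first_line = "ATGGCAAAGT TTATTGGAAC GCGTGCGGGC GATATTCTGA GTGGTACAAG CGAAGGCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the complete 1551 bp sequence, the same function should give a value close to the 60% reported in the record.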
 
Protein sequence
MAKFIGTRAG DILSGTSEGD RIWSLDGNDV VDGGKGDDFV DGRAGDDALT SSSGFDEFTG 
GEGNDRLSFI GVGGAARGGT GVDTLVGDYA AITDAFLFDG MNGHAAFGDL SVKANHLYFL
DIERLNLTTG IGDDRIIATG FSFVNIHTGA GDDRIETGIG DDQIYAGDGR DLLFGGAGDD
FISGGQGDDY VNGGNDDDRL EGEDGNDSLV GGRGNDRLDG GSGDDDVNGG DGNDSLTGGL
GSDTVTGGAG DDYLSNGFAA GDILLGGDGN DTLSAGGEDT AYGGWSELYG GAGDDRLHVY
TDGIIGALDG GDGFDRASIA LDDVPAGFVL DASRFGSIEE FNITVKSAYL GVHLSGGNGN
DRLFCFDTYR EGPSGNDVLN GRSGDDILVG GSGADSLLGG DGNDSLSGEY HSDRLLGGAG
ADLLTGGSDA DTFIWDEASV RNDSSIDRII DFRSGDGDVL LFRGFGGTEF RDFESFLAAS
RDTPEGVYVS FDGDAHGILI QNTLLAGFSA ADVLFA
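
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard table (it differs in the permitted start codons), so that mapping is used here. The `translate` function is a hypothetical helper, not a library call, and the example input is only the first 20 bases of the gene.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):  # trailing partial codon is ignored
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGGCAAAGT TTATTGGAAC"))  # prints "MAKFIG"
```

The output matches the first six residues of the protein sequence above.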