Gene Smed_5287 details


Gene Information

Locus tag: Smed_5287
Symbol:
ID: 5319589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 249324
End bp: 251270
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 59%
IMG OID: 640777064
Product: serralysin
Protein accession: YP_001313996
Protein GI: 150377401
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID:
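
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 249324
END_BP = 251270
GENE_LENGTH_BP = 1947
PROTEIN_LENGTH_AA = 648


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 251270 - 249324 + 1 = 1947 bp, and 1947 / 3 = 649 codons, one of which is the stop codon, leaving 648 residues.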


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCGA CGACGACCTA TGCCCCGACT GGTAATGCCT ATATCGACGG GCTCCTTGGC 
GACTGGAAAT GGGCGGTCAA GGACTTCACC TTCAGCTTCC CAACCAGCGC TTCATTCTAC
GGCTCCGGTT ATGGCAATGG CGAACCGCAA AAAGGTTTTG CCGCGCTGAA CGCCGCACAG
CAGACTACCG CGCGAGCTGT CTTCGATCAG TTCTCCTCCG TTGCCAAAGT ATCATTCACC
GAGATTGCAG AGAGCGCGAC CAAGCACGCC GATGTCCGCC TTGCCTCGTC CGACGCTCCG
AGCACCGCAT GGGCGTATTT TCCGTCGACT GCTGCAGAAG GCGGGGACGC GTGGTTCAAC
AGCTCGTCCG GCTATTACAG CCGCCCGATG AAAGGAAACT ACGCCTATCT GACATTCCTC
CACGAGATCG GCCACGCGCT TGGGCTGGAA CACGCCCATG AGGGCAACGT CATGCCTGCG
AACCGCGACT CGATGGAATA CACGGTCATG AGTTATCGCT CCTATGTCGG AGCCTCGACC
ACGACCGGCT ATATCAACGA GACCTGGGGA TATGCGCAAT CGCTGATGAT GTATGATATC
GCTGCCTTGC AGCATATTTA CGGCGCGGAT TTCACCACCC AGAGCGGGAA CACGACTTAC
CGATGGAGCC CGACGTCGGG CGAGATGTTC ATCAATGGCG TGGGCCAGAA CGCACCCGGC
GGCAACAAGA TCCTGCTTAC GGTCTGGGAT GGCGGCGGAA CCGACACATA CGACTTCTCC
AACTACACAA CTGCACTGAA GGTCGATCTC CGTCCGGGCG AATGGACGAC CACCTCGGCG
GTCCAGTTGG CGAAGCTGCG CTACGACGAT TCGAAAGTGG CGATCGGCAA TATCGCCAAC
GCCCTGCAAT ATCAGAGTGA TGCGCGCTCT CTTATCGAGA ATGCAAAGGG TGGCGCGGGC
AATGACGTTA TCACCGGAAA CGCTGCTGCA AATGTTCTTT GGGGTAACGG CGGCAATGAC
AGGCTGATCG GCGCCAACGG AAACGACAAC CTTGTTGGCG GTGCGGGCGC GGACCGGCTC
GAGGGTGGCA ACGGCAACGA TCTGGCGAAC TACTACAACG CCGCGGCTGG CGTCGTTGCC
GACATTTATT CTCCCGGTTC CAACAGAGGG GAAGCGGCGG GAGACACCTA TGTGTCCGTC
GAGCGGCTTT ACGGTTCCGC CTTCGGCGAT ACTCTTGCCG GAGATAGGTT CGCAAACCTT
CTGAACGGGC TAGCGGGTAA TGACGTGCTT CACGGGCGTG CCGGCAACGA CACTCTCATC
GGCGGCGACG GAAACGACAG TCTTGTCGGC GGTGCCGGCG CCGACCGGCT CGACGGCGGC
AATGGCGTCG ATCTGGCGAA TTACTACAAT GCCGCCTTGG GGCTAGTCGC TGATCTCTAT
TCCCCTGTCT CGAACACCGG GGAAGCGGCC GGTGACACCT ATCTGTCCGT CGAGCGGCTC
TATGGTTCAG CTTTCAACGA CAGTTTGCGC GGAGACAATA TCGCAAATCT TCTGAACGGG
CTTGCCGGTA ACGATGTGCT CAACGGACGC GGTGGCAACG ACACCCTCAT CGGCGGGGAA
GGCGCCGACC GGCTGATTGG CGGTGGCGGC TCGGACATGT TCGTATTTCA GACAGCGACG
CAATCGCGAC CGGCTGCGAT GGACGTCATC GATGATTTTG CGCGAGGCAT TGATCGGATC
GATTTGCGAT CGATCGATGC CAGCAGCAAC CTCAGCGGGG ATCAGGCGTT TCTTTTTATC
GGCGGCAACG GGCTCCACGG AAAATCAGGT GAACTTAACT TCAGGAGTGG GATAGTCTCG
GGTGATGTCA ATGGCGATGG CCTTGCAGAT TTTCAGATCA AGGTCATGAA TCTGTCGGCG
CTTTCCGCGA GCGACTTCTT CGTCTGA
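
The gene sequence above is laid out in space-separated blocks of 10 bases. As a rough sketch, the blocks can be joined and the GC fraction computed as shown below; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used here as sample input.
first_line = "ATGCCTTCGA CGACGACCTA TGCCCCGACT GGTAATGCCT ATATCGACGG GCTCCTTGGC"
seq = parse_blocked_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing all lines of the gene sequence to `parse_blocked_sequence` gives a value that can be compared with the GC content field reported for this gene.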
 
Protein sequence
MPSTTTYAPT GNAYIDGLLG DWKWAVKDFT FSFPTSASFY GSGYGNGEPQ KGFAALNAAQ 
QTTARAVFDQ FSSVAKVSFT EIAESATKHA DVRLASSDAP STAWAYFPST AAEGGDAWFN
SSSGYYSRPM KGNYAYLTFL HEIGHALGLE HAHEGNVMPA NRDSMEYTVM SYRSYVGAST
TTGYINETWG YAQSLMMYDI AALQHIYGAD FTTQSGNTTY RWSPTSGEMF INGVGQNAPG
GNKILLTVWD GGGTDTYDFS NYTTALKVDL RPGEWTTTSA VQLAKLRYDD SKVAIGNIAN
ALQYQSDARS LIENAKGGAG NDVITGNAAA NVLWGNGGND RLIGANGNDN LVGGAGADRL
EGGNGNDLAN YYNAAAGVVA DIYSPGSNRG EAAGDTYVSV ERLYGSAFGD TLAGDRFANL
LNGLAGNDVL HGRAGNDTLI GGDGNDSLVG GAGADRLDGG NGVDLANYYN AALGLVADLY
SPVSNTGEAA GDTYLSVERL YGSAFNDSLR GDNIANLLNG LAGNDVLNGR GGNDTLIGGE
GADRLIGGGG SDMFVFQTAT QSRPAAMDVI DDFARGIDRI DLRSIDASSN LSGDQAFLFI
GGNGLHGKSG ELNFRSGIVS GDVNGDGLAD FQIKVMNLSA LSASDFFV
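
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the alternative start codons it permits, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, then its final five codons.
print(translate("ATGCCTTCGA CGACGACCTA TGCCCCGACT"))
print(translate("GACTTCTTCGTCTGA"))
```

The first call covers the first 10 codons of the gene and the second covers its last 5, so the results can be compared with the start and end of the protein sequence above.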